Hence, the ratio of the number $n_1$ per unit volume at height $z_1$ to the
number $n_2$ per unit volume at height $z_2$ is given by

4.6 Phase Space and the Maxwell-Boltzmann Distribution


In the preceding section we derived an important result involving
potential energy in an isothermal atmosphere. We now seek more general
relations involving kinetic as well as potential energy. Following
Maxwell, in Sec. 4.4 we began our discussion of gases in terms of a velocity
space. However, there are advantages in changing to a momentum
space in anticipation of quantum mechanics. This can be done in the
nonrelativistic case by replacing v by p/m, where p is the momentum.
(Note that up to this point in this chapter p has stood for pressure.)
Classically the state of a system of noninteracting particles is
specified at any time when the position $(x, y, z)$ and the momentum
$(p_x, p_y, p_z)$ of each particle are known. If we introduce the concept of a
six-dimensional phase space, we can locate each particle in terms of six
coordinates $(x, y, z, p_x, p_y, p_z)$. Thus, phase space consists of coordinate
space and momentum space taken together.
For an ideal isothermal gas, not subject to any external force field,
the probability that the phase point for any one particle falls into a
volume of phase space $dp_x\,dp_y\,dp_z\,dx\,dy\,dz$ is the product of the
probability that its momentum lies in the element $dp_x\,dp_y\,dp_z$ of momentum
space and the probability that it is in a volume $dx\,dy\,dz$ of coordinate
space. The latter is $(dx\,dy\,dz)/V$, where $V$ is the volume of the gas,
while the former is a function of $p^2$ multiplied by $dp_x\,dp_y\,dp_z$, which
comes from Eq. (4.7). Therefore,

$$\Phi(x,y,z,p_x,p_y,p_z)\,dp_x\,dp_y\,dp_z\,dx\,dy\,dz = \frac{e^{-(p_x^2+p_y^2+p_z^2)/2mkT}}{(2\pi mkT)^{3/2}\,V}\,dp_x\,dp_y\,dp_z\,dx\,dy\,dz \tag{4.16}$$
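
As a rough numerical check on Eq. (4.16), the sketch below assumes the normalized form in which each Cartesian momentum component carries a Gaussian factor $e^{-p^2/2mkT}/(2\pi mkT)^{1/2}$ and the position factor is $1/V$. The function names, the trapezoidal rule, and the cutoff of ten standard deviations are illustrative choices, not part of the text. If the assumed form is right, the total probability over all of phase space should come out very close to 1.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def momentum_density_1d(p, m, T):
    """One Cartesian factor of the assumed momentum distribution."""
    s2 = m * K_B * T  # variance of one momentum component
    return math.exp(-p * p / (2.0 * s2)) / math.sqrt(2.0 * math.pi * s2)


def integrate_1d(m, T, n_points=20001, width=10.0):
    """Trapezoidal integral of the 1D density over |p| <= width*sqrt(mkT)."""
    s = math.sqrt(m * K_B * T)
    a, b = -width * s, width * s
    h = (b - a) / (n_points - 1)
    total = 0.5 * (momentum_density_1d(a, m, T) + momentum_density_1d(b, m, T))
    for i in range(1, n_points - 1):
        total += momentum_density_1d(a + i * h, m, T)
    return total * h


def phase_space_normalization(m, T):
    """Total probability: the position factor integrates to V/V = 1, and
    the momentum factor is the cube of the one-dimensional integral."""
    return integrate_1d(m, T) ** 3


if __name__ == "__main__":
    # Illustrative values: a nitrogen molecule (about 4.65e-26 kg) at 300 K.
    print(phase_space_normalization(4.65e-26, 300.0))
```

The result is independent of the mass and temperature chosen, since both only rescale the momentum axis.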
The exponential term is equivalent to $e^{-K/kT}$, where $K = p^2/2m$ is the
kinetic energy. If we consider any two cells in phase space, the ratio
of the number of molecules per unit volume $n_1$ in cell 1 to the number
per unit volume $n_2$ in cell 2 is

$$\frac{n_1}{n_2} = \frac{e^{-K_1/kT}}{e^{-K_2/kT}} = e^{-(K_1-K_2)/kT} \tag{4.17}$$

a result to be compared with that of Eq. (4.15).


In each case the ratio is $e^{-\Delta E/kT}$, where $\Delta E$ is the
appropriate difference in energy.
In an isothermal atmosphere subject to a uniform gravitational field,
the probability that a given molecule is in a volume element $dx\,dy\,dz$ of
real space is no longer $(dx\,dy\,dz)/V$, since the particle is less likely to
be in a region of high potential energy than in one of low. The probability
is $e^{-P/kT}\,(dx\,dy\,dz)/CV$, where $P$ is the potential energy and $C$ is
the constant such that

$$\iiint_{\text{space}} e^{-P/kT}\,dx\,dy\,dz = CV$$

Under these circumstances the probability that any one molecule lies in a
volume of phase space $dp_x\,dp_y\,dp_z\,dx\,dy\,dz$ is

$$\Phi(x,y,z,p_x,p_y,p_z)\,dp_x\,dp_y\,dp_z\,dx\,dy\,dz = \frac{e^{-K/kT}\,dp_x\,dp_y\,dp_z}{(2\pi mkT)^{3/2}}\;\frac{e^{-P/kT}\,dx\,dy\,dz}{CV} = \frac{e^{-(K+P)/kT}}{(2\pi mkT)^{3/2}\,CV}\,dp_x\,dp_y\,dp_z\,dx\,dy\,dz \tag{4.18}$$
The sum $K + P$ is the total energy $E$ of the ideal gas molecule. If
Eq. (4.18) is multiplied by the total number of molecules $N$, the resulting
term is the probable number of molecules $dN$ in the infinitesimal
volume element $dp_x\,dp_y\,dp_z\,dx\,dy\,dz$ in phase space, viz.,

$$dN = N\,\frac{e^{-E/kT}}{(2\pi mkT)^{3/2}\,CV}\,dp_x\,dp_y\,dp_z\,dx\,dy\,dz \tag{4.19}$$
The ratio of the number of particles $n_1$ expected in a small volume
element $g_1 = \Delta p_x\,\Delta p_y\,\Delta p_z\,\Delta x\,\Delta y\,\Delta z$
in phase space to the number $n_2$ in another small volume element $g_2$ is

$$\frac{n_1}{n_2} = \frac{g_1\,e^{-E_1/kT}}{g_2\,e^{-E_2/kT}} \tag{4.20}$$

In Chap. 20 we shall derive the Maxwell-Boltzmann distribution function
from elementary statistical mechanics and show that Eq. (4.20) is
the ratio of two such functions. Although the Maxwell-Boltzmann relations
derived above have many uses, we shall find that they are not
generally applicable to elementary particles such as electrons.
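
The ratio of expected particle numbers in two phase-space cells can be sketched numerically. The example assumes the ratio takes the form $(g_1/g_2)\,e^{-(E_1-E_2)/kT}$, which follows from dividing Eq. (4.19) for one cell by the same expression for the other. The function name `occupation_ratio`, the molecular mass, and the use of $mgz$ for the potential energy in a uniform gravitational field are illustrative assumptions, not taken from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def occupation_ratio(e1, e2, temperature, g1=1.0, g2=1.0):
    """Expected n1/n2 for two phase-space cells of volumes g1 and g2
    holding molecules of total energy e1 and e2 (joules)."""
    return (g1 / g2) * math.exp(-(e1 - e2) / (K_B * temperature))


if __name__ == "__main__":
    # Assumed illustration: nitrogen molecules (about 4.65e-26 kg) at 300 K,
    # comparing equal cells 1000 m apart in height with the same kinetic
    # energy, so the energy difference is m*g*dz.
    m, g, dz, T = 4.65e-26, 9.81, 1000.0, 300.0
    ratio = occupation_ratio(m * g * dz, 0.0, T)
    print(f"n(upper)/n(lower) = {ratio:.3f}")  # roughly 0.9
```

Because only the energy difference enters, the same function covers the purely kinetic case and the isothermal-atmosphere case alike.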

Preface


Although the spectacular growth of physics in the past forty years 
has led to an extensive recasting of the material in Richtmyer’s first 
edition and to a myriad of additions, subsequent revisions have retained 
so much of his goals and spirit that excerpts from his original preface 
are still pertinent. The present edition reflects the labors of several 
authors but Richtmyer’s words, as appropriate for the sixth edition as 
they were for the first, speak for all. In 1928 he wrote: 

“The purpose of this book is, frankly, pedagogical. The author has 
attempted to present such a discussion of the origin, development, and 
present status of some of the more important concepts of physics, classical 
as well as modern, as will give to the student a correct perspective of the 
growth and present trend of physics as a whole. Such a perspective 
is a necessary basis (so the author, at least, believes) for a more intensive
study of any of the various subdivisions of the subject. While for the 
student whose interests are cultural, or who is to enter any of the pro- 
fessions directly or indirectly related to physics, such as engineering, 
chemistry, astronomy, or mathematics, an account of modern physics
which gives the origin of current theories is likely to be quite as interesting
and valuable as is a categorical statement of the theories themselves.
Indeed, in all branches of human knowledge the ‘why’ is an absolutely 
indispensable accompaniment to the ‘what.’ ‘Why?’ is the proverbial 
question of childhood. ‘Why?’ inquires the thoughtful (!) student in
classroom or lecture hall. ‘Why?’ demands the venerable scientist 
when listening to an exposition of views held by a colleague. Accordingly,
if this book seems to lay somewhat greater emphasis on matters
which are frequently regarded as historical, or, if here and there a classical 
experiment is described in greater detail than is customary, it is with a 
desire to recognize the importance of ‘why.’ 

“If one were to attempt to answer all of the ‘why’s’ raised by an 
intelligent auditor in answer to a categorical statement, such as, ‘The 
atom of oxygen is composed of eight electrons surrounding a nucleus containing
four alpha particles,’ one would have to expound a large part of
physical science from Copernicus to Rutherford and Bohr. To attempt
a statement of even the more important concepts, hypotheses, and laws 
of modern physics and of their origin and development would be an 
encyclopedic task which, at least in so far as concerns the aim of this 
book, would fall of its own weight. Accordingly, it has been necessary to 
select those parts of the subject which best serve our purpose. This 
selection, as well as the method of presentation, has been based upon the 
experience gained in giving the above-mentioned lectures to numerous 
groups of students. Many very important developments, particularly 
the more recent ones, either have been omitted entirely or have been 
given only a passing comment. And even in those parts of the subject 
which have been discussed, there has been no attempt to bring the dis- 
cussion strictly up to date. Indeed, with the present rapid growth of 
physics, it would be quite impossible for any book, even a special treatise, 
to be strictly up to date. Happily, for our purpose, up-to-date-ness is 
not an imperative requisite, since it is assumed that the student who 
wishes the latest knowledge will consult the current periodicals.

“In this connection, it should be emphasized that this book is an 
introduction to modern physical theories and is intended neither as a com- 
pendium of information nor as a critical account of any of the subjects 
discussed. In preparing the manuscript, the author has consulted freely 
the many very excellent texts which deal with the various special topics. 
Save for here and there a very minor item, or an occasional novelty in 
presentation, the book makes no claim to originality, except, perhaps, as 
regards the viewpoint from which some parts have been written. 

“It is assumed that the student is familiar with the elementary principles
of calculus, for no account of modern physics can dispense with at
least a limited amount of mathematical discussion, if for no other reason 
than to emphasize the fact that, in the progress of physics, theory and 
experiment have gone hand in hand. Partly, however, for the sake of 
brevity and partly in the attempt always to keep the underlying physical 
principles in the foreground, numerous ‘short cuts’ and simplifications, 
some of them perhaps rather questionable from a precise standpoint, 
have been introduced. These elisions should cause no confusion. . . .

“There is no more fascinating story than an account of the development
of physical science as a whole. (Any scientist would probably make
the same statement about his own science!) Such a study leads to certain 
broad generalizations which are of outstanding importance in evaluating 
current theories and concepts. For example, one finds that, taken by 
and large, the evolution of physics has been characterized by continuity. 
That is to say: With a few exceptions, the ideas, concepts, and laws of 
physics have evolved gradually; only here and there do we find outstanding
discontinuities. The discovery of photoelectricity, of x-rays, and of
radioactivity represent such discontinuities and are correctly designated
‘discoveries.’ But we must use ‘discover’ in a quite different sense 
when we say that J. J. Thomson ‘discovered’ the electron. The history 
of the electron goes back at least to Faraday. Thomson’s experiments 
are all the more brilliant because he succeeded in demonstrating, by
direct experiment, the existence of something, evidence concerning which 
had been previously indirect. Then, there are the respective roles played 
by qualitative and by quantitative work. Numerous important dis- 
coveries have been made ‘by investigating the next decimal place.’ 
Witness the discovery of argon. And ever since Kepler proved that the 
orbits of the planets are ellipses, relations expressible in quantitative form
have carried greater weight than those which could be stated only 
qualitatively. For example, Rumford’s experiments on the production 
of heat by mechanical means were suggestive. But Joule’s measurement 
of the mechanical equivalent of heat was convincing. If, directly, or 
indirectly by inference, the author has succeeded here and there in the 
text in pointing out such generalizations as these, one more object which 
he has had in mind will have been accomplished.” 

With the objective of making the sixth edition a more useful inter- 
mediate textbook, the material has been organized into shorter chapters 
to which problem sets have been added. New chapters have been pre- 
pared on relativity, wave mechanics, and solid-state physics. To cover 
the entire book would require a full year’s course, but appropriate 
selection of chapters will make the book a suitable text for courses as 
short as one quarter or one semester. In many chapters topics which are 
often omitted in shorter courses are set off on a gray background. To 
keep the size of the book within reasonable bounds, many subjects
treated in the fifth edition are omitted or discussed more briefly; the 
largest deletion is virtually the entire chapter on cosmic rays. 

The general development has been carried out using rationalized 
MKSA units, but there has been no hesitancy to use units such as the 
gram, the electron volt, or the Angstrom unit whenever they seemed more 
appropriate (e.g., the traditional use of cm⁻¹ for wavenumbers is retained).
In general, symbols and abbreviations follow the system proposed by 
the commissions of the International Union of Pure and Applied Physics. 

After the death of Professor Richtmyer in 1939, Professor E. H. 
Kennard undertook the preparation of the third and fourth editions with 
Professor L. G. Parratt collaborating on the chapter on x-rays. Pro- 
fessor T. Lauritsen joined Kennard in bringing out the fifth edition. 
When time came for a sixth edition, Dr. Kennard’s health was failing (his 
death came in January, 1968) and Professor Lauritsen declined the 
invitation to assume responsibility for the revision. Although he read 
critically many chapters of the present manuscript and proposed numerous
improvements, Professor Lauritsen elected to restrict his participation
in the present edition and, at his own request, he is not listed as an
author. 

In addition to my indebtedness to Professor Lauritsen, I acknowl- 
edge help from many persons. First of all, my wife Elaine typed and 
helped with the physical preparation of the entire manuscript. Students 
who studied selected chapters in preliminary form have made valuable 
suggestions and caught many errors. Several colleagues have been 
very helpful in reviewing material in the areas of their specialties; in 
particular, I thank Professor R. L. Kelly for his aid on the chapters on 
atomic spectroscopy, Professors K. E. Woehler and W. B. Zeleny for 
assistance on relativity, and Professor G. E. Schacher for constructive 
comments on the chapters on the solid state. 

In spite of great effort to avoid errors it is inevitable that some will 
creep into any book. I will be grateful to anyone who finds and reports
mistakes and deficiencies in this book.


John N. Cooper 

chapter one


The Heritage of Modern Physics 


As a prelude to understanding modern physics, one should be familiar
with the main events in the evolution of natural philosophy over the centuries.
Physicists owe it to their science to possess sufficient knowledge of its history
for a correct perspective of its development and present-day importance.
That very history, the lives of the men whose labors formed our subject, and
the way physics has molded human thought and contributed to modern
civilization claim our first attention.


1.1 Classical vs. Modern Physics 


Taken literally, modern physics means, of course, the sum total of knowl- 
edge included under the head of present-day physics. In this sense, 
the physics of 1890 is still modern; very few statements made in a good 
physics text of that year would have to be changed today as untrue. 
On the other hand, since 1890, there have been enormous advances in
physics, some of which have necessitated radical modifications in certain
theories that had seemed to be strongly supported by experimental
evidence. In particular, the metaphysical assumptions of the nineteenth
century proved to be an inadequate foundation for the growing
structure of natural philosophy. New ideas were required, differing 
sharply with the older ones in special areas where the latter failed but 
still yielding the same predictions where the older ideas had been success- 
ful. The extensive physics of 1890, which includes analytical mechanics, 
thermodynamics, and maxwellian electrodynamics, is often called classi- 
cal physics. Then “modern physics” implies the quantitative descrip- 
tion of phenomena which could not be understood in terms of classical 
assumptions. 

Of course, there is no sharp demarcation between classical and 
modern physics; the latter grows out of the former and is nourished by 
it. For example, few physicists in 1890 questioned the wave theory of 
light. Its triumph over the corpuscular theory seemed to be complete, 
particularly after the brilliant experiments of Hertz in 1887, which 
demonstrated the fundamental soundness of Maxwell’s electromagnetic 
theory of light. And yet, by an irony of fate which makes the story 
of modern physics full of dramatic situations, these very experiments 
brought to light a new phenomenon, the photoelectric effect, which 
played an important role in establishing the quantum theory. In many 
of its aspects the quantum theory seems diametrically opposed to the 
wave theory of light; indeed, the reconciliation of these two theories, 
each based on incontrovertible evidence, was one of the great problems 
of the first quarter of the twentieth century. 

For the brief historical sketch which follows, the evolution of 
premodern physics is arbitrarily divided into three periods. 

The first period extends from the earliest times up to about A.D. 
1550, this date marking approximately the beginning of the experimental 
method. During this long era, there was some advance in the accumula- 
tion of the facts of physics as a result of the observation of natural phe- 
nomena, but the development of extensive physical theories was rendered 
impossible, partly by the speculative, metaphysical nature of the reasoning
employed, but more particularly by the almost complete absence of
experiment to test the correctness of such theories as were proposed.
The main characteristic of this period, therefore, is the absence of system- 
atic experiment. 

The second period extends from A.D. 1550 to 1800. Numerous 
basic advances were made during this period—by such men as Gilbert, 
Galileo, Newton, Huygens, Boyle—but its most important characteristic 
is the development and the firm establishment of the experimental method as a 
recognized and accepted means of scientific inquiry. This period was
inaugurated by the classic work of Galileo (1564-1642); but it took 
nearly two centuries more to bring universal recognition of the following 
basic principle: 

Science can advance only so far as theories, themselves based upon experiment, are
accepted or rejected according as they either agree with, or are contrary to, other experiments
devised to check the theory.


The third period, 1800 to 1890, is characterized by the development 
of what is now called classical physics in contrast with the quantum 
physics of the present century. The experiments of Count Rumford 
and Joule led to our present kinetic theory of heat. The observations of 
Thomas Young (1802) and his proposal of the principle of interference 
(of two beams of light) resulted ultimately in the triumph of Huygens’ 
wave theory of light. The researches of Faraday and others gave 
Maxwell the material for the crowning achievement of this period, the 
electromagnetic theory. 


First Period: Earliest Times to A.D. 1550 


1.2 The Greeks 


Relatively speaking, the contributions made by the Greeks to the natural 
sciences were far less than their contributions to mathematics, literature, 
art, and metaphysics. Nevertheless, in spite of their vague and misty 
philosophizing concerning natural phenomena and in spite of their general 
failure to test theory by experiment, the Greeks gave to the world much 
of the physics that was known up to A.D. 1400. In their writings, one 
finds the germ of such fundamental concepts as the conservation of 
matter, inertia, atomic theory, the finite velocity of light, and the like. 


1.3 Thales of Miletus (640-546 B.C.)¹


According to Aristotle, Thales was acquainted with the attractive power 
of magnets and of rubbed amber. He is often said to have discovered 
the inclination of the ecliptic and the spherical shape of the earth, but 
Aristotle credited him with the doctrine that the earth was cylindrical 
in shape and rested on water. 


1 Few of these ancient dates are known with certainty, and authorities differ. 

1.4 Pythagoras (580-500 B.C.)


One of the greatest of the early Greek philosophers and the founder of 
the Pythagorean school, he held that the earth is spherical, although 
the basis of this belief is not known; it may have been based on the idea 
that the sphere is the most perfect of all figures. Pythagoras believed 
that the entire universe was spherical in shape with the earth at its 
center and that the sun, stars, and planets moved in independent circles 
around the earth as a center. 


1.5 Anaxagoras (500-428 B.C.) and Empedocles (484-424 B.C.) 


According to Plato, Anaxagoras neglected his possessions in order to 
devote himself to science. He is credited with the view that the moon 
does not shine by its own light but that “the sun places the brightness 
in the moon” and “the moon is eclipsed through the interposition of the 
earth.” Also, “The sun is eclipsed at new moon through the interposition
of the moon.” Anaxagoras was accused of impiety, however,
because he taught that the sun was a red-hot stone and that the moon
was simply earth, and for holding this doctrine he was banished from
Athens.

To Anaxagoras is due the germ of the atomic hypothesis of Democ- 
ritus, who lived in the next generation. Anaxagoras denied the conten- 
tion of the earlier Greeks regarding the creation or destruction of matter. 
He taught that changes in matter are due to combinations or separations
of small, invisible particles (spermata). These particles themselves
were conceived to be unchangeable and imperishable but different
from each other in form, color, and taste. This doctrine foreshadowed 
the law of the conservation of matter. 

Empedocles, on the other hand, reduced the elements to four (earth,
water, air, and fire), through combinations and separations of
which the All exists. He also held that light is due to the emission by
the luminous or visible body of small particles that enter the eye and
are then returned from the eye to the body, the two “streams” giving
rise to the sense of form, color, etc.

According to Aristotle, Empedocles believed that light “takes time 
to travel from one point to another.” This idea was rejected by Aristotle, 
who stated that “though a movement of light might elude our observa- 
tion within a short distance, that it should do so all the way from east to 
west is too much to assume.” 

1.6 Democritus (460-370 B.C.)


He gave more definite form to the atomic hypothesis of Anaxagoras by 
postulating that the universe consists of empty space and an (almost) 
infinite number of indivisible and invisible particles which differ from 
each other in form, position, and arrangement. In support of this 
hypothesis, Democritus argued that the creation of matter is impossible, 
since nothing can come from nothing and, further, nothing which is can 
cease to exist. Aristotle put the argument in this form: “If, then, some 
one of the things which are is constantly disappearing, why has not the 
whole of what is been used up long ago and vanished away?” But 
Aristotle rejected the atomic hypothesis. 


1.7 Aristotle (384-322 B.C.)


A pupil of the philosopher Plato, Aristotle contributed so much to all 
branches of knowledge—logic, rhetoric, ethics, metaphysics, psychology, 
natural science—that it is difficult to sift out that which is germane to 
a brief history of physics. Perhaps the most important single fact is 
the tremendous influence which, as a result of his intellectual brilliance 
and achievements in many branches of learning, he exerted for many 
succeeding centuries in all branches, physics included. Viewed from 
our twentieth-century vantage point, however, not a little of his reasoning 
concerning the physical universe sounds like piffle. 

Yet Aristotle frequently calls in observed facts to substantiate his 
speculations. For example, in “De Caelo” (Book II, Cap. XIV), after 
proving, by a more or less abstract argument, that the earth is spherical, 
he says: 

The evidence of all the senses further corroborates this. How else
would eclipses of the moon show segments as we see them? . . .
since it is the interposition of the earth that makes the eclipse, the
form of this line [i.e., the earth’s shadow on the moon] will be caused
by the form of the earth’s surface, which is therefore spherical.


He also points to the apparent change in altitude of the stars as one 
travels north or south and concludes that “not only is the earth circular,
but it is a circle of no great size.” 

Indeed, in theory if not in his own practice, Aristotle places great 
emphasis on the importance of facts in connection with scientific development.
In a paragraph in “De Generatione et Corruptione” (Book I,
Cap. II), he says:

Lack of experience diminishes our power of taking a comprehensive
view of the admitted facts. Hence, those who dwell in intimate association
with nature and its phenomena grow more and more able to
formulate, as the foundation of their theories, principles such as to
admit of a wide and coherent development; while those whom devotion
to abstract discussions has rendered unobservant of the facts are
too ready to dogmatize on the basis of a few observations.


This is surely good doctrine even for twentieth-century scientists! 

An attempt to summarize Aristotle’s views on physics is beyond the 
scope of this book, but reference may be made to two of his doctrines 
because of their bearing upon subsequent history. 

The first is his supposed views on falling bodies. The statement is 
commonly made that Aristotle held that a heavy body would fall from 
a given height with greater velocity than a light body. It is difficult 
to be sure from Aristotle’s extant writings just what he actually held 
in regard to this point. The passages that seem to refer to it occur in
the course of his arguments against the possibility of the existence of a
void. For example, he states:


We see the same weight or body moving faster than another for two 
reasons, either because there is a difference in what it moves through, 
as between water, air, and earth, or because, other things being equal, 
the moving body differs from the other owing to excess of weight or 
lightness. . . . And always, by so much as the medium is more 
incorporeal and less resistant and more easily divided, the faster will 
be the movement. 


Here, as elsewhere, Aristotle speaks always of movement of a body 
through a medium. He may have meant terminal velocity, such as the
constant velocity of rain drops as they approach the ground. The 
heavier drops do fall faster. It seems probable that Aristotle, believing 
that a medium of some sort must always be present, was unaware of 
such distinctions as that between terminal and nonterminal velocity 
and actually did believe that in all stages of its motion the heavier body 
falls faster. 
The second doctrine referred to is that of the motion of the earth, 
sun, and planets. In his “De Caelo” (Book II, Cap. XIV), after a series
of abstract arguments, in the course of which he states that “heavy
bodies forcibly thrown quite straight upward return to the point from
which they started even if they be thrown to an infinite [!] distance,”
Aristotle concludes “that the earth does not move and does not lie
elsewhere than at the center.” He supposed that the sun, planets, and
stars are carried by a series of concentric spheres which revolve around 
the earth as a center. 


1 This and the following quotation from Aristotle are taken from “The Works
of Aristotle Translated into English,” vol. II, Clarendon Press, Oxford, 1930.

1.8 Aristarchus (310-230 B.C.)


He enunciated a cosmogony identical with that proposed by Copernicus
nearly 2000 years later. No mention of this hypothesis is made in his 
only extant work, “On the Sizes and Distances of the Sun and Moon,” 
but Archimedes tells us, in a book called “The Sand-reckoner,” that 
“Aristarchus of Samos brought out a book” containing the hypothesis 
“that the fixed stars and the sun remained unmoved; that the earth 
revolves around the sun in the circumference of a circle, the sun lying 
in the middle of the orbit”; and that “the sphere of the fixed stars”
is very great compared with the circle in which the earth revolves.
The prestige of Aristotle was too great, however, and the geocentric
hypothesis that he supported was so completely satisfactory to the 
ancient mind that Aristarchus’ theory was practically lost for nearly 
2000 years. 


1.9 Archimedes (287-212 B.C.) 


Probably the most noted physicist of antiquity, Archimedes was a man 
of great ability in mathematics as well as a practical engineer. He 
invented the endless screw, the water screw, a pulley block, and the 
burning mirror. He was a founder of statics, and in his book “On 
Floating Bodies” he laid the foundations of hydrostatics, including his 
famous principle, “A solid heavier than a fluid will, if placed in it, descend 
to the bottom of the fluid, and the solid will, when weighed in the fluid, 
be lighter than its true weight by the weight of the fluid displaced.” 


1.10 From the Greeks to Copernicus 


To give but a passing comment to the 17 centuries between Archimedes 
and Copernicus would seem to give the reader the false impression that 
no developments of moment occurred during that long period. During
ancient times Ptolemy of Alexandria (A.D. 70-147) collected the optical
knowledge of his time in a book in which he discussed, among other 
things, reflection from mirrors—plane, convex, concave—and, particu- 
larly, refraction, which Ptolemy evidently studied experimentally. He 
gave, in degrees, relative values of angles of incidence and of refraction 
for air-water, air-glass, and water-glass surfaces and described an appara- 
tus by which he determined these quantities; he stated that for a given 
interface these two angles are proportional. He also mentioned atmospheric
refraction as affecting the apparent position of stars. He invented
a complicated theory of the motions of the planets in their orbits about 
the earth in order to explain their apparent motions among the stars. 

From Ptolemy to the Arabian Alhazen is a span of nine centuries— 
twice the total lapse of time from the discovery of America to the 
present—during which there was stagnation in almost all intellectual 
pursuits. But about the eighth century the Arabs began to cultivate 
chemistry, mathematics, and astronomy, in part by translating into 
Arabic the works of the Greeks but also by making original contribu- 
tions. About the year 1000, Alhazen produced a work on optics in 
seven books. This treatise sets forth a clear description of the optical 
system of the eye, discusses the laws of concave and convex mirrors,
both spherical and cylindrical, and carries the subject of refraction a 
little further by recognizing that the proportionality between the angles 
of incidence and refraction holds only for small angles. 

During the next 500 years, relatively few advances in physics were 
made. Roger Bacon (1214-1294), English philosopher, scientist, and 
Franciscan monk, taught that in order to learn the secrets of nature
we must first observe. He believed in mathematics and in deductive 
science, but he clearly realized that only as scientific conclusions are
based on observed phenomena and tested by experiment can useful 
knowledge result. 

About the same time Petrus Peregrinus recognized that magnetic 
poles are of two kinds, like poles repelling and unlike attracting each 
other. 

Then there was Leonardo da Vinci (1452-1519), Italian painter, 
architect, sculptor, engineer, and philosopher, whose greatness as a 
scientist has come to be appreciated only in recent years. Since his 
works were left in manuscript form and were probably not widely known 
among his contemporaries, his influence on early science is compara- 
tively insignificant. His belief in the value of experiment is worthy of 
the twentieth century: “Before making this case a general rule, test it 
by experiment two or three times and see if the experiment produces the 
same effect.” Although expressed in the vague language of his time,
some of his ideas concerning what we now refer to as force, inertia,
acceleration, the laws of motion, etc., were qualitatively correct. Con-
cerning perpetual motion, he wrote: “Oh, speculators on perpetual
motion, how many vain projects of the like character you have created! 
Go and be the companions of the searchers after gold.” Rejecting the 
Ptolemaic theory, he held that “the sun does not move.” That he was
not persecuted or even burned at the stake, as Bruno was a century later, 
for holding such revolutionary and therefore (!) heretical views is prob-
ably due to the fact that his doctrines were given little publicity; for,
holding no academic position, he did not teach, and he published nothing.

Finally, in the sixteenth century, the full force of that period of
intense intellectual activity known as the Renaissance began to be felt 
in the field of physics. Then came such men as Copernicus, Tycho, 
Kepler, Galileo, and Newton, who with their contemporaries and col- 
leagues, in a space of hardly more than a century, broke the spell of 
Aristotle and made possible the beginnings of modern experimental 
science. Insofar as the heliocentric theory completely revolutionized 
man’s conception of the universe and his place in it, it is quite correct to 
regard the work of Copernicus as beginning a new era in scientific thought. 
But had it not been for other discoveries coming immediately after 
Copernicus, such as the telescope, Kepler's laws, Galileo’s famous experi- 
ments on falling bodies, and many others, it is quite possible that the 
theory of Copernicus would have had the same fate as that of Aristarchus 
centuries earlier. It is, therefore, fitting to regard the birth of the 
Copernican theory as closing the first period in the history of physics. 


1.11 The Copernican System 


Copernicus (1473-1543), a younger contemporary of Columbus, spent 
most of his life as one of the leading canons in the monastery at Frauen- 
burg, near the mouth of the Vistula. His theory of the universe is set 
forth in his “De Revolutionibus Orbium Coelestium,” published near 
the close of his life. 

Copernicus perceived that, by assuming that the earth is a planet 
like the others and that all the planets move in circles around the sun, 
a great simplification, both philosophical and mathematical, could be 
made with regard to the world system. He could thus easily account 
for the seasons and for the apparent retrograde motion, at times, of 
the planets. The rotation of the earth on its axis causes the apparent 
daily motion of the sun, moon, and stars; and he pointed out that, 
probably, the stars are too far away for any motion of the earth to 
affect their apparent positions. He gave the correct order of the planets
from the sun outward. 

Whatever the system as proposed by Copernicus lacked quantita- 
tively, it was correct, in its main outline, qualitatively. Its reasonable- 
ness set a few men thinking and did much to usher in a new era in science, 
an era that could come only when truth could have the opportunity 
of standing alone, unaided or unhindered by the “authority” of 2000
years.


Second Period (A.D. 1550-1800): Rise of
the Experimental Method 


1.12 Galileo Galilei (1564-1642) 


Galileo is widely regarded as the father of experimental physics. To
be sure, physics has grandfathers and still more remote ancestors, but
none of them gave more than he. Galileo was descended from a noble
family, and it is probable that he inherited from his father the spirit
of free inquiry which characterized his life; for, in the writings of the
elder Galileo, who was well educated and an accomplished musician,
is the statement: “It appears to me that they who in proof of any asser-
tion rely simply on the weight of authority, without adducing any
argument in support of it, act very absurdly.”

At the monastery of Vallombrosa, near Florence, the
young Galileo excelled in the classics and literature and was something 
of a poet, musician, and art critic. He also showed an aptitude for 
science and exhibited considerable mechanical inventiveness. At the age 
of seventeen he was sent to the University of Pisa to study medicine. It 
was here that he made his first discovery and invention. One day, in 
1581, he noticed the regular oscillations of the great hanging lamp in the 
cathedral at Pisa. Although the amplitude of these oscillations became 
less and less, they were all performed in the same time, as he determined 
by counting his pulse. Turning the process around, he invented a 
pulsometer, a ball-and-string, i.e., simple pendulum, device, whose length, 
when adjusted to synchronism with the pulse, was a measure of its 
frequency. 
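
The pulsometer works because, for small swings, the period of a simple pendulum depends only on its length. A minimal sketch follows, assuming the modern small-amplitude formula T = 2π√(L/g), which Galileo did not have in this form, and an illustrative pulse rate of 72 beats per minute; the function name and the numerical values are assumptions made here.

```python
import math

G = 9.80665  # standard gravity in m/s^2 (assumed modern value)

def pendulum_length_m(beats_per_minute, g=G):
    """Length of a simple pendulum whose period equals one pulse beat."""
    period_s = 60.0 / beats_per_minute
    # Invert T = 2*pi*sqrt(L/g) for the length L.
    return g * period_s**2 / (4 * math.pi**2)

length = pendulum_length_m(72)  # 72 beats per minute is an illustrative pulse rate
print(f"{length * 100:.1f} cm")
```

A slower pulse calls for a longer string, which is how the adjusted length serves as a measure of the pulse frequency.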

But the urge toward mathematics and science overcame the pecu-
niary advantages of a medical career; at twenty-six, Galileo became
professor of mathematics at Pisa. Here he began a systematic investiga-
tion of the mechanical doctrines of Aristotle. He soon demonstrated by
experiment that Aristotle was in error in many of his assertions. Aris-
totle was commonly understood to teach that a heavy body falls faster
than a light one. This doctrine had been questioned, on the basis of
actual test, by various writers, e.g., by Philoponus in the fifth century
and by Benedetto Varchi in the generation before Galileo. Nevertheless
the authority of Aristotle had continued to be accepted. To test the
point, Galileo is rumored to have tried the famous experiment of dropping
bodies of unequal weight from the leaning tower of Pisa and finding
that they all fell with practically equal velocities. While we know
nothing of the details of the experiment, it is certain that Galileo publi-
cized Aristotle’s error.

Fig. 1.1 Galileo Galilei.


Then began a persecution which led Galileo to quit Pisa. In
1592 he became professor of mathematics at the University of Padua, 
where he remained 18 years, enjoying comparative liberty of thought 
and teaching. His fame as a teacher spread all over Europe, and his 
lectures were crowded. 

In 1608, a Dutch optician, Lippershey, as a result of a chance 
observation of an apprentice, had succeeded in “making distant objects 
appear nearer but inverted” by looking through a tube in which were 
mounted two spectacle lenses. News of this invention reached Galileo 
in June, 1609. Grasping the principle involved, he made a telescope 
and exhibited it in Venice “for more than a month, to the astonishment 
of the chiefs of the republic.” By January, 1610, Galileo had made a 
telescope with a power of 30 diameters,¹ with which he made a number
of fundamental discoveries. He saw that the number of fixed stars
was vastly greater than could be seen by the unaided eye, and thus he
explained the agelong puzzle of the Milky Way. He saw that the
planets appeared as luminous disks while the stars still remained points
of light, and he discovered the satellites of Jupiter. These discoveries
made Galileo famous. He soon accepted an invitation to return to
Pisa as “First Mathematician and Philosopher,” at a substantial increase
in salary, but at a sacrifice of his academic freedom in Padua. Con-
tinuing his astronomical investigations, he discovered the crescent phases
of Venus, sunspots and the rotation of the sun, the faculae of the solar
atmosphere, and the libration of the moon. In 1612, he published his
“Discourse on Floating Bodies.”

¹ Galileo's telescopes were similar to the modern opera glass: a double-convex
(or plano-convex) object glass and a double-concave eyepiece. Thus, they had an
erect image.

At first, it seemed as if his fame had silenced all opposition from the 
church. But the support that his discoveries gave to the hated Coper- 
nican theory and his vigorous attacks on Aristotelian philosophy roused 
his enemies to fury, with the result that in 1615 he was hauled before the 
Pope and, under threat of imprisonment and torture, was “enjoined . . .
to relinquish altogether the said opinion that the sun is the center of the 
world and immovable . . . nor henceforth to hold, teach, or defend it 
in any way ....” Simultaneously, it was decreed that the works of 
Copernicus “be suspended until they be corrected.” Galileo acquiesced 
to these decrees and was allowed to return to Pisa, where he continued 
his researches. 

In 1623, one of Galileo’s friends, Barberini, became Pope Urban 
VIII, from whom Galileo received assurances of “pontifical good will.” 
Thereupon he began the writing of his great book, “Dialogues on the 
Ptolemaic and Copernican Systems.” This was published in 1632, under 
formal permission from the censor. The form of these dialogues is 
ingeniously contrived to abide by the letter of the decree of 1615. Three
“characters” carry on the discussion: Salviati, a Copernican; Simplicio,
an Aristotelian; and Sagredo, a witty, impartial, good-natured chairman. 
The dialogues cover four “Days,” during which the arguments for and 
against each system are set forth with apparent impartiality and without 
reaching any stated conclusion. Nevertheless, the general effect of the
book was a powerful argument for Copernicanism.

At the instigation of his enemies Galileo was presently called before
the Inquisition. He was sixty-seven years old and impaired in health 
and spirit. Bowing to the inevitable, he indicated his “free and unbiased” 
willingness to recant, to “abjure, curse, and detest the said heresies and 
errors and every other error and sect contrary to the Holy Church,”
and he agreed “never more in future to say or assert anything, verbally
or in writing, which may give rise to a similar suspicion.” Thereafter
Galileo was a prisoner under suspended sentence, first at Rome, then at
his home in Arcetri. Here, during the last years of his life, he prepared
and in 1636 published his “Dialogues on Two New Sciences,” i.e.,
cohesion and motion.

The dialogues on “Motion” sum up Galileo's earlier experiments and
his more mature deliberations. He states that “if the resistance of the
media be taken away, all matter would descend with equal velocity.” 
He deduces the formulas of uniformly accelerated motion. He shows 
that the path of a projectile is parabolic under suitable limiting condi- 
tions and states that if all resistance were removed, a body projected 
along a horizontal plane would continue to move forever. His work on 
mechanics paved the way for the enunciation by Newton of the famous 
three laws of motion. 


1.13 Tycho Brahe (1546-1601) and Kepler (1571-1630) 


The work of Tycho and Kepler is particularly interesting not only 
because of its direct bearing on the development of physics but more 
particularly because of the mutual dependence of the work of each one 
upon that of the other, a relation common in present-day science. Tycho 
was the experimentalist who supplied the accurate data upon which
Kepler built a new theory of planetary motion. Without a Kepler, 
Tycho’s observations would have attracted hardly more than passing 
notice. Kepler, in turn, might have theorized to his heart's content but 
without the accurate data of a Tycho, those theories might ultimately 
have shared the fate of Aristotle's. Sometimes theory precedes, 
sometimes experiment. But neither can get far without the other. 

Tycho Brahe, born of a noble family in Sweden, was educated for a 
career as a statesman, but he developed a consuming interest in astron- 
omy. By observations of his own, he found that the current astronomical 
tables were incorrect. In 1575, he was put in charge of the observatory
of Uraniborg by King Frederick II of Denmark, one of his duties being 
to make astrological calculations for the royal family. Here he spent 20 
years making systematic observations of the planets, constructing a star 
catalog, and accumulating other astronomical data, always with the 
highest accuracy that could be attained without a telescope. In 1599, 
he undertook to establish a new observatory at Prague for the German 
emperor, Rudolph II, but in the midst of this work he suddenly died. 

Among Tycho Brahe’s assistants at Prague was a brilliant young 
mathematician, Johann Kepler. He succeeded Tycho as principal 
mathematician to the emperor and undertook to carry to completion
the new set of astronomical tables based on the elaborate observations 
of his predecessor. Kepler remained at Prague until 1612; from then 
until his death, in 1630, he held a professorship at Linz. 

Tycho Brahe had rejected the Copernican system for a geocentric 
system of his own. It is one of the ironies of science that his own data
on planetary motions, taken in support of his own theory, became, in
the hands of Kepler, the clinching argument for the Copernican system.
Using Tycho Brahe’s observations, Kepler made a special study of the 
motion of Mars. He tried to reconcile the recorded positions of the 
planet by assuming circular orbits for Mars and for the earth, trying
various positions of these orbits relative to the sun.
Resorting to the Ptolemaic notion of epicycles and deferents gave
some improvement, but still the observed positions differed from the
computed, in some cases by as much as 8 minutes of arc. Kepler knew
that the observations could not be in error by that amount. Some new
idea of planetary motion was necessary.

Kepler then gave up uniform circular motion and assumed that the
speed varied inversely as the planet’s distance from the sun. This
assumption is his famous “second law,” that the radius vector from the
sun to the planet describes equal areas in equal times. It worked approxi-
mately, but still there were systematic errors which exceeded the possible
errors of observation. Finally, he cast aside the last traditions of the 
Ptolemaic system and tried orbits of other forms, first, an oval path and 
then an ellipse with the sun at one focus. At last, his years of compu- 
tation bore fruit. The path was an ellipse. Theory and observation 
agreed! And one of the most important and far-reaching laws in all 
science had been discovered, all because of a discrepancy of 8 minutes of 
arc between observation and theory! One of the striking things in
the growth of science is the fact that many fundamental advances have 
come about because of just such discrepancies, frequently very small 
ones, between observation and theory. 

Eventually, Kepler handed down to posterity the three laws of 
planetary motion, which, sweeping away all remnants of the Ptolemaic 
system, paved the way for modern astronomy: 


1. The planets move around the sun in orbits which are ellipses with the sun
at one focus.
2. The radius vector (from sun to planet) sweeps over equal areas in equal times.
3. The squares of the periods of revolution of the planets around the sun are
proportional to the cubes of the semimajor axes of their respective orbits.

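
The third law lends itself to a quick numerical check. The sketch below uses approximate modern values for the semimajor axes and periods of the six planets known in Kepler's day; these figures and all names in the code are assumptions supplied here, not data taken from Tycho or Kepler. With distances in astronomical units and periods in years, the ratio T²/a³ should come out close to 1 for every planet.

```python
# Approximate modern orbital elements (assumed, not from the text):
# semimajor axis in astronomical units, period in years.
PLANETS = {
    "Mercury": (0.387, 0.241),
    "Venus": (0.723, 0.615),
    "Earth": (1.000, 1.000),
    "Mars": (1.524, 1.881),
    "Jupiter": (5.203, 11.862),
    "Saturn": (9.537, 29.457),
}

def third_law_ratio(semimajor_axis_au, period_yr):
    """Square of the period divided by the cube of the semimajor axis."""
    return period_yr**2 / semimajor_axis_au**3

ratios = {name: third_law_ratio(a, t) for name, (a, t) in PLANETS.items()}
for name, ratio in ratios.items():
    print(f"{name:8s} T^2/a^3 = {ratio:.3f}")
```
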
But what makes the planets move? Why do the outer ones go more 
slowly? Is there “one moving intelligence in the sun, the common cen- 
ter, forcing them all around, but those most violently which are nearest?” 
Kepler speculated long on this question and arrived at the idea of an 
attraction acting between any two material bodies. This qualitative 
idea of Kepler’s was later developed by Newton into his quantitative
theory of universal gravitation. Kepler himself, however, seems to have
had no idea that it is this very attractive force which keeps the planets
in their orbits.

Kepler also made substantial contributions in the field of optics. 
He understood the principle of total reflection and how to determine 
what we now call the critical angle. He studied atmospheric refraction
as affecting the apparent position of the heavenly bodies and worked 
out an approximate formula to allow for this error for all positions, from 
zenith to horizon. He was the first to propose the meniscus type of 
lens. And he invented the Keplerian telescope, in which a real image is 
formed, thus making possible accurate measurements by means of cross 
hairs in the focal plane of the objective. 


1.14 The Experimental Method Spreads 


The impetus given to science by Galileo, Tycho, and Kepler resulted 
in an ever increasing number of investigators in the generations that 
followed. We mention only a few of them. Of great significance, too, 
is the formation in Europe at about this time of learned societies which 
brought together, for argument and discussion, men of kindred interests. 
The Lincean Society was founded in Italy, in 1603; the Royal Academy 
of Sciences, in France, in 1666; and the Royal Society, in England, in 
1662. Continued improvement of the art of printing facilitated the 
diffusion of scientific knowledge. 

In 1600, William Gilbert, an English physician, published his 
famous work “De Magnete,” based largely upon his own experiments, 
in which he showed the fallacy of such popular fancies as the belief 
that lodestones lost their magnetic power when rubbed with garlic and 
regained it again when smeared with goat’s blood. He was the first 
to recognize that the earth is a great magnet, and he actually magnetized 
a small sphere of iron and showed that it produced a magnetic field 
similar to that of the earth. 

Among other workers in magnetism may be mentioned Kircher 
(1601-1680), who, by measuring the force required to pull a piece of 
iron from either pole of a magnet, demonstrated the equality of the two 
poles; Cabeo (1585-1650), who showed that an unmagnetized iron needle
floated freely on water would place itself along the earth’s magnetic 
meridian; and Gellibrand (1597-1637), who discovered the secular varia- 
tion of the magnetic declination. 

In optics, Scheiner (1575-1650) studied the optics of the eye; 
Snell (1591-1626) discovered the true law of refraction; and Cavalieri 
(1598-1647) gave the correct formula for the focal length of a thin glass 
lens in terms of the radii of curvature of the two sides. In acoustics,
Mersenne (1588-1648), after having investigated the laws of vibrating 
strings, determined, in absolute measure, the frequency of a tone. He 
also measured the velocity of sound by observing the time interval 
between the flash of a gun and the arrival of the report. 

In fluid mechanics, Torricelli (1608-1647) studied the flow of
liquids from orifices, discovered the principle of the barometer, and 
observed variation in barometric height with altitude. Working inde- 
pendently, Guericke (1602-1686) invented the air pump. Pascal (1623- 
1662) measured the difference in barometric height between the base 
and the top of a mountain, correctly explaining the reason for the differ- 
ence, and later announced the famous principle of hydrostatics that
bears his name.

Not only was physics, as a subject, beginning to assume definite 
form, but even the subdivisions such as mechanics, light, sound, etc., 
were beginning to crystallize. Then came a man 


. . . towering head and shoulders above all his contemporaries, a
veritable giant among the giants, a man whose intellect and whose
contributions to knowledge are incomparably greater than those of
any other scientist of the past, that prince of philosophers, Sir Isaac
Newton.


The other “giants” referred to, contemporaries of Newton, are such men 
as Boyle, Huygens, and Hooke. 


1.15 Sir Isaac Newton (1642-1727) 


Newton was born in the little hamlet of Woolsthorpe, England, in 
1642, less than a year after the death of Galileo. In the public school 
at Grantham, he showed at first no exceptional aptitude for his studies, 
but ultimately he rose to the highest place in the class. Then, at the 
age of fifteen, he was removed from school to assist his widowed mother 
in running the family estate at Woolsthorpe. But he had little taste 
for farming. Rather, he was interested in studying and in devising 
various mechanisms. He made a water clock, waterwheels, sundials, 
and a working model of a windmill. One morning his uncle found him 
under a hedge studying mathematics when he should have been farming. 
Thereupon Newton’s mother wisely decided that an educational career 
was more suitable for her son, and he was sent back to school and in 
1661 to Cambridge. Here his creative genius began to appear. While 
still a student, he discovered the binomial theorem, developed the 
methods of infinite series, and discovered fluxions, or the differential
calculus. 

Soon thereafter, an outbreak of the plague closed the university for 
some months, during which time Newton, at the family estate at Wools- 
thorpe, began his speculations on the subject of gravity, which later led 
to his enunciation of the inverse-square law. It was here that the much- 
quoted falling-apple episode is said to have occurred, which is supposed 
to have given Newton the basic idea of universal gravitation. But 
Newton himself makes no mention of the incident, and it seems far more 
probable that at Cambridge he had read Kepler’s qualitative proposal of 
a general principle of gravitation. Certainly, Newton was familiar with 
Kepler’s laws of planetary motion. 

In 1667, Newton returned to Cambridge as Fellow of Trinity. At 
the age of twenty-six, he was appointed Lucasian Professor of Mathe-
matics, a chair which he held for nearly 30 years. In 1703, he resigned 
his professorship to devote himself to duties as Master of the Mint, to 
the scientific work of his contemporaries, and to defending his own work 
against the attacks of jealous rivals. In this same year, he was elected 
President of the Royal Society, an office which he held until his death. 
In 1705, he was knighted by Queen Anne. 

Most of Newton’s important scientific work was done before he 
vacated the professorship, although he remained thereafter “a power of 
the first magnitude in the world of science.” In his later years, he 
devoted much time to theological studies. Throughout his life he 
shunned publicity, retaining a modesty and simplicity which are indicated 
by a sentiment uttered shortly before his death: 


I do not know what I may appear to the world, but to myself I seem 
to have been only like a boy playing on the seashore, and diverting 
myself in now and then finding a smoother pebble or a prettier shell 
than ordinary, whilst the great ocean of truth lay all undiscovered 
before me. 


Any brief account of Newton’s work must inevitably give a very 
inadequate impression of his contributions to science. We refer here 
only to a few of his researches on optics and on mechanics. Newton’s 
work on optics arose out of an attempt to improve lenses. The inability
of a lens with spherical surfaces to bring parallel rays to a point focus 
was early recognized. In 1629, Descartes had shown that lenses with 
hyperbolic or, under certain conditions, parabolic surfaces should be 
free from the defect which we now call spherical aberration. Newton 
found, however, that such lenses produced only a very slight improve-
ment in the image, and he conjectured that perhaps the trouble lay not
in the lens but in the light itself.


Fig. 1.2 Sir Isaac Newton.

Accordingly, he procured a prism of glass and, placing it over a 
hole ¼ in. in diameter through which sunlight was shining into a darkened
room, he observed the “very vivid and intense colors” produced on a
wall some 20 ft distant. Newton was surprised to find that this spec-
trum, as we now call it, was so much longer than it was wide (13¼ by
2⅝ in.). The width subtended at the hole an angle corresponding exactly
to the sun’s angular diameter, but the length could not be so explained. 
He made various surmises as to the origin of the colors, such as the 
varying thickness of the prism from apex to base, the unevenness of the 
glass, a curvilinear motion of the light after leaving the prism, etc. 
One by one, experiment proved these hypotheses wrong. Finally, he 
isolated one ray, or “color,” by suitable screens and caused it to pass 
through a second prism. In this way, he could measure the refrangibility 
of a single ray. He found that the refrangibility increased from red 
to violet; that, therefore, the first prism simply “sorted out” the colors, 
which, in combination, made “white” light. In other words, he dis-
covered that white light is made up of the spectral colors, an elementary
concept to us but new and of far-reaching importance in 1666.

Newton at once saw that this dispersion of light was the cause of
his failure to effect any substantial improvement in telescopes by use of
paraboloidal lenses. Furthermore, he concluded, on the basis of a 
hurried experiment, that in different media dispersion was always pro- 
portional to refracting power. If this were so, no combination of lenses 
of different materials could eliminate chromatic aberration. This singu- 
lar error of Newton's retarded the development of refracting telescopes 
for years. In 1730, Hall made several achromatic combinations of 
crown and flint glasses, but he published no account of his work, so 
that when Dollond, about 1757, rediscovered the method of making
such combinations, he was able to secure a patent on it—an invention 
that had been within the grasp of Newton three-quarters of a century 
before.

Newton’s theories concerning the nature of light are of historical 
interest. Much has been written concerning the extent to which he 
may have retarded the development of optics by espousing the corpuscular 
theory as against the wave theory of his contemporaries Huygens (1629- 
1695) and Hooke (1635-1703). Newton was by no means dogmatic 
in his support of the corpuscular theory. In a 1675 communication to 
the Royal Society concerning “An Hypothesis Explaining the Properties
of Light,” Newton states:


I have here thought fit to send you a description . . . of this
hypothesis . . . though I shall not assume either this or any other
hypothesis, not thinking it necessary to concern myself whether the
properties of light discovered by men be explained by this or any
other hypothesis capable of explaining them; yet while I am describ-
ing this, I shall sometimes, to avoid circumlocution . . . speak of
it as if I assumed it.


He then proceeds to describe “an aetherial medium, much of the same 
constitution with air but far rarer, subtiler and more strongly elastic” 
and supposes that 


. . . light is neither aether, nor its vibrating motion, but something 
of a different kind propagated from lucid bodies. They that will 
may suppose it an aggregate of various peripatetic qualities. Others 
may suppose it multitudes of unimaginable small and swift corpuscles 
of various sizes springing from shining bodies . . . and continually 
urged forward by a principle of motion which in the beginning 
accelerates them, till the resistance of the aetherial medium equals 
the force of that principle much after the manner that bodies let fall in
water are accelerated till the resistance of the water equals the force
of gravity.


In 1704 Newton published his optical researches in his well-known
book “Opticks,” of which the third edition appeared in 1724. Its
first sentence reads, “My Design in this Book is not to explain the
Properties of Light by Hypotheses, but to propose and prove them by
Reason and Experiment.” He describes his researches on refraction, 
reflection, colors of thin plates, etc., and he concludes by “proposing only 
some queries in order to further search to be made by others.” One of 
these queries expresses his objections to the wave theory of light: 


28. Are not all hypotheses erroneous in which light is supposed to 
consist in pression or motion propagated through a fluid medium? 


If light consists only in pression propagated without actual motion, 
it would not be able to agitate and heat the bodies which refract and 
reflect it, and . . . it would bend into the shadow. For pression or 
motion cannot be propagated in a fluid in right lines beyond an 
obstacle . . . but will bend and spread every way into the quiescent 
medium which lies beyond the obstacle. 


To account for the colors of thin films, which are now regarded 
as strong evidence for the wave properties of light, he supposes that


every ray of light in its passage through any refracting surface is put
into a certain transient constitution or state, which in the progress of
the ray returns at equal intervals and disposes the ray at every return
to be easily refracted through the next refracting surface and between
the returns to be easily reflected by it.


He even suggests that the effect might be due to vibrations excited by 
the “rays” in the material medium, vibrations which 


move faster than the rays so as to overtake them; and that when any 
ray is in that part of the vibration which conspires with its motion, 
it easily breaks through a refracting surface, but when it is in the con- 
trary part of the vibration which impedes its motion, it is easily 
reflected. . . . But whether this hypothesis be true or false, I do not 
here consider. 


Newton regarded his corpuscular theory as tentative and subject 
to confirmation on the basis of further experiments. If his theory did 
retard progress in optics, the fault lay rather with those who attached 
too great weight to his opinions. 

Newton’s speculations also serve as an example to illustrate the 
rule that even the greatest intellect works on the basis of the knowledge 
and viewpoints of its age. Had Newton lived a century later, he might 
have been one of the first believers in the wave theory. The fact that 
great scientists share the limitations of their age is an added reason for 
treating their speculative opinions chiefly as sources for further 
experiment.

Newton’s researches on optics alone would have given him a high 
rank among the scientists of his time. But of still greater value was his 
work in mechanics. In announcing that “every particle of matter in 
the universe attracts every other particle with a force inversely pro-
portional to the square of the distance between the two particles” and
in showing that this one universal law governs the motions of the planets 
around the sun and of the satellites round their planets, he gave to the 
world a truth which exercised an enormous influence upon thought. 
This achievement of Newton’s played a large part in bringing about 
the general conviction that the physical universe in its entirety is governed 
by law, not by caprice. 

Newton himself has told us how he came to discover the law of 
gravitation. First he attacked the problem of finding a law of attraction 
between two bodies, such as the sun and a planet, which would result in 
Kepler's third law. He found that a gravitational attraction varying 
as the inverse square of the distance gives this law of planetary motion.
Then he saw that a test of this inverse-square law could easily be made
by comparing the acceleration of the moon toward the earth with the
acceleration of falling bodies at the surface of the earth. It was known 
that the distance between the moon and the earth’s center is about 60 
earth radii. By the inverse-square law, therefore, in 1 s the moon
should “drop” toward the earth 1/60² as far as a body at the surface
of the earth drops in 1 s. The latter distance being, from observations
on falling bodies, 16 ft, the former should be 16/60² ft, or 16 ft in 1 min,
since the distance fallen from rest grows as the square of the time.
But the acceleration of the moon could be determined directly by applying
to the moon the expression he had used for the motion of the planets
in their orbits, viz.,

$$a = \frac{v^2}{r} = \frac{4\pi^2 r}{T^2}$$

where a = acceleration of moon toward earth
v = velocity of moon in its orbit
T = period of moon’s motion around earth
r = radius of orbit
Now r is equal to 60 times the earth’s radius, which was taken as
3436 miles on the then common assumption that a degree of latitude
is 60 miles. On this basis, the moon “drops” 13.9 ft toward the earth 
in a minute, instead of 16 ft, as predicted by the inverse-square 
law. 

Newton was twenty-three years old at the time, and he laid this cal- 
culation aside, not mentioning it to anyone. Some years later, however, 
he learned of a more accurate determination of the length subtended by a
degree. Picard found it to be more nearly 70 miles. Meantime Newton 
had succeeded in proving that a homogeneous sphere attracts an external 
body as if all its mass were concentrated at its center, thereby removing 
one uncertainty in the calculation. On the basis of Picard’s value 
Newton revised his computations on the moon’s acceleration and, to his
great joy, found that it falls toward the earth 16 ft in a minute, just
as predicted by the inverse-square law. At last, he had discovered the
law of gravitation on the basis of which he could derive all three of
Kepler's laws.
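
Newton's moon test can be retraced numerically. The sketch below is a modern restatement and not Newton's own procedure: it assumes a circular lunar orbit of 60 earth radii, a sidereal month of 27.32 days (a value not given in the text), and an earth radius computed from the assumed length of a degree of latitude. The name `moon_drop_ft` is purely illustrative.

```python
import math

FEET_PER_MILE = 5280.0
# Assumed period of the moon's motion around the earth (not stated in the text).
SIDEREAL_MONTH_S = 27.32 * 24 * 3600


def moon_drop_ft(miles_per_degree, seconds=60.0, orbit_in_earth_radii=60.0):
    """Distance (ft) the moon falls toward the earth in `seconds`, starting from rest."""
    # Earth's radius from the length of one degree: circumference = 360 degrees.
    earth_radius_miles = 360.0 * miles_per_degree / (2.0 * math.pi)
    r_ft = orbit_in_earth_radii * earth_radius_miles * FEET_PER_MILE
    # Centripetal acceleration a = v^2 / r = 4 pi^2 r / T^2.
    a = 4.0 * math.pi ** 2 * r_ft / SIDEREAL_MONTH_S ** 2
    # Fall from rest under constant acceleration: s = a t^2 / 2.
    return 0.5 * a * seconds ** 2


for mpd in (60.0, 70.0):
    print(f"{mpd:.0f} miles per degree: moon drops {moon_drop_ft(mpd):.1f} ft in 1 min")
```

With 60 miles to the degree the drop comes out near 14 ft, and with 70 miles near 16 ft, in line with the two figures quoted above.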

These results, together with some propositions on the motion of 
the planets, were communicated in 1683 to the Royal Society, which 
requested permission to publish Newton’s complete researches on the 
subject of motion and gravitation. In 1687, there appeared the first
edition of his celebrated “Philosophiae Naturalis Principia Mathematica” 
(Mathematical Principles of Natural Philosophy). The treatise is 
divided into three books, the subject matter of each being presented 
by propositions, theorems, and corollaries. The first two books deal 
with general theorems concerning the motions of bodies, and the third 
contains applications to the solar system. The entire treatise is
characterized by the exposition of the principle of universal gravitation and
its ramifications, but, as the author carefully points out, without attempt- 
ing any hypothesis as to the cause of gravitation. 

The treatise assumes Newton's famous three laws of motion as 
axioms and makes other important contributions to mechanics. It
introduces the concept of mass and describes a series of experiments 
showing that the period of a pendulum is independent of the material 
of which it is made, from which Newton concluded that for different 
bodies, mass and weight are proportional to each other. He also gave 
precision to the idea of force and formulated in general terms the
principle of the parallelogram of forces.

Of Newton’s invention of the method of fluxions, i.e., the calculus, 
of his very interesting miscellaneous writings, of the many controversies 
with his contemporaries into which he was unwillingly drawn in defense 
of his scientific work, we cannot take space to write. We can only urge
our readers to make further study of the life and works of this renowned
physicist. 


1.16 Newton’s Contemporaries


The true productive period of Newton’s life ended about 1700. His 
biography is so full of interest and inspiration that it is a temptation 
to discuss similarly the work of his contemporaries, themselves eminent 
scientists: Robert Boyle (1627-1691), the discoverer of Boyle’s law;
Christian Huygens (1629-1695), whose wave theory of light was to
triumph a century and a half later, and whose other contributions
included important work on conic sections, the theory of probability,
the pendulum clock, mechanics, and the improvement of the telescope
and microscope; Robert Hooke (1635-1703), proponent of the undulatory
theory and originator of Hooke’s law of elasticity; Leibnitz (1646-1716),
whose calculus ultimately replaced Newton’s fluxions. But, remember- 
ing that the main business of this book is modern physics, we pass on 
to a rapid review of the developments of physics during the eighteenth
century. 


1.17 Mechanics during the Eighteenth Century 


The subsequent history of mechanics has consisted of the derivation, 
from Newton’s three laws, of various secondary principles which are 
convenient for special purposes and of the solution of all sorts of prob- 
lems. Among the prominent workers during the eighteenth century, we 
find such names as Daniel Bernoulli (1700-1782), who worked on hydro- 
dynamics, the kinetic theory of gases, and the transverse vibrations of 
rods; Euler (1707-1783), who shares with Bernoulli the honor of dis- 
covering the general law of the conservation of angular momentum 
(1746); and Lagrange (1736-1813), who gave, in the equations that 
bear his name, a general method of attacking mechanics problems, 
using any sort of coordinates that may be convenient. 


1.18 Heat during the Eighteenth Century 


Galileo had invented an air thermometer in 1597; the first mercury 
thermometer was used by Kircher in 1643. About 1724, Fahrenheit 
proposed the temperature scale now known by his name; this was followed 
by the Réaumur scale and, in 1742, by the Celsius scale. Joseph Black
(1728-1799), a professor of chemistry at Glasgow and Edinburgh, made 
measurements of the heat of fusion and of vaporization of water, which 
led to modern calorimetry and gave definite form to the previously 
hazy distinctions between temperature and heat. 

With regard to theories of heat, however, there was retrogression 
during the eighteenth century. From his writings, it is clear that 
Newton regarded heat as intimately connected with the motion of the 
small particles of which bodies are composed, and this view seems to 
have been shared by his contemporaries. But early in the eighteenth 
century, there was a return to the caloric theory, which held heat to be a 
fluid that could be extracted from, or added to, a body, thereby causing its
changes of temperature. This heat fluid was indestructible, its particles
were self-repellent but were attracted by ordinary matter, and it was
all-pervading. The expansion of bodies when heated was the natural result
of “swelling” due to forcing caloric into matter. The production of heat
by percussion was due to the releasing or “pounding loose” of
the caloric naturally condensed in or absorbed by the body, thereby
increasing the amount of free caloric. Black explained latent heats and 
specific heats on the basis of this theory. Indeed, by the end of the 
eighteenth century, the caloric theory of heat was widely accepted. 


1.19 Light during the Eighteenth Century 


An event of special importance in the history of science was the
discovery of the aberration of light by Bradley in 1728. The absence of
any measurable stellar parallax had been one of the stumbling blocks
in the way of the Copernican system and was, therefore, one of the
outstanding problems of astronomy. Tycho Brahe had recognized that,
viewed from opposite sides of the earth’s orbit, the stars should show a
perspective displacement, but his careful observations convinced
him that no such displacement so great as 1 minute of arc existed. Later
observers likewise sought such an effect in vain.
In hopes of being able to measure stellar distances, Bradley began, in
1725, systematic observations on the position of a zenith star, γ Draconis.
If stellar parallax existed, this star should be farthest south in December
and should then move north, reaching its maximum northerly position
6 months later. The position of the star was found to change but not
in the manner expected. It reached farthest south in March and farthest
north in September, the angular distance between the two positions being
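about 40 seconds of arc. Bradley continued his observations on other
stars and in 1728 came to the conclusion that the observed displacement
was not due to parallax at all but to an apparent shift in the star’s
position due to a combination of the velocity of light with that of the
earth in its orbit. He was thus enabled to deduce a value for the velocity
of light. The value so found was in substantial agreement with that
obtained by Römer half a century earlier from a study of the motion
of Jupiter’s moons, which constituted the first determination of the
velocity of light. This discovery of Bradley’s was the first in the series
that led up to the modern theory of relativity.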
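
The size of the displacement Bradley measured can be turned into a rough value for the velocity of light. The sketch below rests on several assumptions that are not in the text: that half of the 40-second swing is the aberration angle (reasonable for a star near the pole of the ecliptic), that the classical relation tan(angle) = v/c holds, and that the earth's orbital speed is about 29.8 km/s. The function name is illustrative only.

```python
import math

ARCSEC_PER_RADIAN = 3600.0 * 180.0 / math.pi


def light_speed_from_aberration(total_swing_arcsec, earth_speed_km_s=29.8):
    """Estimate the velocity of light (km/s) from a star's full annual swing."""
    # The star is displaced by the aberration angle to either side of its mean
    # position, so the angle itself is half the total swing.
    half_angle_rad = 0.5 * total_swing_arcsec / ARCSEC_PER_RADIAN
    # Classical aberration: tan(angle) = v_earth / c.
    return earth_speed_km_s / math.tan(half_angle_rad)


c_estimate = light_speed_from_aberration(40.0)
print(f"Estimated velocity of light: {c_estimate:.3g} km/s")
```

Under these assumptions the estimate lands within a few percent of the modern value of about 300,000 km/s.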

Theories about the nature of light made no material progress during 
the eighteenth century because of the lack of any crucial experiment,
as was also true of theories of heat. Science has seldom progressed
on the basis of speculation only. 


1.20 Electricity during the Eighteenth Century


Electricity received great attention during the eighteenth century, but
research was principally in electrostatics. Stephen Gray (1670-1736)
distinguished between conductors and nonconductors and proved that
conducting bodies can be electrified provided they are insulated. Du 
Fay (1698-1739) showed that flames exercise a discharging power and 
that there are two kinds of electricity, which he called vitreous and 
resinous. He was thus led to propose the two-fluid theory of electricity. 
During the first half of the eighteenth century, the electroscope was 
invented by Hauksbee in 1706; frictional electric machines were developed; 
the Leyden jar was introduced in 1745; and there was considerable 
popular interest in electrical phenomena. During the latter half of the 
century, three names are preeminent: Benjamin Franklin (1706-1790), 
Henry Cavendish (1731-1810), and Charles A. de Coulomb (1736-1806). 

Franklin’s experiments began about 1745. One of his first observations
was the effect of points “in drawing off and throwing off the electrical
fire.” He proposed the one-fluid theory of electricity, somewhat
similar to the caloric theory of heat. This theory supposed that all 
bodies contained naturally a certain amount of the electric fluid. When 
a body had an excess of the fluid, it exhibited one of the two kinds of 
electrification, which Franklin chose to call for this reason positive; when
it had a deficit, it exhibited the other kind, which he called negative.
Certain features in the appearance of electric sparks led him to identify 
his positive electrification with that which had been called vitreous. 

About 1750, Franklin began to speculate on the identity of electric 
discharges and lightning, pointing out many similarities and proposing, 
by means of a pointed iron rod, to “draw off the fire from a cloud.” 
In 1752 Dalibard tried the experiment in Paris, confirming Franklin’s 
prediction. A short time later, Franklin performed the famous kite 
experiment, well known to every schoolboy; this led to his study of 
atmospheric electricity and to his invention of the lightning rod. Frank- 
lin’s researches occupied but a small portion of his long and busy life, 
but they were sufficient to give him a high standing among the scientists 
of the world. 

Quantitative researches in electricity began with Cavendish and
Coulomb. Cavendish is known not only for his work in electrostatics 
but also for his researches in chemistry and for the well-known Cavendish 
experiment, in 1798, in which he determined the constant of gravita- 
tion. His electrical researches were extensive, but most of his work 
remained unknown, for he published only one paper of importance. He 
left behind a wealth of manuscript notes, which were edited and published 
in 1879 by Maxwell. In these experiments, Cavendish tested the inverse- 
square law of electrostatic force; measured capacitance, recognized the 
principle of the condenser, and measured the specific inductive capacity 
of several substances; arrived at a reasonably clear idea of the quantity 
which we now call potential; and anticipated Ohm’s law by 50 years.
Had these important measurements been communicated to his scientific
contemporaries, the history of electricity might have been substantially 
modified. As it is, the credit of discovery seems fairly to belong to others, 
for a discovery is of no importance to any one else if it is kept secret. 

Coulomb’s work in electricity grew out of his development of the 
torsion balance, originally used for studying the torsional elasticity of 
wires. In the period from 1785 to 1789 he published seven papers on 
electricity and magnetism in the Mémoires de l’Académie Royale des 
Sciences in which he showed that electrostatic forces obey the inverse- 
square law; that, on conducting bodies, the charge exists only on the 
surface; and that the capacitance of a body is independent of the nature 
of the material of which it is composed. 


1.21 Close of the Second Period 


The end of the eighteenth century found rival theories contending in
three divisions of physics: the caloric vs. the kinetic theory in heat; the
corpuscular vs. the undulatory theory in light; and the one-fluid vs.
the two-fluid theory in electricity. The very fact that these issues were
raised in clean-cut fashion is an indication of the tremendous strides 
that had been taken since Galileo. But most important of all, men
had learned the value of experiment and the fallacy of blindly following
“authority.” 


Third Period (A.D. 1800-1890): the Rise of 
Classical Physics 


1.22 The Nineteenth Century in Physics 


century. 

In mechanics, there was Hamilton, who discovered in the hamilto- 
nian equations a form of the equations of motion which is particularly 
valuable for attacking problems in quantum mechanics. The theory 
of motion of rigid bodies, including the gyroscope, was worked out, as
well as the mathematical theory of elasticity. Hydrodynamics, dealing
with the motion of fluids of all sorts, was developed. In the flow of
viscous fluids, however, only simple problems could be solved; extensive 
study of such fluids, by half-empirical methods, was not made until 
the present century, after the invention of the airplane. 

Work in other fields was more striking. The most significant dis- 
coveries and advances were the establishment of the kinetic theory of 
heat and the development of the kinetic theory of gases; the victory(?) 
of the wave theory of light over the corpuscular theory; the formulation 
of the general law of the conservation of energy; the discovery of the 
second law of thermodynamics; and above all the discovery, by Faraday 
and others, of the whole range of electromagnetic phenomena, culminat- 
ing in Maxwell's theory of the electromagnetic field and of light. 

Of these lines of advance we select three for brief discussion, 
choosing those which bear directly upon modern developments in physics; 
and to exemplify the great scientists of the period, we discuss Faraday 
and Maxwell. 


1.23 Heat and Energy 


The law of the conservation of energy is one of the most fundamental 
and far-reaching of all physical laws; yet it is of comparatively recent 
origin, for it was not announced until the middle of the nineteenth 
century. As exemplified in mechanics, it had been recognized during 
the eighteenth century, in the theory of the vis viva; but its announcement 
as a law of universal application awaited experimental work demon- 
strating the definite equivalence of heat and mechanical work. 

An early qualitative experiment bearing on the nature of heat was 
performed in 1798 by Count Rumford, an American who had fled to 
England in 1775 and eventually became a sort of military engineer to 
the Bavarian government. Impressed by the large amount of heat that 
was developed in boring cannon, he performed experiments indicating 
that this heat was too great to be accounted for plausibly by the caloric 
theory. He could find no loss of weight when the chips made by boring 
were included and showed that the specific heat of the chips was the 
same as that of the block from which they had come. He concluded 
that heat “cannot possibly be a material substance” such as caloric but 
must be a form of “motion.”

A more difficult experiment for the caloric theory to explain was 
one performed by Sir Humphry Davy, director of the Royal Institution, 
which had been founded by Count Rumford. Davy rubbed together
two pieces of ice in a vacuum surrounded by walls kept below the freezing
point and melted the ice. Here the mechanical work of rubbing
accomplished exactly the same effect that could have been produced by the
addition of heat from outside, yet there was no way in which caloric 
could have entered the ice. The majority of the supporters of the caloric 
theory were, however, unconvinced. Even Carnot (1796-1832), a pioneer 
in thermodynamics, based his reasoning on the caloric theory when he 
proposed the now famous cycle in 1824. 

In 1842, J. R. Mayer (1814-1878) published a paper in which,
partly on philosophical grounds, he announced the equivalence of heat 
and energy, and from data on the specific heats of a gas he deduced a 
value for the mechanical equivalent of heat. Meanwhile, Joule (1818- 
1889), in England, unacquainted with Mayer’s work, was carrying on 
experiments in which he converted the mechanical energy of a falling 
weight into heat by a paddle wheel revolving in water and thus determined
that 778 ft-lb of work would raise the temperature of 1 lb of water
1°F. Joule announced his results at a meeting of the British Associa- 
tion for the Advancement of Science in 1847. The paper would have 
passed almost unnoticed had it not been for William Thomson, later 
Lord Kelvin, who, grasping the real significance of the proposed theory, 
made the paper the event of the meeting. 
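
Joule's figure can be restated in the units commonly used today. The sketch below converts 778 ft-lb per pound of water per °F into joules per calorie, taking the calorie as the heat that raises 1 g of water by 1 °C. The conversion factors are standard values that do not appear in the text, and the function name is illustrative.

```python
JOULES_PER_FT_LBF = 1.3558179483  # one foot-pound of work, in joules
GRAMS_PER_POUND = 453.59237       # one pound of water, in grams
DEG_C_PER_DEG_F = 5.0 / 9.0       # size of one Fahrenheit degree, in Celsius degrees


def joules_per_calorie(ft_lb_per_lb_deg_f=778.0):
    """Mechanical equivalent of heat in J/cal, from Joule's ft-lb figure."""
    work_joules = ft_lb_per_lb_deg_f * JOULES_PER_FT_LBF
    # Heat that warms 1 lb of water by 1 °F, counted in gram-°C units (calories).
    heat_calories = GRAMS_PER_POUND * DEG_C_PER_DEG_F
    return work_joules / heat_calories


print(f"Mechanical equivalent of heat: {joules_per_calorie():.2f} J/cal")
```

The result is close to 4.19 J/cal, the value usually quoted for the mechanical equivalent of heat.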

Independently of the work of Mayer and of Joule, Helmholtz
(1821-1894) in 1847 read a paper before the Physical Society in Berlin 
on “Die Erhaltung der Kraft,” in which, on the basis of the impossibility 
of perpetual motion machines, he announced the law of the conservation 
of energy. The paper, rejected by the editor of the Annalen der Physik, 
was published in pamphlet form. 

The caloric theory could not withstand these attacks, and by 1850 
the mechanical theory of heat and the doctrine of the conservation of 
energy were generally accepted.

The second law of thermodynamics was announced by Clausius 
(1850) and, in another form, by Kelvin (1851), and in 1854 Kelvin pro- 
posed the thermodynamic scale of temperature. Thus was developed the 
highly successful classical theory of heat. We shall discuss later the 
facts that this theory, in turn, was unable to explain and how these 
facts led to the development of the quantum theory. 


1.24 Light 


The revival of the wave theory of light, begun by Thomas Young (1773- 
1829), is one of the important features of the history of the nineteenth 
century. Young pointed out that the division of a beam of light into a
refracted ray and a reflected ray at the interface between two media was
to be expected from the wave theory but had not been satisfactorily
explained by the corpuscular theory. In 1801, he presented to the Royal
Society a paper “On the Theory of Light and Colors,” in which he proposed
the principle
of the interference of two wave trains as an explanation of Newton's 
rings and the colors of thin plates. From Newton’s measurements of 
the thickness of the air layers necessary to produce the several colors, 
Young was enabled to compute wavelengths. In subsequent papers, 
he described the interference fringes which he had observed by placing 
hairs or silk threads in front of a narrow slit illuminated from the rear; 
he announced the change of phase on reflection; he explained diffraction 
bands by the principle of interference, showing that since the spacing 
of these bands gave values of the wavelength agreeing with those obtained 
from Newton’s rings, both phenomena must therefore be due to a common
cause. Again, quantitative measurements became an indispensable
link in the chain of reasoning. 

But the dogmatic spirit in regard to scientific matters was not yet 
dead. Young's paper aroused a storm of protest, even of derision and 
abuse. His chief assailant was Henry Brougham, afterward Lord 
Chancellor of England, who “reviewed” Young's papers in the Edinburgh
Review, writing:


We wish to raise our feeble voice against innovations that can have 
no other effect than to check the progress of science and renew all 
those wild phantoms of the imagination which Bacon and Newton 
put to flight from her temple. We wish to recall philosophers to the 
strict and severe methods of investigation . . . . 


In 1815, Fresnel (1788-1827) rediscovered the phenomenon of 
interference, performing the famous experiment with the two mirrors. 
A few years later he developed a mathematical theory of such phenomena
(1818-1819). He also explained the polarization of light by
assuming that the light vibrations in the ether are transverse to the direc- 
tion of propagation of the light rather than longitudinal. (This sugges- 
tion had already been made by Young in a letter to Arago written in 
1817.) Fresnel supported the explanation by showing experimentally 
that two plane-polarized beams of light cannot interfere at all if their 
planes of polarization are perpendicular to each other. Phenomena of 
polarization had been known to Newton, the polarization of light by
Iceland spar being discovered by Bartholinus in 1669. Newton had 
tried to fit these phenomena into the corpuscular theory by assuming 
a sort of structure in the corpuscles, but the explanation was not
convincing, and polarization had remained an enigma to both theories of
light. Fresnel’s explanation assumed that light was a mechanical-wave 
motion through a medium known as the luminiferous ether which had the 
properties of rigidity and density needed to transmit transverse waves.
(And yet the planets move through this ether with no measurable drag!)
On the basis of these mechanical waves Fresnel derived correct expressions 
for reflection and transmission coefficients as a function of incidence 
angle and polarization when light passes from one medium to another. 
It may seem surprising that correct equations could come from such 
a model, but both Fresnel’s mechanical waves and electromagnetic
waves involve equivalent wave equations, and analogous boundary
conditions are applied. Thus the precise nature of the waves plays no
significant role in determining reflection and transmission coefficients.

Experimental evidence for the wave theory continued to accumulate.
In 1850 Foucault performed a crucial experiment, showing with his
revolving-mirror apparatus that light travels more slowly in water than
in air, as predicted by the wave theory.

From 1850 until the end of the third period (1890), the wave theory
held the field undisputed. The frequent assertions that the corpuscular
theory was finally disposed of certainly seemed justified, particularly
after the development of Maxwell’s electromagnetic theory of light and
its experimental verification. Yet the corpuscular theory was not dead;
it was only sleeping.

Some important discoveries in light from 1800 to 1890 not previously
mentioned are:


Discovery (date)                                            Discoverer
Dark (Fraunhofer) lines in the solar spectrum (1801)        Wollaston
Three-color theory of vision (1807)                         Young
Rotary polarization of quartz (1811)                        Arago
Polarization of scattered light (1813)                      Arago
Rotary polarization by liquids (1815)                       Biot
Light sensitivity of silver bromide (1826)                  Balard
Change of conductivity of selenium on illumination (1837)   Knox
Doppler effect (1842)                                       Doppler
Foundation of spectral analysis (1859)                      Kirchhoff and Bunsen

1.25 Electricity and Magnetism 


The history of electricity during the nineteenth century is so extensive 
that even a sketchy outline would fill a small volume. We discuss 
little besides the fundamental discoveries of the opening decades and 
then the work of Faraday and Maxwell, which are closely related to 
each other and to recent developments in physics. 

While the mathematical theory of electrostatics and of magnetism 
was being elaborated by Laplace, Green, and Poisson, fundamental 
discoveries were made in regard to electric currents. In 1786 Galvani,
as a result of a chance observation that a frog’s leg kicked convulsively
when connected with the terminal of an electric machine, was led to an
extensive study of “animal electricity.” If the frog’s leg was so suspended
that the exposed nerves touched a metal plate, say silver, then a contrac- 
tion of the muscle occurred whenever the foot touched another metal, 
say iron. He even observed slight muscular contraction when both 
plates were of the same kind of metal. This led him to believe that 
the nerve was the source of electricity and that the metal served simply 
as conductor. Volta later found that potentials could also be produced 
using inorganic materials, and in 1800 he described the first battery for 
producing an electric current—the historically famous voltaic pile, con- 
sisting of alternate zinc and copper plates separated by blotting paper 
moistened with brine. He also described a battery consisting of cups 
containing brine or dilute acid connected by copper and zinc strips 
joined together. Volta ascribed the effect to the contact of two dissimilar 
metals. We now know, however, that the electromotive force is due to 
chemical action at the contacts of the metals with the electrolyte. 

This new source of electricity was received with great interest. 
A few weeks after hearing of Volta’s work, Nicholson and Carlisle 
discovered the decomposition of water by the electric current. Thinking 
to secure better contact between two wires forming part of the circuit, 
they had joined the ends of the wires by a drop of water. At once they 
observed the formation of a gas, which they recognized as hydrogen. 
This was the beginning of the study of electrolysis. During this same 
period the heating effect of the current and the arc light were discovered. 

It was early suspected that there was some relation between elec- 
tricity and magnetism, but the first significant discovery was made in 
1820 by Oersted, who found that a magnetic needle tends to set itself 
at right angles to a wire through which an electric current is flowing. 
Soon after, Biot and Savart discovered the law for the field of a long 
straight current, and toward the end of 1820 Biot proposed the formula

$$d\mathbf{B} = k I\,\frac{d\mathbf{s} \times \mathbf{r}}{r^{3}}$$

for the contribution to the magnetic induction B due to a current element
I ds, k being a constant that depends on the units used; this relation is
often miscalled “Ampère’s formula.” Soon after,
the brilliant French physicist Ampère (1775-1836) showed that a closed 
current is equivalent in its magnetic effects to a magnetic shell. Then, 
reversing his line of thought, he suggested that magnetism itself might 
be due to currents circulating in the molecule. He also discovered the 
action of a magnetic field on a current. Within 5 years of the first 
discovery, the foundations of electromagnetism had been laid. 
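
The current-element law can be checked against the field of a long straight current by direct summation. The sketch below takes the law in the form dB = k I (ds × r)/r³ with k = 10⁻⁷ T·m/A; the SI units, the wire geometry, and the function name are all assumptions made for illustration, not details from the text.

```python
import math

K_SI = 1.0e-7  # assumed SI value of the constant k, in T*m/A


def field_of_straight_wire(current, distance, half_length, segments=20000):
    """Sum dB = k I (ds x r) / r^3 over a straight wire lying on the z axis.

    The field point sits at (distance, 0, 0). Every element contributes along
    +y, so only that component is accumulated.
    """
    dz = 2.0 * half_length / segments
    total = 0.0
    for i in range(segments):
        z = -half_length + (i + 0.5) * dz  # midpoint of the current element
        # ds = (0, 0, dz) and r = (distance, 0, -z), so ds x r = (0, dz*distance, 0).
        r_cubed = (distance ** 2 + z ** 2) ** 1.5
        total += K_SI * current * dz * distance / r_cubed
    return total


numeric = field_of_straight_wire(current=1.0, distance=0.05, half_length=10.0)
long_wire = 2.0 * K_SI * 1.0 / 0.05  # field of an infinitely long wire, 2kI/d
print(f"summed: {numeric:.6e} T, long-wire formula: {long_wire:.6e} T")
```

For a wire much longer than the distance to the field point, the sum agrees closely with the inverse-distance law for a long straight current.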

a. Biographical Sketch  Michael Faraday was born in 1791 in a small
village near London. To help his mother provide for the family, he
started in 1804 as errand boy to a bookseller and stationer, and in the
following year he was formally apprenticed to his employer to learn
the art of bookbinding. Faraday made good use of his spare time by 
reading some of the books that passed through the shop. He was 
particularly interested in works on science, and in connection with his 
reading he showed one of the important characteristics of the great 
investigator-to-be by performing such of the simple experiments described 
“as could be defrayed in their expense by a few pence per week.”

Aside from his own reading, Faraday’s scientific education consisted
in a dozen lectures on natural philosophy by a Mr. Tatum and four lectures 
on chemistry by Sir Humphry Davy, in the winter of 1812. Submitting 
the very careful and neatly written notes which he made of these lectures 
“as proof of his earnestness,” he made bold to apply to Sir Humphry 
Davy for a position, however menial, at the Royal Institution, of which 
Davy was director. Davy was so pleased with the letter and the notes 
that in March, 1813, Faraday was engaged as apparatus and lecture 
assistant at 25 shillings per week. In October, 1813, he accompanied 
Sir Humphry and Lady Davy on a trip to the Continent, which took
them to many of the important scientific centers of Europe. Assistant 
though he was, Faraday impressed others because of his modesty, 
amiability, and intelligence; said one writer, “We admired Davy, we 
loved Faraday.” 

On returning to England, under Davy’s encouragement, Faraday 
began original investigations, initially in chemistry. From 1816 to 1819 
he published 37 papers concerned with such subjects as the escape of 
gases through capillary tubes, the production of sound in tubes by 
flames, the combustion of the diamond, and the separation of manganese 
from iron. About 1820, he began his electrical researches, which con- 
tinued for nearly 40 years. 

Almost his entire scientific life was spent at the Royal Institution. 
In 1825, he was made Director of the Laboratory. Declining offers of 
positions elsewhere which might have made him wealthy, he gave to 
his science and to the institution he served a devotion seldom equaled. 
The secret of his success, which brought him honors from all over the 
scientific world, is perhaps to be found in some excerpts from his many 
notes: 


Fig. 1.3 Michael Faraday.


Aim at high things, but not presumptuously.
Endeavor to succeed; expect not to succeed.
It puzzles me greatly to know what makes the successful philosopher.
Is it industry and perseverance with a moderate proportion
of good sense and intelligence? Is not a modest assurance or earnestness
a requisite? Do not many fail because they look rather to
the renown to be acquired than to the pure acquisition of
knowledge . . . ? I am sure I have seen many who would have been good
and successful pursuers of science, and have gained themselves a high 
name, but that it was the name and the reward they were always 
looking forward to—the reward of the world’s praise. In such there 
is always a shade of envy or regret over their minds and I cannot 
imagine a man making discoveries in science under these feelings. 


b. The Principle of the Motor Faraday had been interested in electro- 
magnetism since April, 1821, when Wollaston attempted, at the Royal 
Institution, to make a wire carrying an electric current revolve around 
its own axis when the pole of a magnet was brought near. The experi- 
ment was unsuccessful, but the phenomenon excited Faraday’s interest, 
and he determined to make a study of it. First, he read what had been
done by others and repeated many of their experiments. In the course
of these experiments, he observed that, when the magnetic pole was 
brought near the wire, “the effort of the wire is always to pass off at 
right angles from the pole, indeed to go in a circle around it... .”
On Sept. 4, 1821 he wrote in his laboratory notebook:


Apparatus for revolution of wire and magnet. A deep basin with
a bit of wax at bottom and then filled with mercury. A magnet
stuck upright in wax so that pole [is] just above surface of mercury.
Then piece of wire, floated by cork, at lower end dipping into mercury
and above into silver cup.


When a current passed through the wire, it revolved continuously around 
the magnet. This was the first electric motor! 


c. Electromagnetic Induction Oersted’s experiment and subsequent 
developments had clearly shown how “to produce magnetism by elec- 
tricity.” Faraday seems to have held it as one of the tenets of his scien- 
tific philosophy that every physical relation (of cause and effect) has its 
converse. If electricity can produce magnetism, then magnetism should 
produce electricity. His repeated attempts to accomplish this failed. 
In 1825, he tried what seemed to be the obvious converse by looking 
for an electric current in a helix of wire coiled around a magnet. Later, 
he tried to find a current in a wire placed near another wire carrying 
current. Other scientists were looking for similar effects but without 
success. They were all looking for the production of a steady current. 

But several times investigators were very near to the discovery of 
induced currents. In 1824, Arago observed the damping of the vibra- 
tions of a magnetic needle suspended over a copper plate. This obser- 
vation was extended by causing the needle to revolve by revolving the 
copper plate underneath it. It was shown that this “dragging” effect 
was greater, the greater the electrical conductivity of the spinning plate. 
Even the effect of radial slits in the copper disk, in reducing the dragging 
action on the magnet, was observed. Suggestive as these experiments 
were, the true explanation remained undiscovered. 

In the summer of 1831 Faraday attacked the problem a fifth time. 
Instead of placing a permanent magnet inside a helix, he procured a soft 
iron ring, 6 in. in external diameter, on which he wound two coils of 
copper, A and B, “separated by twine and calico.” To detect a possible 
current in coil B, he “connected its extremities by a copper wire passing 
to a distance and just over a magnetic needle.” When coil A was 
connected to a battery, there was “a sensible effect on the needle. It 
oscillated and settled at last in original position. On breaking connec- 
tion of side A with battery, again a disturbance of the needle.” Slight 
as these momentary effects were, Faraday recognized their importance, 
although he had been looking for a continuous effect. On Aug. 30, 
he writes, “May not these transient effects be connected with causes of 
difference between power of metals at rest and in motion in Arago’s 
experiments?” 




From this slender clue, Faraday proceeded rapidly to the discovery 
of the real effect. On the third day of his experiments, he wound a coil 
of wire around an iron cylinder and placed the cylinder so as to join the 
north pole of one permanent magnet with the south pole of another. The 
coil was connected to a galvanometer: 


Every time the magnetic contact at N or S was made or broken 
there was a magnetic action at the indicating helix [i.e., galva- 
nometer]—the effect being, as in former cases, not permanent but a 
mere momentary push or pull. 


On the fourth day, he showed that the presence of iron was not necessary: 
the effect could be produced by the action of one helix on another. 
On the fifth day: 


A cylindrical bar magnet . . . had one end just inserted into the
end of the helix cylinder, then it was quickly thrust in the whole
length and the galvanometer needle moved; then pulled out and
again the needle moved, but in the opposite direction. The effect was
repeated every time the magnet was put in or out, and therefore a
wave of electricity was so produced from mere approximation of a
magnet and not from its formation in situ.


At last he had “converted magnetism into electricity.” The essential
requisite was relative motion, or a change of condition. On the ninth
day, he produced a continuous current by turning a copper disk between
the poles of a powerful electromagnet, the periphery of the disk being
connected to its axis through an indicating galvanometer. This was the
now well-known Faraday disk dynamo, the first dynamoelectric machine.

Thus, after only a few days’ work in his laboratory, following
years of patient and persistent experiment, Faraday had discovered
a phenomenon for which the greatest scientists of his time had sought in
vain—electromagnetic induction.

Following this discovery, Faraday devised and tried various electric 
machines to test and extend his newly discovered principle. One of 
these machines, consisting of a rotating rectangle of wire with a commutator
attached, is the prototype of the modern dynamo. But his
interest was always in pure science, for he writes: 


I have rather, however, been desirous of discovering new facts and 
relations dependent on magnetoelectric induction, than of exalting 
the force of those already obtained; being assured that the latter 
would find their full development hereafter. 


Being unacquainted with mathematical symbols and methods,
Faraday always sought to explain his discoveries and to extend his
researches by purely physical reasoning. To the mathematician, the
law of attraction between magnetic poles,


$$F \propto \frac{m_1 m_2}{r^2},$$


may be a sufficient explanation of the phenomenon. To Faraday, this
gave a statement only of the magnitude of the magnetic forces; it left
the phenomenon itself quite unexplained. He insisted that two magnetic
poles, or two electric charges, could act on each other only if the medium
between the two played some important part in the phenomenon. This
insistence on the importance of the medium ultimately led him to the
concept of lines of force and of the “cutting” of these lines as essential
to electromagnetic induction. At first qualitative, this concept was
developed by Faraday into an essentially quantitative form, although
it was first stated in mathematical language by F. Neumann in 1845.


Commenting on Faraday’s laws of electromagnetic induction, Maxwell
wrote: 


After nearly a half-century . . . , we may say that, though the 
practical applications of Faraday’s discoveries have increased and are 
increasing in number and value every year, no exception to the state- 
ment of these laws as given by Faraday has been discovered, no new 
law has been added to them, and Faraday’s original statement 
remains to this day the only one which asserts no more than can be 
verified by experiment, and the only one by which the theory of the 
phenomena can be expressed in a manner which is exactly and numer- 
ically accurate, and at the same time within the range of elementary 
methods of exposition. 


d. The Laws of Electrolysis Faraday next turned his attention to 
proving that “Electricity, whatever may be its source, is identical in its 
nature.” He found, for example, that electricity from a friction machine 
would deflect a galvanometer and would cause chemical decomposition 
just as electricity produced by chemical action would. This led him into
the field of electrolysis. He found that many substances, e.g., certain
chlorides and sulfates, are nonconductors when solid but good conductors 
when melted and that in the molten state they are decomposed by the 
passage of current. This showed that water was not essential to elec- 
trolysis. To clarify description of his experiments, he introduced the 
terms electrode, anode, cathode, ion, anion, cation, electrolyte, electro- 
chemical equivalent, etc. A quantitative study of the phenomena 
resulted in his discovery of the laws of electrolysis that bear his name and 
which are the basis of all present-day work in that field. 


Faraday recognized that a definite quantity of electricity is associated
with each atom or ion in electrolysis. Had he been able to determine
the number of atoms in unit mass of any substance, he would have
anticipated, by 60 years, the determination of the fundamental charge e.
For he says:


Equivalent weights of bodies are simply those quantities of them
which contain equal quantities of electricity; . . . it being the elec-
tricity which determines the combining force. Or, if we adopt the
atomic theory or phraseology, then the atoms of bodies which are
equivalent to each other in their ordinary chemical action, have equal
quantities of electricity naturally associated with them.


e. The Conservation of Energy In connection with a proof of the fact 
that the electricity from the voltaic pile results from chemical action and 
not from mere contact of one substance with another, Faraday stated 
clearly the doctrine of the conservation of energy several years before 
the statement of Helmholtz. In 1840, he wrote: 


The contact theory assumes that a force which is able to overcome
a powerful resistance . . . can arise out of nothing. . . . This would
indeed be a creation of power, and is like no other force in nature.
We have many processes by which the form of the power is so
changed that an apparent conversion of one into the other takes
place. . . . But in no case is there a pure creation or a production
of power without a corresponding exhaustion of something to supply it.


f. The Faraday Effect Reference has already been made to Faraday’s 
abhorrence of the doctrine of action at a distance. He believed that 
if two electric charges attract each other, the medium between the two 
plays some important role. Presumably, therefore, the medium between 
two such charges is in a different state than it would be if the charges 
were not present; and if so, such an altered state should be detectable 
by observing the alteration in some physical property of the medium. 
As early as 1822, Faraday experimented with a beam of polarized light 
passing through a transparent solution carrying a current to see whether 
the current caused any depolarizing action. Although he repeated the
experiment several times in subsequent years, the results were all nega-
tive. In 1845, he returned to the problem, but still with negative
results. He then tried solid dielectrics between plates of metal foil
connected to a powerful electric machine to see whether, under electric
strains, they would show any optical effects. No results! In 1875,
this effect was found by Kerr.

Faraday then substituted a magnetic field for the electrostatic 
field to see whether the former would cause any depolarizing action on 
the beam of light. Various substances were tried but still with negative 
results. Finally, he placed in the magnetic field a piece of dense lead
glass. When the magnetic lines were parallel to the direction of the
beam of polarized light, he observed that the plane of polarization was 
rotated. At last, he had found a relation between magnetism and light. 
This magnetic rotation is now known as the Faraday effect. Again, his 
persistent search, maintained during 20 years of repeated failures, was 
rewarded by the discovery of an effect in the existence of which he had 
had sublime confidence. 


g. Miscellaneous Among Faraday’s other researches may be men- 
tioned numerous investigations in chemistry; the liquefaction of several 
gases formerly thought “permanent”; the diffusion of gases through 
solids; self-induction; certain fundamental properties of dielectrics; dia- 
magnetism; distinction between anode and cathode in the electric dis- 
charge through gases at low pressure; vibration of plates; regelation of 
ice; alloys of steel; and optical glass. 

Well may this simple, modest, self-taught philosopher be given a 
conspicuous place among the great benefactors of mankind. 


1.27 Joseph Henry (1799-1878) 


Any account of Faraday’s work, however brief, should be accompanied 
by at least a mention of the researches of the American physicist Joseph 
Henry, whose memory is honored by the name of the unit of inductance, 
the henry, which bears to electrokinetics a relation identical with that 
of the farad to electrostatics. Had Henry been able to experiment 
continuously, and with more resources, instead of only during a summer 
vacation of 1 month while teaching mathematics at Albany Academy,
and then only with such apparatus as he could make with his own hands, 
he would undoubtedly have anticipated Faraday in the discovery of 
electromagnetic induction, including the phenomena of self-induction. 
In all his work, furthermore, he was greatly hampered by his isolation 
from the scientific atmosphere of Europe. 

Henry was interested especially in the design and use of electro- 
magnets. He constructed the first electric motor operating by an electro- 
magnet, which rocked back and forth between two permanent magnets. 
He found that for maximum tractive effect, the cells of the battery and 
also the spools of the electromagnet should be connected in series if 
the magnet were a long distance from the battery, but they should be 
connected in parallel if the wires joining the magnet to the battery were
short. His work on electromagnets led directly to the commercial devel-
opment of the telegraph.


1.28 James Clerk Maxwell (1831-1879)


It would be difficult to pick out two eminent scientists whose beginnings
differed from each other more than did Maxwell’s and Faraday’s. 
Faraday came of very humble parentage; Maxwell, from a long line of 
distinguished ancestors. Faraday’s early life was lived almost in poverty; 
Maxwell’s family had abundant means. Faraday received only the 
most rudimentary education; Maxwell was given every advantage of 
school and university. They differed also in their aptitude for scientific 
work. Faraday was one of the greatest exponents of experimental 
science, whereas Maxwell, although an able experimenter, is one of the 
great figures in theoretical physics. And yet both made indispensable 
and mutually supplementary contributions to the classical theory of 
electromagnetics. 

Maxwell early showed extraordinary interest in both theoretical 
and experimental research in physics. He invented a means of drawing 
certain types of oval curves, and a few years later he published a paper on 
“The Theory of Rolling Curves” and another on “The Equilibrium of 
Elastic Solids”—all before he was nineteen years old! During these 
same years he was also busy with experiments of many sorts, especially 
in his little laboratory in a garret on the family estate at Glenlair, where 
he spent his vacations. 

After 3 years at the University of Edinburgh, Maxwell entered 
Trinity College, Cambridge, from which he graduated in 1854 with high 
honors. He spent 4 years at Aberdeen and was professor for 5 years 
at King’s College, London (1860-1865); from here some of his most 
important papers were published, such as “Physical Lines of Force” 
(1862) and his greatest paper, “A Dynamical Theory of the Electro- 
magnetic Field.” After a retirement of several years, he was elected 
in 1870 to the newly founded professorship of experimental physics at 
Cambridge. In this capacity, he superintended the planning and equipping
of the now famous Cavendish Laboratory, of which he was director
until his untimely death in 1879. 

A large proportion of Maxwell’s papers, over 100 in number, may be 
grouped under three headings: color vision, molecular theory, and elec- 
tromagnetic theory. 

The work on color vision was undertaken to make a quantitative
study of the physical facts pertinent to the theory of color sensations
proposed by Thomas Young, according to which any luminous sensation
is the result of exciting in the eye three primary sensations, red, green,
and violet. For this purpose Maxwell invented a “color box,” by means of
which he could mix spectral colors.


Fig. 1.4 James Clerk Maxwell.


Maxwell’s work on molecular physics is extensive. He discovered
and, in part, established theoretically the law of the distribution of
velocities among the molecules of a gas (Maxwell’s law). He showed
that when two gases are at the same temperature, the mean kinetic energy
of translatory motion of their individual molecules is the same in both
gases. From the kinetic theory of viscosity, he drew the surprising
conclusion that the viscosity of a gas is independent of density so long
as the mean free path is not too large, and he verified this conclusion
by experiment. He brought to bear upon the whole subject mathe-
matical methods “far in advance of anything previously attempted on
the subject”; indeed, he is the cofounder with Clausius (1822-1888)
of the kinetic theory of matter.

In electromagnetic theory, Maxwell’s great contributions were the
“displacement currents” and the formulation of the general equations of
the electromagnetic field, which led to the electromagnetic theory of
light. In the preface to his treatise “Electricity and Magnetism,” he
remarks, “Before I began the study of electricity I resolved to read no
mathematics on the subject till I had first read through Faraday’s
‘Experimental Researches on Electricity.’” He became convinced that
Faraday was right in regarding the dielectric as the true seat of electrical
phenomena and in supposing that it acted by becoming electrically 
polarized, the positive ends of its molecules pointing on the whole with 
the field and the negative ends in the opposite direction. (The term 
dielectric, as used here, must be understood to include a tenuous medium, 
or ether, filling all space, even in what we call a vacuum.) He drew 
the conclusion that when the polarization changes, this change must 
involve a displacement of electricity, and so there must exist in the 
dielectric, while the change is going on, a current having the same mag- 
netic properties as the current in a conductor. 

This assumption of displacement currents opened the way for the 
deduction of Maxwell's famous equations of the electromagnetic field. 
It is interesting, however, that he was first led to these equations through 
a mechanical analogy, i.e., in studying the behavior of a mechanical sys- 
tem filling all space, which would be capable of causing the observed 
electrical and magnetic phenomena. He showed (1862) that his hypo- 
thetical medium would be capable of transmitting transverse vibrations 
with a speed equal to the ratio of the electromagnetic to the electrostatic 
unit of charge. Although he did not take his model too seriously, he 
nevertheless remarks that the ratio of the units 


. . . agrees so exactly with the velocity of light calculated from the
optical experiments of M. Fizeau, that we can scarcely avoid the
inference that light consists in the transverse undulations of the same
medium, which is the cause of electric and magnetic phenomena.


The theory was restated, without reference to any particular model, in 
his great paper of 1864, in which he says:


The theory which I propose may therefore be called a theory of the
Electromagnetic Field, because it has to do with the space in the
neighborhood of the electric or magnetic bodies, and it may be called
a Dynamical Theory, because it assumes that in that space there is
matter in motion by which the observed phenomena are produced.


In 1873, Maxwell published his “Treatise on Electricity and Magnetism,” 
one of the most important books in all science. 

The physical ideas underlying Maxwell’s new theory were left none 
too clear by him. In his treatise, we find the assumption that all space 
is full of incompressible “electricity”; in a conductor this electricity 
can move freely (except for ohmic resistance), thus constituting an 
electric current, but in a dielectric “there is a force which we have called
electric elasticity which acts against the electric displacement and forces
the electricity back when the electromotive force is removed.” This
is clear enough. But what is the origin of this electromotive force in
empty space? And in what does electrification consist? Maxwell
seems to have supposed that when a conductor is charged by allowing
a current to flow onto it through a wire, what really happens is that 
electricity flows into the conductor along the wire and displaces some 
of the electricity out of the conductor into the space surrounding it. 
Thus a charged conductor would contain neither more nor less electricity 
than an uncharged one. Such a hypothesis concerning electrification 
became improbable when in 1876 the American physicist Rowland showed 
that a moving charged conductor is surrounded by a magnetic field, the 
moving electrification evidently constituting a current. 

The mathematical theory, on the other hand, was slowly developed 
by others, especially by H. A. Lorentz, and was shown to give a good 
account of electric and magnetic phenomena and of the principal prop- 
erties of light. In Germany, stimulated by Helmholtz, Hertz set out 
to search experimentally for the magnetic effects of Maxwell’s displace- 
ment currents, and in 1887 he produced electromagnetic waves from 
electric circuits. Later it was shown that the speed of propagation of 
these waves is the same as that of light. Speculation as to the nature 
of the displacement currents in a vacuum gradually died out, until today 
it is usual to speak only of electric and magnetic fields governed by the
Maxwell-Lorentz equations. This development in electromagnetic the-
ory illustrates a tendency, notable during the last century but regretted
by many, for physics to become highly abstract and mathematical.


1.29 Clouds over Classical Physics 


Before the end of the nineteenth century physics had evolved to the 
point at which mechanics could cope with highly complex problems 
involving macroscopic situations, thermodynamics and kinetic theory 
were well established, geometrical and physical optics could be understood 
in terms of electromagnetic waves, the atomic foundation of chemistry 
had been laid, the conservation laws for energy and momentum (and 
mass) were widely accepted, and classical physics had reached a proud 
maturity. So profound were these and other developments that not a 
few physicists of note believed that all the important laws of physics 
had been discovered and that, henceforth, research would be concerned 
with clearing up minor problems and, particularly, with improvements 
of methods of measurement. At that time few could foresee that the 
world of physics was on the eve of a series of epoch-making discoveries 
destined on the one hand to stimulate research as never before and on 
the other to usher in an era of the application of physics to industry
on a scale previously unknown.

Late in the century x-rays and radioactivity were discovered. No
one had yet been able to explain quantitatively photoelectric phenomena,
blackbody radiation, and the origin of spectral lines. Nevertheless,
many believed that some future genius would find the answers in classical 
physics. As late as 1899 Michelson said: 


The more important fundamental laws and facts of physical 
science have all been discovered, and these are so firmly established 
that the possibility of their ever being supplanted in consequence of 
new discoveries is exceedingly remote. . . . Our future discoveries 
must be looked for in the sixth place of decimals. 


But others were not so sure. In a lecture delivered in 1900 Lord 
Kelvin began with the words, “The beauty and clearness of the dynamical
theory, which asserts light and heat to be modes of motion, is at present 
obscured by two clouds.” The first of these clouds was the question of 
“how the earth [can] move through an elastic solid such as essentially 
is the luminiferous ether” (in which electromagnetic waves were assumed 
to propagate). The second was the failure of the “Maxwell-Boltzmann 
doctrine regarding the equipartition of energy” to predict results con- 
sistent with experiment in all cases. 

Neither of these clouds was destined to be dissipated by classical
approaches; new ideas were required. The first evaporated quickly
when Einstein advanced the theory of relativity, introduced in the next
chapter. The second evaporated more slowly, requiring not only Planck’s
introduction of the energy quantum hν but also the evolution of a new
mechanics.


Introduction to Relativity


The theory of relativity put forward by Einstein in 1905 represented
a vast revolution in physical thought. Relativity touches all branches of 
modern physics and plays a major role in many. 


2.1 Galilean-Newtonian Relativity 


Motion has been a subject of speculation since ancient times. It was 
early recognized that all motion involves displacement relative to some- 
thing or other, but ideas have varied in regard to the entity relative 
to which the displacement occurs. In his treatise on mechanics, Newton
says that “absolute motion is the translation of a body from one absolute
place to another absolute place,” but he does not explain what he means
by “absolute place.” He states explicitly that translatory motion can
be detected only in the form of motion relative to other material bodies.

Motion involves, also, the passage of time. Until Einstein, time
was regarded as something entirely distinct from space or from the
behavior of material bodies. Newton says, “Absolute, true, and mathe-
matical time, of itself, and by its own nature, flows uniformly on, without
regard to anything external.” Thus there was supposed to be a single
time scale valid everywhere.


Consider an event such . . . or the decay of a radioactive . . .
of an event we make use of a reference . . . be measured. To define
a time, . . . process, e.g., the rotation of the earth, . . .
specified. The material means of fixing . . . with the methods adopted
for using them . . . time frame of reference. When Newton . . .
in any particular frame of reference . . . Einstein called such frames
galilean . . . galilean law of inertia holds in them. . . . newtonian
frames would be a more . . . is inertial only if Newton’s laws of
motion . . .


Fig. 2.1 Inertial frame S’ moving with constant velocity V along the
x axis of inertial frame S.


Now suppose we have two frames of reference each in uniform
translatory motion relative to the other. Let us call the two frames S
and S’ and let the velocity of S’ relative to S be V. Let coordinates
and times of any event obtained when the frame S is used be denoted
by x, y, z, t and those obtained for the same event when S’ is used by
x’, y’, z’, t’. To make the relation between these variables as simple as
possible, let us choose our axes so that the x and x’ axes are parallel to
V and thus slide along each other; let the y’ and z’ axes be parallel to
y and z, respectively (Fig. 2.1). Let us also count time from the instant
at which the origins of the coordinates O and O’ coincide. Then the
coordinates of O’ as measured in frame S are x = Vt, y = 0, z = 0.
If an event occurs at a position and time specified by x, y, z, and t in
frame S, according to classical physics the coordinates of the event in
frame S’ are x’, y’, z’, t’, where


$$x' = x - Vt \qquad y' = y \qquad z' = z \qquad t' = t \qquad \text{galilean transformation} \tag{2.1}$$


These equations are the space-time transformation of classical 
mechanics. They enable us to pass from space-time coordinates of 
events in one reference frame to the space-time coordinates referred to 
another frame moving with constant velocity V relative to the first. 
Under the galilean transformation the laws of classical mechanics are 
the same in the two frames S and S’, since mechanical forces and accelera-
tions are the same in both frames. As we shall see, the classical laws
are approximate only. 
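
As a minimal sketch of Eqs. (2.1), the Python fragment below applies the galilean transformation to an event and checks numerically that an acceleration comes out the same in S and S’. The function names, the sample speed V, and the uniformly accelerated test trajectory are illustrative assumptions and do not come from the text.

```python
def galilean_transform(x, y, z, t, V):
    """Coordinates in S' of an event (x, y, z, t) in S; S' moves at speed V along x."""
    return (x - V * t, y, z, t)


def second_difference(f, t, h=1e-3):
    """Central finite-difference estimate of the second derivative of f at t."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / h**2


V = 3.0   # speed of S' relative to S in m/s (illustrative assumption)
A = 2.0   # acceleration of a test particle in m/s^2 (illustrative assumption)


def x_in_S(t):
    """Position of a uniformly accelerated particle as measured in S."""
    return 0.5 * A * t**2


def x_in_S_prime(t):
    """Position of the same particle in S', from x' = x - V t."""
    return galilean_transform(x_in_S(t), 0.0, 0.0, t, V)[0]


# The origin O' sits at x = V t in S, so it maps to x' = 0 in S'.
print(galilean_transform(V * 2.0, 0.0, 0.0, 2.0, V))

# The term -V t is linear in t, so it drops out of the second derivative:
# both estimates should agree to within finite-difference error.
print(second_difference(x_in_S, 1.0), second_difference(x_in_S_prime, 1.0))
```

Because the two frames differ only by a term linear in t, any quantity built from second time derivatives of position is unchanged, which is the sense in which accelerations agree in S and S’.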


2.2 Galilean Relativity and Electricity 


The laws of classical electromagnetism are not compatible with the
galilean transformation. For example, imagine a charge Q a distance a
from an infinite wire bearing a charge $\rho_l$ per unit length, with both Q
and the wire at rest in inertial frame S (Fig. 2.2). The charge Q experi-
ences an electric field $\rho_l/2\pi\epsilon_0 a$ and no magnetic field, since the
charges are at rest.


Fig. 2.2 A charge Q a distance a from an infinite wire.


If S’ is moving to the right relative to S with speed V, all charges
have a speed V to the left in S’, and Q is then moving in a magnetic 
field given by


$$B = \frac{\mu_0 I}{2\pi a} = \frac{\mu_0 \rho_l V}{2\pi a} \qquad \text{galilean transformation}$$


According to the Lorentz equation, F = Q(E + v × B), so that the
force on Q in frame S’ is the resultant of that due to the electric intensity
(which is the same in S’ as in S) and that due to the magnetic field
above. Hence the charge Q experiences a net force in frame S’


$$F' = \frac{Q\rho_l}{2\pi\epsilon_0 a} - \frac{\mu_0 Q \rho_l V^2}{2\pi a} \qquad \text{galilean}$$


while in frame S


$$F = \frac{Q\rho_l}{2\pi\epsilon_0 a} \qquad \text{galilean}$$


Clearly, classical electromagnetic theory, together with galilean rela- 
tivity, predicts different forces on Q in the two inertial frames; on the 
other hand, Eqs. (2.1) predict the same acceleration. Thus something 
is not right—either in the galilean transformation or in classical electro- 
magnetism or in newtonian mechanics. We say that the laws of classical 
electromagnetism, together with newtonian mechanics, are not invariant 
under the galilean transformation. 
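
The size of the discrepancy can be checked numerically. The sketch below assumes SI units and evaluates the force on Q in each frame: the electric force alone in S, and the electric force less the magnetic attraction QVB in S’, with B the field of the current formed by the line charge moving at speed V. The function names and the values chosen for Q, the line charge density, a, and V are illustrative assumptions only.

```python
import math

EPS_0 = 8.8541878128e-12    # permittivity of free space, F/m
MU_0 = 4.0e-7 * math.pi     # permeability of free space, H/m (approximate value)


def force_in_S(Q, rho_l, a):
    """Electric force on Q at distance a from the line charge, all at rest in S."""
    return Q * rho_l / (2.0 * math.pi * EPS_0 * a)


def force_in_S_prime(Q, rho_l, a, V):
    """Electric force minus the magnetic force Q V B when all charges move at speed V."""
    B = MU_0 * rho_l * V / (2.0 * math.pi * a)   # field of the current rho_l * V
    return force_in_S(Q, rho_l, a) - Q * V * B


# Illustrative values (assumptions): charge in C, line density in C/m, distance in m.
Q, rho_l, a = 1.0e-9, 1.0e-6, 0.1
V = 3.0e7                               # speed of S' relative to S, m/s
c = 1.0 / math.sqrt(MU_0 * EPS_0)       # speed of light from the two constants

ratio = force_in_S_prime(Q, rho_l, a, V) / force_in_S(Q, rho_l, a)

# The two printed numbers should agree: the forces differ by the factor 1 - V^2/c^2.
print(ratio, 1.0 - (V / c) ** 2)
```

The ratio of the two forces depends only on V and on the product of the two constants, so the disagreement between the frames is of order V²/c² and is negligible at everyday speeds.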


2.3 Relativity and the Propagation of Light 


With the adoption of the wave theory of light, a new element was
brought into the problem of motion. For if light consists of waves in
an ether (Sec. 1.24), these waves should have a definite speed relative to
the ether, and their speed relative to material bodies should change when
the motion of these bodies through the ether varies. Analogous state-
ments . . . small compared with the velocity of light; however, the speed of the
earth in its orbital motion about the sun is about one ten-thousandth of
the speed c of light in free space.

An interesting case to consider is the formation of images by the 
object lens of a telescope. Suppose a light wave from a star enters
the telescope sketched in Fig. 2.3. When the telescope is at rest in the 
ether, let the wave come to a focus so as to form a star image on the 
cross hairs at P. When the telescope is moving toward the star, the 
wave might be expected to focus on the same point P in the ether as 
before. While the wave is passing from the lens to this point, however, 
the telescope moves forward, carrying the cross hair to some other 
point P’. The eyepiece would therefore have to be drawn out farther 
in order to focus on the image of the star. Thus the focal length of the 
telescope would appear to be increased. Similarly, if the telescope were 
moving in the same direction as the light, its effective focal length would 
be shortened. Thus, as an astronomical telescope is pointed at stars
in different directions, its apparent focal length might be expected to 
vary slightly because of the earth’s orbital motion. This effect was 
looked for long ago by Arago but in vain. 

This and other negative results led Fresnel in 1818 to propose that 
moving transparent bodies may partially drag the light waves along 
with them. In the case just considered, if the lens L were to drag the 
light with it in its motion (toward the right in Fig. 2.3), the part of the 
wave that goes through the center of the lens would spend a longer time 
in the lens, and hence would be retarded more than it would be if there 
were no motion. Consequently the wave would emerge from the lens 
more concave in shape and would focus on a point nearer the lens. If 
the amount of the drag were just right, there would be no effect at all 
on the apparent focal length, the star image falling on the cross hair 
however the telescope might be moving. Fresnel showed that all effects
on phenomena of refraction would be prevented if it were a law of optics
that any moving transparent medium of refractive index n changed the
velocity of light in such a way as to add vectorially to its velocity in the
stationary medium the fraction

$$1 - \frac{1}{n^2}$$

of the velocity of the medium. That is, in a medium moving with
vector velocity u, the vector velocity of light in any given direction is the
sum of a vector of magnitude $(1 - 1/n^2)u$ and a vector of magnitude $c/n$.

Fig. 2.3 Effect of the motion ...

In 1851 Fizeau showed, by an interference method, that a moving 
column of water does drag the light waves with it to the extent required by 
Fresnel’s hypothesis! The cause of the drag was assumed by Fresnel to 
lie in an actual partial dragging of the ether itself along with the moving 
medium. When Lorentz worked out his electromagnetic theory, however,
about 1895, on the assumption of a stationary ether, he found that
the theory led automatically to Fresnel’s formula for the light drag.
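As a rough numerical illustration of Fresnel's formula, the sketch below evaluates the drag fraction $1 - 1/n^2$ and the extra speed imparted to light travelling along the flow. The refractive index of 1.333 for water and the flow speed of 7 m/s are illustrative assumptions, not values taken from the text, and the function names are hypothetical.

```python
C = 299_792_458.0  # speed of light in vacuum, m/s

def drag_fraction(n):
    """Fresnel's drag fraction 1 - 1/n^2 for a medium of refractive index n."""
    return 1.0 - 1.0 / n**2

def light_speed_in_moving_medium(n, u):
    """Speed of light travelling parallel to a medium that moves at speed u."""
    return C / n + drag_fraction(n) * u

n_water = 1.333   # assumed refractive index of water
u_flow = 7.0      # assumed flow speed, m/s

print(drag_fraction(n_water))
# Extra speed gained from the partial drag, compared with water at rest:
print(light_speed_in_moving_medium(n_water, u_flow) - C / n_water)
```

For water the fraction comes out a little under one-half, so the light picks up less than half of the speed of the flow, which is why an interference method was needed to detect it.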

No effect of the earth’s orbital velocity on terrestrial optical phe- 
nomena has been observed. It is well known that annual changes in the 
direction of the earth’s orbital motion cause a variation in the apparent
positions of the stars, called stellar aberration (Sec. 1.19), but that is 
another story. It may be doubted whether the early experiments were 
sufficiently precise to detect a possible effect due to the earth’s motion, 
but at least one test of adequate delicacy was made in 1871 by Airy, 
who filled a telescope tube with water and observed that star images 
were formed in the same position with or without the water, in spite of 
the longer time required for the light to reach the focal plane when 
traveling through water instead of air. 

Fresnel and Lorentz showed that there should never be any effect 
on optical phenomena that is of the first order in the velocity of the 
apparatus through the ether. There might, however, be second-order
effects. Since the square of the ratio of the earth’s orbital velocity to the
speed of light is only $10^{-8}$, such effects would be difficult to detect. In seeking a sufficiently
delicate means of observation, Michelson invented his interferometer, 
with which in 1887 he and Morley performed a famous experiment which 
could have detected a second-order effect if it had been present. 


2.4 The Michelson-Morley Experiment 


In the interferometer arrangement used in this experiment (Fig. 2.4) a
beam of light from a lamp L falls upon a half-silvered glass plate P,
placed at 45° to the beam, which divides each wave into two parts.
One partial wave, reflected from P, travels off sideways to a mirror $M_1$,
by which it is reflected back to P; part of it is then transmitted through
P and enters the telescope T. The other part of the original wave,
transmitted at once through P, travels ahead and is reflected back by a
second mirror $M_2$; upon returning to P, it is partially reflected into the
telescope on top of the first part of the wave, with which it forms an
interference pattern.

Fig. 2.4 The Michelson-Morley experiment.

Both mirrors are at the same distance from the plate P. If the
apparatus is at rest in the ether, the two waves take the same time to
return to P and meet in phase both there and in the telescope. Suppose,
however, that the apparatus is moving with speed V through the ether
in the direction of the initial beam of light. Then, if the initial wave
strikes the plate P when it has the position shown in the figure, the
paths of the waves and the subsequent positions of reflection from mirrors
and plate will be as shown by the dotted lines. The necessary change
in the direction of the transverse beam is produced automatically, as an
aberration effect, through the operation of Huygens’ principle. But now
the times taken by the two waves on their journeys are no longer equal.
The wave moving longitudinally toward $M_2$ has a velocity relative to the
apparatus of $c - V$ on the outgoing trip, c being the speed of light
through the ether, and $c + V$ on the return trip; hence the time required
by this wave is

$$t_2 = \frac{l}{c - V} + \frac{l}{c + V} = \frac{2lc}{c^2 - V^2} = \frac{2l}{c}\left(1 + \frac{V^2}{c^2} + \cdots\right)$$

l being the distance from the plate to either mirror. The wave moving
transversely, on the other hand, travels along the hypotenuse of a right
triangle having a side of length l. Let it take a time $t'$ to go from P
to $M_1$, traveling a distance $ct'$. Then in the same time the mirror $M_1$
advances a distance $Vt'$. Hence, since $c^2t'^2 = V^2t'^2 + l^2$, this wave
returns to P after a time

$$t_1 = 2t' = \frac{2l}{\sqrt{c^2 - V^2}} = \frac{2l}{c}\left(1 + \frac{V^2}{2c^2} + \cdots\right)$$

Thus the two waves interfere in the telescope with a phase difference of
$2\pi c(t_2 - t_1)/\lambda \approx 2\pi lV^2/c^2\lambda$ rad ($\lambda$ being the wavelength of the light),
and the fringe pattern is shifted by the motion through $V^2l/c^2\lambda$ fringes.

In performing the experiment, the whole apparatus, floated on
mercury, was rotated repeatedly through 90°. Since the two light paths
are caused to exchange roles by such a rotation, it should cause the
fringe pattern to shift twice as much; by reflecting the beam back and
forth several times, the effective length l was increased to 11 m. Even
then, with a wavelength of about $5.9 \times 10^{-5}$ cm, if we insert for V the
whole orbital velocity of the earth so that $V/c = 10^{-4}$, we find a shift
of only

$$N = 2 \times 10^{-8} \times \frac{1100}{5.9 \times 10^{-5}} = 0.37 \text{ fringe}$$

Michelson and Morley were sure that they could detect a shift of a
hundredth of a fringe. Such shifts as were observed amounted only to a
small fraction of the theoretical value and were not consistent. Thus the
result of the experiment was negative; the expected effects of motion
through the ether did not appear.
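The arithmetic behind the expected shift can be reproduced in a few lines. This is only a sketch using the figures quoted above (effective length 11 m, wavelength $5.9 \times 10^{-5}$ cm, $V/c = 10^{-4}$); the function name `fringe_shift` is hypothetical.

```python
def fringe_shift(path_length, wavelength, beta):
    """Expected shift, in fringes, when the interferometer is turned through 90 degrees.

    A single orientation gives l * (V/c)^2 / wavelength fringes; the rotation
    exchanges the roles of the two arms and so doubles the shift.
    """
    return 2.0 * path_length * beta**2 / wavelength

l_eff = 11.0          # effective arm length, m
wavelength = 5.9e-7   # 5.9e-5 cm expressed in m
beta = 1.0e-4         # V/c for the earth's orbital speed

N = fringe_shift(l_eff, wavelength, beta)
print(round(N, 2))
```

Because the shift varies as $(V/c)^2$, an ether drift of one-tenth the orbital velocity would produce only a hundredth of this value, which shows how delicate the measurement had to be.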

It might happen, to be sure, that at a given moment the earth by 
accident had no resultant component of velocity parallel to the surface of 
the earth; for upon its orbital motion there would probably be super- 
posed a motion of the entire solar system through the ether. But, if 
this happened at a certain time, then 6 months later the earth’s orbital 
velocity about the sun would be reversed, and its velocity through the 
ether should be twice its orbital velocity. 

Observations made by the Michelson method at various times of 
day and at different seasons of the year by physicists in different parts 
of the world always yielded the same negative result. In 1930 Joos 
in Germany found no evidence of motion through an ether with an
interferometer capable of detecting an ether drift one-twentieth of the orbital
velocity of the earth. In 1958 Cedarholm and Townes devised a new
technique using masers, with which an ether drift of one one-thousandth
the orbital velocity could have been observed, but no evidence of any 
drift was found. 

The failure to detect the anticipated motion of the earth through 
the ether was an important experimental result, very hard to bring 
into harmony with existing theories of light and matter. The theoretical 
calculation rests on a peculiarly simple foundation, for the only properties 
of light that are made use of are the constancy of its velocity in space 
and Huygens’ principle. Classically, three possible explanations of the 
negative result seem to offer themselves. 

1. Perhaps the earth drags the ether with it, much as a moving 
baseball carries along the air next to it. On this assumption there would 
never be any motion of the earth through the ether at all, and no diffi- 
culties could arise. One objection to this explanation is that the ether 
next to the earth would then be in motion relative to the ether farther 
away; and this relative motion between different parts of the ether 
would cause a deflection of the light rays coming from the stars, just as 
wind is observed to deflect sound waves. This deflection would alter 
the amount of the stellar aberration. It was difficult to devise a plausible
motion for the ether which would give a value of the aberration agreeing 
with observation and yet preserve the negative result of the Michelson- 
Morley experiment. A second objection arises from the fact that, as 
has already been stated, a transparent object of laboratory size does 
not drag the light waves with the full velocity of the moving matter, 
as it necessarily would do if it completely dragged the ether along with 
it; and the observed partial drag is fully accounted for by current electro- 
magnetic theory. 

2. As an alternative, we could assume that light projected from a 
moving source has added to its own natural velocity the velocity of the
source, just as the velocity of a projectile fired from a moving ship is 
equal to the vector sum of its velocity of projection from the gun and 
the velocity of the ship. If this were true, the negative result obtained 
by Michelson and Morley would at once be explained, for the light from 
their lamp would have always the same constant velocity relative to 
the lamp and to the interferometer. Such an assumption, however, is 
in gross conflict with the wave theory of light. It is of the essence of 
waves in a medium that they have a definite velocity relative to the 
medium, just as sound waves have a definite velocity relative to the air. 
Furthermore, there is strong experimental evidence against the assumption 
in question, which we have not space to describe. 

3. The third possible explanation, put forward independently by
FitzGerald and by Lorentz, is that motion through the ether might in
some manner cause the material composing the interferometer to shorten
in a direction parallel to the motion. Such a contraction in the ratio
$\sqrt{1 - V^2/c^2}$ would serve to equalize the light paths and thus to prevent
a displacement of the fringes. This explanation was shown to be inadequate
in 1932 by Kennedy and Thorndike, who found no shift even when
the two arms of the interferometer had grossly different lengths.


2.5 The New Relativity of Einstein 


From the situation just described one easily gains the impression that 
there exists something like a conspiracy in nature to prevent us from 
detecting motion through the ether. A similar situation can be shown
to exist in the field of electromagnetism as well as in optics. A number
of electrical or magnetic experiments can be invented which, at first
sight, offer promise of revealing motion through the ether; but always 
there occurs some other effect which just cancels the effect sought. 

In reflecting upon this extraordinary situation, Einstein arrived in
1905 at a radically new idea. He proposed that motion through an
ether filling empty space is a meaningless concept; only motion relative to
material bodies has physical significance. He then considered how this
assumption could be made to harmonize with the known laws of optics. 
Possibilities of conflict arise in any argument involving the velocity of 
light. Consider, for example, a frame of reference S’ (say on the earth) 
moving relative to another frame S (say on the sun), and suppose that S’ 
carries a source of light. Then light from this source must move with 
the same velocity relative to S as light from a source on S itself, since, 
as explained above, we cannot suppose that the velocity of light is influ- 
enced by motion of its source. But this light must also move with the 
same velocity relative to S’; otherwise the laws of nature would not be 
the same on S’ as on S, and no reason could be assigned for their being 
different. Yet it seems quite impossible that light should move with 
the same velocity relative to each of two frames that are moving relative 
to each other! 

Einstein accordingly put the laws of the propagation of light in the 
forefront of the discussion. He based his new theory, which is known 
as the special or restricted theory of relativity, upon two postulates which 
may be stated as follows:

1. The laws of physical phenomena are the same when stated in terms of either of
two inertial frames of reference (and can involve no reference to motion through an ether);
no experiment can be performed which can establish that any frame is at absolute rest. 

2. The speed of light is independent of the motion of its source and is the same for 
observers in any inertial frame. 


Of these two postulates, the second is believed to represent an
experimental fact, whereas the first is a generalization from a wide range 
of physical experience. The first postulate is in no way self-evident; 
like the assumptions made in all physical theories, it is a hypothesis to be 
tested by experiment. Further, some of the apparently well-established 
laws of classical physics, such as the law of conservation of mass, fail 
to meet the requirement of the first postulate. Indeed, the usual form 
of Newton's second law of motion proves to be incompatible (see Sec. 
3.3). 


2.6 Simultaneity and Time Order 


Einstein found the key by which these two postulates could be brought 
into harmony by giving up the newtonian conception of absolute time. 
Newton had supposed that it is always possible to say unambiguously 
which of two events precedes the other or that they occur simultaneously. 
But how can this be done if the two events occur in widely separated 
locations? 

In practice, times at two different places are compared by reference 
to a clock at each place. It is necessary to synchronize these clocks, 
which in modern practice is done by means of electromagnetic signals. 
For precision, however, it is necessary to correct for the time required for 
light to travel from one place to the other; and making this correction 
requires a knowledge of the velocity of light in one direction. Ordinary 
measurements of the velocity of light furnish only its average velocity 
in two opposite directions. We could measure its velocity in one direction 
if we had our clocks already synchronized, of course, but this leads into 
a vicious circle. 

As an alternative, we might carry a chronometer from one place 
to the other and set both clocks by comparison with the chronometer. 
But how could we prove experimentally that the chronometer runs at a 
constant rate while moving in various directions? Every method that 
can be devised to prove this or to measure the velocity of light in one 
direction turns out to rest upon some fresh assumption that cannot be 
tested in advance. It follows that a general time scale can be set up
only on the basis of an artificial convention of some sort. The simplest 
procedure is the usual one of assuming the velocity of light to be the 
same in any one direction as in the opposite direction. We shall see, 
however, that a time scale set up on this basis varies somewhat according 
to the frame of reference that is used in measuring velocities; even the 
time order of two events may be different in some cases. 


56 Introduction to Modern Physics


Before developing the mathematical theory further, the following
additional remarks may be of interest. Consider only events that happen
at various points along a straight track, so that the situation can be
diagrammed in two dimensions. On a plot (Fig. 2.5) let horizontal
distances represent displacements along the track and vertical distances
lapses of time. Consider an observer at the point $P_0$ at time $t_0$. Through
$P_0$ draw two lines representing the progress of two light signals traveling
in opposite directions along the track which pass $P_0$ at time $t_0$. Then
points on the plot in the area above these two lines will represent possible
events which, at time $t_0$, certainly lie in the future for any observer at
$P_0$. These events occur late enough so that the observer still has an
opportunity to influence them causally, perhaps by sending a radio
signal. Below both lines, on the other hand, lies the observer’s definite
past. This area consists of events on the track which may have had a
causal influence on events in his neighborhood at time $t_0$ and of which
he may have already acquired knowledge at that time by means of radio
signals. Between the lines, however, lies also a third region, representing
what might be called the observer’s physical present. At time $t_0$ it is too
late for him to influence events plotted in this region, but also too early
for him to have any observational knowledge of them. For events
located at a given point of the track, the observer’s physical present
covers a range of time, such as QR, that increases with increasing
distance from $P_0$. If the observer is in New York, this interval is about
0.03 s for events in San Francisco (the time required for light to make
the double journey); it is about 16 min for events on the sun. Here we
have an essential difference between Einstein’s relativity and Newton’s;
for in the older theory the physical present was reduced to a single line.

Fig. 2.5 The classification of space-time relations.
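The three regions of Fig. 2.5 can be expressed as a simple test on the distance and time offset of an event from the observer at $P_0$, $t_0$. The sketch below is illustrative only: the function names are hypothetical, and the earth-sun distance of $1.496 \times 10^{11}$ m is an assumed value not given in the text.

```python
C = 299_792_458.0  # speed of light, m/s

def classify(dx, dt):
    """Classify an event at distance dx (m) and time offset dt (s) from the observer."""
    light_time = abs(dx) / C
    if dt >= light_time:
        return "future"            # a signal sent now can still reach the event
    if dt <= -light_time:
        return "past"              # a signal from the event can already have arrived
    return "physical present"      # neither influence nor knowledge is possible

def present_interval(distance):
    """Duration (s) of the physical present for events at the given distance."""
    return 2.0 * distance / C

sun_distance = 1.496e11   # assumed mean earth-sun distance, m

print(present_interval(sun_distance) / 60.0)   # duration in minutes
print(classify(sun_distance, 60.0))
```

An event on the sun one minute after $t_0$ falls in the physical present, since light needs more than eight minutes to cover the distance in either direction.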


2.7 The Lorentz Transformation 


Like all physical theories, relativity must stand or fall according to 
whether or not the deductions drawn from it agree with experimental 
facts. To make deductions from relativity, we compare descriptions of 
some phenomenon in terms of two inertial frames which are in relative 
motion. We need to know, first of all, how measurements of space and 
time compare. (In discussing these, we speak of an observer who makes 
measurements referred to a particular frame. This does not imply that 
relativity has any closer connection with human psychology than the 
rest of physics does.)

Einstein’s second postulate requires that the speed c of light in 
vacuo be the same in all directions and for observers in all inertial 
frames; thus c becomes a universal constant. 

Consider again the inertial frames S and S’ of Sec. 2.1. Let the
two origins coincide at $t = 0$, $t' = 0$, at which instant a light pulse is
emitted from the common origin. Imagine that observers in S and S’
have arranged apparatus which enable them to follow the pulse as it
moves outward from the source. By Einstein’s second postulate, observer
O in frame S and observer O’ in S’ find the locus of the wavefront
to be given respectively by

$$x^2 + y^2 + z^2 - c^2t^2 = 0 \tag{2.2a}$$
$$x'^2 + y'^2 + z'^2 - c^2t'^2 = 0 \tag{2.2b}$$

Thus each observer finds the wavefront to be a sphere centered at his own
origin, even though the origins of the two systems no longer coincide!
Direct substitution of the galilean transformation into Eq. (2.2b)
yields $x^2 - 2xVt + V^2t^2 + y^2 + z^2 - c^2t^2 = 0$, which is clearly
incompatible with Eq. (2.2a). The equations $y = y'$ and $z = z'$ are not the
source of the incompatibility, and we accept them without proof; powerful
arguments based on the isotropy of space can be advanced for their
validity. Therefore we seek compatible relations between $x'$, $x$, $t'$, and $t$.
As the origins pass, $x = x'$, and we choose $t = 0 = t'$. Because of the
homogeneity of space and of the uniformity of natural laws in time we
assume the relationships are linear and try

$$x' = \alpha x + \eta t \tag{2.3a}$$
$$t' = \varepsilon x + \gamma t \tag{2.3b}$$

where $\alpha$, $\eta$, $\varepsilon$, and $\gamma$ are constants to be determined. At the origin of
S’, $x' = 0$ and $x = Vt$, so that by Eq. (2.3a) $0 = \alpha Vt + \eta t$. Thus
$\eta = -\alpha V$, and so $x' = \alpha(x - Vt)$. Inserting this value of $x'$ and Eq.
(2.3b) into Eq. (2.2b) yields

$$\alpha^2x^2 - 2\alpha^2Vxt + \alpha^2V^2t^2 + y^2 + z^2 - c^2\varepsilon^2x^2 - 2c^2\varepsilon\gamma xt - c^2\gamma^2t^2 = 0$$

This result is compatible with Eq. (2.2a) only if

$$\alpha^2 - c^2\varepsilon^2 = 1 \qquad \alpha^2V + c^2\varepsilon\gamma = 0 \qquad c^2\gamma^2 - \alpha^2V^2 = c^2$$

These three equations can be solved for the three unknowns $\alpha$, $\gamma$, and $\varepsilon$
in terms of V and c and give

$$\alpha = \gamma = \frac{1}{\sqrt{1 - V^2/c^2}} \qquad \varepsilon = -\frac{V/c^2}{\sqrt{1 - V^2/c^2}}$$

All constants are now determined, and we have

$$x' = \frac{x - Vt}{\sqrt{1 - V^2/c^2}} = \gamma(x - Vt) \qquad y' = y$$
$$t' = \frac{t - Vx/c^2}{\sqrt{1 - V^2/c^2}} = \gamma\left(t - \frac{Vx}{c^2}\right) \qquad z' = z \tag{2.4}$$

(Lorentz transformation)


These equations, named the Lorentz transformation by Poincaré, were
discovered by Voigt in 1887 and independently derived by Lorentz in
1904 in the course of his study of matter moving in an electromagnetic
field. Lorentz assumed one frame of reference to be at rest in the ether
and attached an immediate physical meaning only to measurements
made with this frame. The new principle of relativity implies that all
inertial frames are to be treated on an equal footing.
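A quick numerical check makes the contrast between the two transformations concrete. The sketch below takes a point on the light wavefront in S (so that $x = ct$) and asks whether it also satisfies $x' = ct'$ in S’. The relative speed $V = 0.6c$ and the time $10^{-6}$ s are arbitrary illustrative choices, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light, m/s

def lorentz(x, t, V):
    """Return (x', t') from Eq. (2.4) for a frame S' moving at speed V along x."""
    gamma = 1.0 / math.sqrt(1.0 - V**2 / C**2)
    return gamma * (x - V * t), gamma * (t - V * x / C**2)

def galilean(x, t, V):
    """Return (x', t') for the galilean transformation."""
    return x - V * t, t

V = 0.6 * C
t = 1.0e-6      # arbitrary time, s
x = C * t       # a point on the wavefront along +x in S

for name, transform in (("lorentz", lorentz), ("galilean", galilean)):
    xp, tp = transform(x, t, V)
    # A result of zero means the point also lies on the wavefront in S'.
    print(name, xp - C * tp)
```

The Lorentz result vanishes to within rounding error, while the galilean result differs from zero by $-Vt$, which is the incompatibility noted above.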

By solving Eqs. (2.4) for x, y, z, t, we obtain the inverse
transformation:

$$x = \gamma(x' + Vt') \qquad y = y'$$
$$t = \gamma\left(t' + \frac{Vx'}{c^2}\right) \qquad z = z' \tag{2.4a}$$

From these equations, we can easily confirm the statement that 
events which happen at the same place at different times, as viewed from 
one frame, may be seen from another frame to happen at different places
as well. Similarly, a difference in spatial position with respect to one
frame may correspond to a difference in both space and time with respect
to another. Thus a space difference can be converted partly into a
time difference, or vice versa, merely by changing the frame of reference
that is being used. For this reason it has become customary to speak
of space and time as aspects of a four-dimensional continuum known
as space-time.


2.8 Space Contraction and Time Dilation 


In two famous cases the Lorentz transformation leads immediately to 
results of special interest. 

Consider a body which, when at rest in S, has a length $L_0$ in the
direction of the x axis. Let it be set moving relative to S at such speed
that it is at rest in S’. Its length as measured in S’ will now be $L_0$; for its
length is determined by certain natural laws and hence must have a
certain fixed value in any frame in which the body is at rest. Let us see
how the length now measures in S, relative to which the body is moving
with speed V. To do this, we must first consider what we mean by the
length of a moving object. It seems reasonable to define the length as
the distance between two points fixed in S which are occupied by the
ends of the object simultaneously, i.e., at the same time t. If the
coordinates of these points are $x_1$ and $x_2$, the length is then $L = x_2 - x_1$. By
the same definition, since the body is at rest in S’, its ends have fixed
coordinates $x_1'$, $x_2'$ such that $L_0 = x_2' - x_1'$. If we substitute values of
$x_2'$ and $x_1'$ from Eq. (2.4) for a given value of t, we obtain

$$L_0 = \gamma(x_2 - x_1) = \gamma L = \frac{L}{\sqrt{1 - V^2/c^2}}$$

or

$$L = L_0\sqrt{1 - V^2/c^2} \tag{2.5}$$


From this equation we may draw two distinct conclusions. First,
any body measures shorter in an inertial frame relative to which it is
moving with speed V than it does as measured in a frame relative to
which it is at rest, the ratio of shortening being $\sqrt{1 - V^2/c^2}$. This is
a relation between measurements referred to different frames. Second,
relative to a single frame, any physical body set into motion with speed
V shortens in the direction of its motion, as was postulated by FitzGerald
and Lorentz, in the ratio $\sqrt{1 - V^2/c^2}$. In one sense, the contraction
is perhaps not a “real” one, since in a frame in which it is at
rest the body measures the same as before; but as far as effects on
surrounding bodies are concerned, the contraction¹ is as real as if it were
due to a drop in temperature.
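To get a feeling for the size of the contraction in Eq. (2.5), the following sketch evaluates it for a rod of unit rest length at a few speeds. The speeds are illustrative choices (the first corresponds to the earth's orbital motion, $V/c = 10^{-4}$), and the function name is hypothetical.

```python
import math

def contracted_length(rest_length, beta):
    """Length measured in a frame in which the body moves at speed beta*c, Eq. (2.5)."""
    return rest_length * math.sqrt(1.0 - beta**2)

for beta in (1.0e-4, 0.5, 0.98):
    print(beta, contracted_length(1.0, beta))
```

At the earth's orbital speed the fractional shortening is only about 5 parts in $10^9$, which is why the effect escapes ordinary measurement.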

Next consider a vibrator, such as a good crystal clock or a radiating
atom, whose frequency is determined by natural laws and hence is the
same in any inertial frame in which the vibrator is at rest. Let the
vibrator be stationary in S’. Then the time T for one vibration as
measured in S can be written $t_2 - t_1$, where, using Eq. (2.4a) for $t_2$ and
$t_1$ with $x'$ held constant,

$$T = t_2 - t_1 = \gamma(t_2' - t_1') = \frac{T'}{\sqrt{1 - V^2/c^2}} \tag{2.6}$$

T’ representing the period as measured in S’. This result has again a
meaning beyond a mere relation between measurements. For T’ is also
the period that the vibrator would have in S if it were stationary in
that frame. Thus, relative to S, setting it in motion has lengthened its
period of vibration in the ratio $\gamma$. More generally, when system A is
in uniform translational motion at speed V relative to system B at rest
in an inertial frame, all natural processes in A as observed from B are
slowed down in the ratio $(1 - V^2/c^2)^{1/2}$; similarly, processes in B as
observed from A are slowed down in the same ratio. For example,
spectral lines from moving atoms show a relativistic doppler shift which
includes a slight displacement toward the red due to time dilation.
Muons are unstable particles with a rest mass 207 times that of
the electron and a charge of $\pm 1.6 \times 10^{-19}$ C. Muons at rest decay into
electrons or positrons with an average lifetime of about 2 μs. However,
for muons moving with a speed of 0.98c, the measured life is 5 times
longer. Experimental confirmation of the time dilation of Eq. (2.6)
comes from determination of the lifetimes of several unstable particles.
Time dilation effects are as “real” as those due to a slowing down of any
other sort. As a further example, suppose a good crystal clock were
subjected to steady translation for some time, first in one direction and
then in the opposite, until it returned to the starting point. As judged
by clocks that did not share its variable motion, it would lose time, and
after its return it would be behind a similar clock that had remained
at rest.

¹ An object moving with a speed approaching c is contracted in the direction of
motion in accord with Eq. (2.5). However, in 1959 Terrell pointed out that what any
observer “sees” depends on the light reaching the controlling aperture at a
particular instant. Light from different parts of the same object will have left the
object at different times, that from greater distances having left earlier than that from
nearer points. This leads to a distorted “picture” of the object. See Phys. Today,
13(9): 24 (1960).
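The factor of 5 quoted for muons at 0.98c follows directly from Eq. (2.6). The sketch below is a minimal check using the approximate rest lifetime of 2 μs given in the text; the function name is hypothetical.

```python
import math

def dilated_time(proper_time, beta):
    """Interval measured in a frame in which the clock moves at speed beta*c, Eq. (2.6)."""
    return proper_time / math.sqrt(1.0 - beta**2)

rest_lifetime = 2.0e-6   # s, approximate muon lifetime at rest (from the text)
beta = 0.98

gamma = dilated_time(1.0, beta)
print(gamma)                               # the dilation factor
print(dilated_time(rest_lifetime, beta))   # the lifetime seen in the laboratory, s
```

The factor comes out very close to 5, so a muon that lives about 2 μs at rest survives for roughly 10 μs as judged from the laboratory.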


2.9 Velocity Transformations 


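Poincaré used the Lorentz transformation to derive important formulas
for the transformations of velocities from frame S to frame S’. By
definition,

$$v_x = \frac{dx}{dt} \qquad v_y = \frac{dy}{dt} \qquad v_z = \frac{dz}{dt}$$
$$v_x' = \frac{dx'}{dt'} \qquad v_y' = \frac{dy'}{dt'} \qquad v_z' = \frac{dz'}{dt'}$$

On the other hand, from Eq. (2.4), $dy' = dy$, $dz' = dz$, and

$$dx' = \gamma(dx - V\,dt) \qquad dt' = \gamma\left[dt - \frac{V\,dx}{c^2}\right]$$

where $\gamma = 1/\sqrt{1 - V^2/c^2}$. Then we have

$$v_x' = \frac{dx'}{dt'} = \frac{\gamma(dx - V\,dt)}{\gamma[dt - (V/c^2)\,dx]} = \frac{dx/dt - V}{1 - (V/c^2)(dx/dt)} = \frac{v_x - V}{1 - Vv_x/c^2}$$

and in general,

$$v_x' = \frac{v_x - V}{1 - Vv_x/c^2} \qquad v_y' = \frac{v_y}{\gamma(1 - Vv_x/c^2)} \qquad v_z' = \frac{v_z}{\gamma(1 - Vv_x/c^2)} \tag{2.7}$$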

As a numerical example, consider two electrons ejected from a
radioactive source at rest in S, one toward $-x$ with $v_x = -0.8c$ and the
other toward $+x$ with $v_x = 0.9c$. Then the difference in their velocities,
measured in S, is 1.7c. But if we make $V = -0.8c$ so that the S’
frame keeps up with the electron going toward $-x$, the velocity of the
second electron relative to the first (measured now in S’) is

$$v_x' = \frac{0.9c - (-0.8c)}{1 - (-0.8c)(0.9c)/c^2} = \frac{1.7c}{1.72} = 0.988c$$

When $v_x$ and V are small compared with the speed of light, Eqs.
(2.7) reduce to the classical velocity transformations: $v_x' = v_x - V$,
$v_y' = v_y$, $v_z' = v_z$.
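The two-electron example can be checked with a short function for the x component of Eq. (2.7). This is a sketch only: speeds are expressed in units of c, the function name is hypothetical, and the low-speed values in the last line are illustrative.

```python
def transform_vx(vx, V):
    """x component of velocity in S', Eq. (2.7); all speeds in units of c."""
    return (vx - V) / (1.0 - V * vx)

# Second electron (0.9c) as seen from the frame moving with the first (-0.8c).
v_rel = transform_vx(0.9, -0.8)
print(v_rel)

# At low speeds the result is close to the classical value vx - V.
print(transform_vx(1.0e-4, -2.0e-4))
```

However close to c the two speeds are taken, the relative speed computed this way stays below c, and a signal moving at exactly c in S is found to move at c in S’ as well.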


2.10 The Variation of Mass


The laws of mechanics as they left the hands of Newton are not in 
harmony with the new theory of relativity. One approach for correcting
them was discovered originally in studying the motion of charged
particles in electromagnetic fields, but the corrections can also be deduced from a
study of ordinary mechanical phenomena.

For this purpose, we select for study an experiment the outcome
of which can be inferred from considerations of symmetry. Consider
two identical balls A and B; let A be projected in the +y direction with
a speed u in frame S and B be projected in the −y’ direction of frame S’
with speed u’. Assume that u’ = u, that both are small compared with
V (the speed of S’ relative to S), and that the balls collide elastically
with each other in such a way that the velocity of A is reversed in S
while that of B is reversed in S’ (Fig. 2.6). As seen in S, the y component
of the velocity of ball B is, by the inverse of Eq. (2.7),

$$v_y = \frac{v_y'\sqrt{1 - V^2/c^2}}{1 + Vv_x'/c^2} = -u'\sqrt{1 - \frac{V^2}{c^2}}$$

since $v_x' = 0$. If energy is conserved, the y components of the velocities
of the spheres are simply reversed, while the x components are unchanged.
Conservation of momentum requires that the y component of the momentum
in frame S before the collision be equal to that after the collision,
and so

$$m_Au - m_Bu'\sqrt{1 - \frac{V^2}{c^2}} = m_Bu'\sqrt{1 - \frac{V^2}{c^2}} - m_Au \tag{2.8}$$

Since u’ = u by hypothesis, as viewed from S, $m_B$ is equal not to $m_A$
but to $m_A/\sqrt{1 - V^2/c^2}$. So long as u is arbitrarily small, we may
replace $m_A$ by $m_0$, where $m_0$ is the mass of one of the spheres at rest
relative to S. Then clearly $m_B$ is a function of V, which we write m(V).
From Eq. (2.8) we then have

$$m(V) = \frac{m_0}{\sqrt{1 - V^2/c^2}} \tag{2.9}$$

Here $m_0$ is called the rest mass.
The formula for m has been derived here for a special case; if we
assume that it holds generally, we arrive at a relativistic mechanics
which is compatible with Einstein’s basic postulates. The momentum
p of a body moving with velocity v is then

$$p = mv = \frac{m_0v}{\sqrt{1 - v^2/c^2}} \tag{2.10}$$

As v approaches c, the effective mass increases without limit (Fig. 2.7).
No body with finite rest mass $m_0$ can quite achieve the speed of light;
an infinite momentum is unknown.

Fig. 2.6 Elastic collision of identical spheres viewed from inertial frames S and
S’ in relative motion.

Fig. 2.7 Ratio of the mass of a body to its
rest mass as a function of v/c.
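The growth of mass with speed described by Eq. (2.9) can be tabulated directly. In the sketch below the speeds are illustrative choices, the rest mass is taken as unity so that the printed value is the ratio $m/m_0$, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light, m/s

def relativistic_mass(m0, v):
    """m(v) = m0 / sqrt(1 - v^2/c^2), Eq. (2.9)."""
    return m0 / math.sqrt(1.0 - (v / C)**2)

def momentum(m0, v):
    """p = m(v) * v, Eq. (2.10)."""
    return relativistic_mass(m0, v) * v

# Ratio m/m0 at increasing fractions of the speed of light.
for beta in (0.1, 0.5, 0.9, 0.99, 0.999):
    print(beta, relativistic_mass(1.0, beta * C))
```

The ratio stays close to 1 until v is a sizable fraction of c and then rises steeply, in keeping with the statement that no body of finite rest mass can reach the speed of light.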


2.11 Force and Kinetic Energy 


All other mechanical terms and principles must now be scrutinized to
see whether or not they require changes.
In newtonian mechanics the force F is equal to the time derivative
of the momentum mv. Now that we recognize that the mass m is a
function of v, we have

$$F = \frac{d(mv)}{dt} = m\frac{dv}{dt} + v\frac{dm}{dt} \tag{2.11}$$

In Chap. 3 we shall see that when we patiently apply the Lorentz
transformations to the right side of this relation, forces too are different when
measured with respect to co-moving inertial systems.

In relativity we retain the definitions of the work done by a force
as $\int F \cdot ds$ and of energy as the ability or capacity to do work. We can
then compute the kinetic energy K of a body of rest mass $m_0$ and velocity
v in the usual way by calculating the work done by a constant net
force F acting on the body, initially at rest, until it attains the velocity
v. This gives

$$K = \int_{v=0}^{v=v} F\,ds = \int_{v=0}^{v=v} \frac{d(mv)}{dt}\,v\,dt = \int_{v=0}^{v=v} v\,d(mv)$$

since ds/dt = v, the instantaneous velocity. Inserting the value of the
mass m from Eq. (2.9), we have then

$$K = \int_0^v v\,d(mv) = m_0\int_0^v v\,d\left[\frac{v}{(1 - v^2/c^2)^{1/2}}\right]
= m_0\int_0^v \left[\frac{1}{(1 - v^2/c^2)^{1/2}} + \frac{v^2/c^2}{(1 - v^2/c^2)^{3/2}}\right]v\,dv$$
$$= m_0\int_0^v \frac{v\,dv}{(1 - v^2/c^2)^{3/2}} = m_0c^2\left[\frac{1}{(1 - v^2/c^2)^{1/2}}\right]_0^v$$

Thus we find, for the kinetic energy of a body of which the rest mass is
$m_0$, when moving with speed v,

$$K = m_0c^2\left[\frac{1}{\sqrt{1 - v^2/c^2}} - 1\right] \tag{2.12}$$

We can also expand in powers of v, obtaining

$$\frac{1}{\sqrt{1 - v^2/c^2}} = 1 + \frac{1}{2}\frac{v^2}{c^2} + \frac{3}{8}\frac{v^4}{c^4} + \frac{5}{16}\frac{v^6}{c^6} + \cdots$$

so that

$$K = \frac{1}{2}m_0v^2\left(1 + \frac{3}{4}\frac{v^2}{c^2} + \frac{5}{8}\frac{v^4}{c^4} + \cdots\right)$$

Thus, if $v \ll c$, K reduces approximately to the ordinary value, $\frac{1}{2}m_0v^2$.
Under the same circumstances, the momentum can also be written, as
usual, $m_0v$. In general, newtonian mechanics constitutes an approximate
form of mechanical theory valid for any motion which is slow as
compared with the speed of light.
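How quickly the newtonian value departs from Eq. (2.12) can be seen by evaluating both, together with the three-term series, at a few speeds. In this sketch energies are expressed in units of $m_0c^2$ and speeds as $\beta = v/c$; the sample speeds are illustrative and the function names are hypothetical.

```python
import math

def kinetic_energy(beta):
    """Relativistic K in units of m0*c^2, Eq. (2.12)."""
    return 1.0 / math.sqrt(1.0 - beta**2) - 1.0

def kinetic_energy_series(beta):
    """First three terms of the expansion of K in powers of beta = v/c."""
    return 0.5 * beta**2 * (1.0 + 0.75 * beta**2 + 0.625 * beta**4)

for beta in (0.01, 0.1, 0.5, 0.9):
    newtonian = 0.5 * beta**2
    print(beta, kinetic_energy(beta), kinetic_energy_series(beta), newtonian)
```

At $\beta = 0.1$ the newtonian value is already low by somewhat under 1 percent, and at $\beta = 0.9$ it is too small by a factor of about 3.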


2.12 Mass and Energy 


Combining Eqs. (2.9) and (2.12), we write 
$$ K = (m - m_0)c^2 \tag{2.12a} $$


Thus the kinetic energy of a moving body equals c? times its gain in mass 
due to the motion. This relation suggests that we may think of the 
increase in energy as the actual cause of the increase in mass. It is then 
an attractive hypothesis to suppose that even the rest mass $m_0$ is due to
the presence of an internal store of energy of amount $m_0c^2$. This may be
called the rest energy of the body. 

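The total energy of a moving body would then be $m_0c^2 + K$, or


$$ E = mc^2 = \frac{m_0c^2}{\sqrt{1 - v^2/c^2}} \tag{2.13} $$

and we can write for its inertial mass $m$ and momentum $p$

$$ m = \frac{E}{c^2} \tag{2.14a} $$

$$ p = \frac{Ev}{c^2} \tag{2.14b} $$

From (2.13) and (2.14b) it also follows that

$$ E^2 = m_0^2c^4 + p^2c^2 \tag{2.15} $$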


The foregoing relations suggest that inertial mass may be a property 
of energy rather than of matter as such, each joule of energy possessing, 
or having associated with it, 1/c? kg of mass. This idea has proved very 
useful in dealing with nuclear phenomena. For instance, a γ-ray photon
impinging upon a nucleus can be converted into an electron and positron.
Here the energy of the photon reappears in part as the kinetic
energy of the particles, in part as their rest-mass energy $2m_0c^2$. A
photon with frequency $\nu$ and wavelength $\lambda$ has energy $E = h\nu$ and
momentum $p = E/c = h\nu/c = h/\lambda$, where $h$ is Planck’s constant.

The electron, the least massive of charged particles, was the first
for which the relativistic change of mass with speed was investigated. 
The experimental results are in complete accord with relativity theory. 
Since high-energy charged particles are usually produced by accelerating 
them through potential differences, it is convenient to measure their 
energies—kinetic, rest, and total—in electron volts.¹ For example, the
rest mass of an electron corresponds to 0.511 MeV. An electron given 
a kinetic energy of 1 GeV in an accelerator has a total energy mc? of 
1000.511 MeV; its mass m is almost 2000 times the rest mass. 
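
The figures just quoted can be checked with a few lines of arithmetic. The sketch below assumes only Eqs. (2.9) and (2.12a) and the rest energy 0.511 MeV given above; the function name `mass_ratio_and_beta` is ours.

```python
import math

ELECTRON_REST_ENERGY_MEV = 0.511  # rest energy m0 c^2 quoted in the text


def mass_ratio_and_beta(kinetic_mev, rest_mev=ELECTRON_REST_ENERGY_MEV):
    """Return (m/m0, v/c) for a particle of given kinetic energy.

    m/m0 follows from K = (m - m0) c^2, Eq. (2.12a), and v/c from
    m = m0 / sqrt(1 - v^2/c^2), Eq. (2.9).
    """
    mass_ratio = 1.0 + kinetic_mev / rest_mev
    beta = math.sqrt(1.0 - 1.0 / mass_ratio**2)
    return mass_ratio, beta


if __name__ == "__main__":
    ratio, beta = mass_ratio_and_beta(1000.0)  # 1 GeV electron
    print(f"m/m0 = {ratio:.1f}, v/c = {beta:.8f}")
```

The mass ratio comes out a little under 2000, in agreement with the statement above, and the speed differs from $c$ by only about one part in $10^7$.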

Actually the energies equivalent to the masses of elementary 
particles, such as the electron and proton, are known to higher precision 
in electron volts than the actual masses in kilograms. For this reason 
it is common to find the “mass” of a particle such as the proton quoted 
as 938 MeV. The masses of many elementary particles are listed in 
this way inside the back cover. 

The association of mass and energy is not limited to kinetic energy 
or to rest mass. Relativity requires mass to be associated with poten- 
tial energy as well. Lorentz and others explored the possibility that 
the rest mass of the electron might be potential energy arising from the 
charge distribution. The potential energy of a charge $Q$ distributed
uniformly through a sphere of radius $r$ is $\frac{3}{5}Q^2/4\pi\epsilon_0r$; for other spherical
distributions the $\frac{3}{5}$ was replaced by other numerical constants, but the
$Q^2/4\pi\epsilon_0r$ appeared repeatedly. Eventually it became customary to write
for the electron, $m_0c^2 = e^2/4\pi\epsilon_0r_0$, where


$$ r_0 = \frac{e^2}{4\pi\epsilon_0m_0c^2} = 2.81784 \times 10^{-15} \text{ m} \tag{2.16} $$


is known as the classical radius of the electron. Experiments on the
scattering of electrons by electrons at high energies have shown that the
interaction remains coulomb repulsion down to separations of less than
$2 \times 10^{-16}$ m, so that clearly the classical radius is several times too large
to be consistent with electron-electron interactions. On the other hand, 
for scattering x-rays the effective radius of the electron is of the order 
of ro. 
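
Equation (2.16) is easily evaluated numerically. In the sketch below the values of the constants are assumptions of ours (recent recommended SI values, not figures taken from this text), so the last digits differ slightly from those printed in Eq. (2.16).

```python
import math

# Assumed SI values of the constants (not taken from this text).
E_CHARGE = 1.602176634e-19        # electronic charge, C
EPSILON_0 = 8.8541878128e-12      # permittivity of free space, F/m
ELECTRON_MASS = 9.1093837015e-31  # electron rest mass, kg
C = 299_792_458.0                 # speed of light, m/s


def classical_radius(charge, rest_mass):
    """Eq. (2.16): r0 = e^2 / (4 pi eps0 m0 c^2)."""
    return charge**2 / (4.0 * math.pi * EPSILON_0 * rest_mass * C**2)


def uniform_sphere_energy(charge, radius):
    """Potential energy (3/5) Q^2 / (4 pi eps0 r) of a uniformly charged sphere."""
    return 0.6 * charge**2 / (4.0 * math.pi * EPSILON_0 * radius)


if __name__ == "__main__":
    r0 = classical_radius(E_CHARGE, ELECTRON_MASS)
    print(f"r0 = {r0:.5e} m")
    fraction = uniform_sphere_energy(E_CHARGE, r0) / (ELECTRON_MASS * C**2)
    print(f"uniform-sphere energy at r0 = {fraction:.2f} of the rest energy")
```

A uniformly charged sphere of radius $r_0$ thus accounts for only three-fifths of the rest energy, which illustrates why the numerical constant in front of $e^2/4\pi\epsilon_0r_0$ depends on the assumed charge distribution.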


¹ An electron volt is the energy gained by a particle carrying one electronic
charge in falling through a potential difference of one volt;
1 eV = $1.602 \times 10^{-19}$ J.


omitted in shorter courses.

[...] opposite velocities along the $x$ axis, as seen by an observer, collide
with each other, a spring acting as a buffer between them. Just as
they come to rest, let a lock snap shut and hold them combined into
a single mass. The initial kinetic energy of the masses is converted
by the collision into potential energy of the spring. Let us view this
collision from a second frame of reference $S'$ that is moving with
velocity $V$ parallel to $x$. Then, if the velocities of the two bodies
before the collision, as measured in $S$, are $v_1 = v_{1x} = v$, $v_2 = v_{2x} = -v$,
the same velocities as measured in $S'$ are, by Eq. (2.7),


$$ v_1' = v_{1x}' = \frac{v - V}{1 - vV/c^2} \qquad v_2' = v_{2x}' = -\frac{v + V}{1 + vV/c^2} $$

Therefore,

$$ \frac{v_1'}{\sqrt{1 - v_1'^2/c^2}} = \frac{(v - V)/(1 - vV/c^2)}{\sqrt{1 - v_1'^2/c^2}} = \frac{v - V}{[1 - (V^2 + v^2)/c^2 + V^2v^2/c^4]^{1/2}} $$


the last term being obtained in this form after inserting for $v_1'$ under the
radical the first expression given for $v_1'$ above. Similarly,


$$ \frac{v_2'}{\sqrt{1 - v_2'^2/c^2}} = -\frac{v + V}{[1 - (V^2 + v^2)/c^2 + V^2v^2/c^4]^{1/2}} $$

The total momentum before collision is, therefore,

$$ \frac{m_0v_1'}{\sqrt{1 - v_1'^2/c^2}} + \frac{m_0v_2'}{\sqrt{1 - v_2'^2/c^2}} = -\frac{2m_0V}{\sqrt{1 - v^2/c^2}\,\sqrt{1 - V^2/c^2}} $$


On the other hand, if the rest mass of the combined body after
collision were merely the sum of the rest masses of the separate bodies,
or $2m_0$, the total momentum after collision would be


$$ -\frac{2m_0V}{\sqrt{1 - V^2/c^2}} \tag{2.17} $$


Problems


1. An electron is moving at a speed of $1.8 \times 10^8$ m/s. Find the ratio
of its mass to its rest mass. What is its kinetic energy? Its total 
energy? 

Ans: 1.25; 128 keV; 0.639 MeV 
2. A proton is accelerated in a synchrotron until its kinetic energy is
just equal to its rest mass (938 MeV). Find the ratio $v/c$ for this proton.
3. An electron is accelerated to a kinetic energy of $1.00 \times 10^9$ eV
(1 GeV, or as it is sometimes known in the United States, 1 BeV). Find
for this electron the ratios $m/m_0$ and $v/c$.

Ans: $1.96 \times 10^3$; 0.99999987
4. Through what potential difference must protons be accelerated to 
achieve a speed of 0.600c? What is the momentum of such a proton? 
The total energy? 

Ans: 235 MeV; $3.76 \times 10^{-19}$ kg-m/s; 1173 MeV
5. Find the work which must be done on an electron to increase its 
speed from 0.5c to 0.9c.
6. How many electron volts of energy must an electron gain to bring
its mass (a) to $1.05m_0$ and (b) to $2m_0$? In each case what is the speed
of the electron?

Ans: (a) 25,500, $9.2 \times 10^7$ m/s; (b) 511,000, $2.6 \times 10^8$ m/s
7. Carry out the operations required to obtain the velocity transformations
for v given in Eq. (2.7). Starting with Eqs. (2.4a), derive the
transformations for $v_y'$ and $v_z'$ in terms of $v_y$ and $v_z$ for a particle with
arbitrary velocity v.
8. An α particle moving east with a speed of 0.5c is passed by an electron
moving west with a speed of 0.95c. Find the speed of the electron
relative to the α particle.

Ans: 0.98c
9. An electron moving to the right with a speed of $2.5 \times 10^8$ m/s
passes an electron moving to the left with a speed of $2.8 \times 10^8$ m/s.
Find the speed of one electron relative to the other.

Ans: $2.98 \times 10^8$ m/s
10. Show that if two events occur simultaneously at different points
$(x_1, y_1, z_1)$ and $(x_2, y_2, z_2)$ in the $S$ frame, they occur at different times in
$S'$ with $t_2' - t_1' = \gamma V(x_1 - x_2)/c^2$.
11. Observer A is at rest in a frame $S'$ which moves horizontally past
an inertial frame $S$ at a speed of 0.6c. A boy in the latter frame drops
a ball which according to observer A’s clock falls for 1.5 s. How long
would the ball fall for an observer at rest in the $S$ frame?

Ans: 1.2 s
12. The mean life of a muon at rest is about $2 \times 10^{-6}$ s. If muons in
cosmic rays have an average speed of 0.998c, find the mean life of these
muons and the approximate distance they move before decaying.

Ans: $3.2 \times 10^{-5}$ s; 9.5 km
13. The observed half-life of pions at rest is $1.8 \times 10^{-8}$ s. A beam
of pions has a speed of 0.95c. In the laboratory system how long will 
it take for half the pions to decay? Through what distance will they 
travel in that time? 


Ans: 5.8 ns; 1.7 m 

14. A constant force $F$ acts on a body initially at rest for a time $t$.
Show that its speed $v(t)$ is given by $c/[1 + (m_0c/Ft)^2]^{1/2}$. Show further
that if $Ft \ll m_0c$, $v(t) \approx Ft/m_0$, while if $Ft \gg m_0c$, $v(t) \approx c$.

15. A particle has a rest energy $m_0c^2$ and a total energy $E$. Show that
the velocity of this particle is $c\sqrt{1 - (m_0c^2/E)^2}$.

16. Fizeau showed that the speed of light in a liquid streaming toward
a light source with velocity $u$ was given by $v = c/n - u(1 - 1/n^2)$,
where $n$ is the refractive index of the liquid. Show that this result
agrees with the relativistic law, Eq. (2.7), provided that terms in $u/c$
higher than the first power are neglected.
17. Show that if an observer in $S$ determines the length of a meter
stick in $S'$ by determining the time required for the meter stick to pass
a fixed point $x_1$ and multiplying this time by $V$, he obtains the same
contraction found in Eq. (2.5).
18. A particle has momentum $p$, kinetic energy $K$, velocity $v$, and rest
mass $m_0$. Show that


$$ \frac{pv}{K} = \frac{K/m_0c^2 + 2}{K/m_0c^2 + 1} $$


Prove that this equation gives the classical $K = \frac{1}{2}mv^2$ for $K \ll m_0c^2$
and $K = mv^2$ for $K \gg m_0c^2$.
19. Compute the daily loss of mass of the sun associated with the
isotropic emission of electromagnetic radiation, given that the average
solar constant at the earth is 1400 W/m² and that the average radius
of the earth’s orbit is $1.5 \times 10^{11}$ m.

Ans: $3.8 \times 10^{14}$ kg/day


20. An electron moving perpendicular to a uniform magnetic field $B$
is bent into a circular path. Show that $Br = \sqrt{K^2 + 1.02K}/300$ if
$B$ is in webers per square meter, $r$ in meters, and the kinetic energy $K$
is in MeV.
21. (a) A stick of length $L_0$ is at rest in $S$, making an angle $\theta$ with the
$x$ axis. Show that an observer in the $S'$ frame finds the length to be
$L' = L_0(\cos^2\theta/\gamma^2 + \sin^2\theta)^{1/2}$ and the angle $\theta'$ with his $x$ axis to be given
by $\tan\theta' = \gamma\tan\theta$.

(b) A particle moves at speed $v$ at an angle $\theta$ with the $x$ axis in the
$S$ frame. Find its speed and direction relative to the $x'$ axis of the $S'$
frame.


Ans: $v' = \dfrac{[v^2 - 2Vv\cos\theta + V^2 - (Vv\sin\theta/c)^2]^{1/2}}{1 - (Vv\cos\theta)/c^2}$; $\tan\theta' = \dfrac{\sqrt{1 - V^2/c^2}\;v\sin\theta}{v\cos\theta - V}$


22. Show that for a particle having acceleration components $a_x$ and $a_y$
in the $S$ frame, the corresponding accelerations in the $S'$ frame are

$$ a_x' = \frac{(1 - V^2/c^2)^{3/2}}{(1 - Vv_x/c^2)^3}\,a_x $$

and

$$ a_y' = \frac{1 - V^2/c^2}{(1 - Vv_x/c^2)^2}\left( a_y + \frac{Vv_y/c^2}{1 - Vv_x/c^2}\,a_x \right) $$


23. In 1906 Einstein proposed a gedanken (thought) experiment to
show that radiant energy has associated with it a mass equivalent equal to
the energy divided by $c^2$. He supposed that a burst of photons with total energy
$E$ and momentum $E/c$ left one end of a cylinder of length $L$ and mass
$M$. While the photons traverse the length of the cylinder, the latter
recoils in the opposite direction until the burst is absorbed at the far
end. Show that the photons must have transferred a mass $E/c^2$ if the
center of mass of the isolated system remains fixed.

24. Show that if the force between two particles is a function of only 
their separation, Newton’s second law F = ma is invariant under a 
galilean transformation. 

25. Show that the Lorentz-FitzGerald contraction leads to a nonzero
fringe shift if the interferometer paths differ in length and if the velocity
of the interferometer relative to the ether varies.


chapter three


Relativity and Four-vectors


In relativity, space and time are no longer separate concepts but are
interwoven in a way that makes the introduction of a four-dimensional
space-time continuum very useful. In such a four-dimensional space many
of the laws of classical physics can be expressed in a particularly elegant
form which greatly simplifies the shift from one inertial system to another.


3.1 The Interval between Events 


As we saw in Chap. 2, observers O and O’ in inertial frames S and S’ 
moving relative to one another ordinarily assign different time and space 
coordinates to an event which both witness. By an event we mean 
some physical occurrence detected by an observer; e.g., a cosmic-ray 
electron interacts with a nucleus in the atmosphere to produce a photon. 
To $O$ the event is specified in terms of three spatial coordinates and the
time, hence by the four coordinates $(x_1, y_1, z_1, t_1)$.

Next consider a second event, perhaps the decay of a pion, to
which $O$ assigns the coordinates $(x_2, y_2, z_2, t_2)$. We shall define the interval
between the two events to be the quantity $s_{12}$, where


$$ s_{12}^2 = c^2(t_2 - t_1)^2 - (x_2 - x_1)^2 - (y_2 - y_1)^2 - (z_2 - z_1)^2 \tag{3.1} $$


If observer $O'$ detected the same events, he would specify them with
coordinates $(x_1', y_1', z_1', t_1')$ and $(x_2', y_2', z_2', t_2')$. Even though $O$ and $O'$ might
find significantly different values of the time elapsed between the two
events and of their spatial separation, the interval is the same for both;
it is an invariant common to all observers in inertial frames.

When $s_{12}^2$ is positive, the interval $s_{12}$ is a real number and is said
to be timelike. For a timelike interval it is always possible to find an
inertial frame in which the two events occur at the same place (values
of $x$, $y$, $z$). However, it is impossible to find an inertial frame in which
the two events occur simultaneously ($t_2 = t_1$) since in such a frame $s_{12}^2$
would be negative, contrary to the first condition of this paragraph.

If $s_{12}^2$ is negative, the interval is imaginary and is referred to as
spacelike. For any spacelike interval, it is always possible to find some 
inertial frame in which the two events are simultaneous, but it is impos- 
sible to find a frame in which the events occur at the same place. Thus 
the spacelike or timelike character of an interval is independent of the 
inertial frame of reference. In the discussions of Sec. 2.6 and in Fig. 2.5 
all points in the regions marked “future” and “past” are separated from 
the origin by timelike intervals, while those in the regions marked 
“present” are separated from the origin by spacelike intervals. 
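
The invariance and classification of intervals can be tried out numerically. The following is a minimal sketch, assuming the standard Lorentz transformation for motion along $x$ [Eqs. (2.4)]; the function names and the sample events are hypothetical choices of ours.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def lorentz_boost(event, V):
    """Transform (x, y, z, t) from S to a frame S' moving with speed V along x."""
    x, y, z, t = event
    g = 1.0 / math.sqrt(1.0 - (V / C) ** 2)
    return (g * (x - V * t), y, z, g * (t - V * x / C**2))


def interval_squared(e1, e2):
    """Eq. (3.1): s12^2 = c^2 (t2 - t1)^2 - (x2 - x1)^2 - (y2 - y1)^2 - (z2 - z1)^2."""
    dx, dy, dz, dt = (b - a for a, b in zip(e1, e2))
    return (C * dt) ** 2 - dx**2 - dy**2 - dz**2


def classify(s2):
    """Name the character of an interval from the sign of s12^2."""
    if s2 > 0:
        return "timelike"
    if s2 < 0:
        return "spacelike"
    return "lightlike"


if __name__ == "__main__":
    origin = (0.0, 0.0, 0.0, 0.0)
    event = (300.0, 0.0, 0.0, 2e-6)  # 300 m away, 2 microseconds later
    V = 0.6 * C
    print(classify(interval_squared(origin, event)))
    print(interval_squared(origin, event))
    print(interval_squared(lorentz_boost(origin, V), lorentz_boost(event, V)))
```

For the timelike pair used here, a frame moving with $V = \Delta x/\Delta t$ (less than $c$) brings the two events to the same place, as stated above; no such frame exists for a spacelike pair.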


3.2 Four-vectors 


Let us introduce $w = ict$ ($i = \sqrt{-1}$) and rewrite Eq. (3.1) in the form


$$ -s_{12}^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2 + (w_2 - w_1)^2 \tag{3.1a} $$


Here we may interpret $x$, $y$, $z$, and $w$ as cartesian coordinates in a
four-dimensional euclidean “space” in which $\sqrt{-s_{12}^2}$ may be regarded as
the “distance” between points $(x_1, y_1, z_1, w_1)$ and $(x_2, y_2, z_2, w_2)$. The
corresponding “displacement” can be treated as a vector with components
$x_2 - x_1$, $y_2 - y_1$, $z_2 - z_1$, and $w_2 - w_1$ in a four-dimensional space.
Such a quantity is often called a four-vector or a world vector.

If $x$, $y$, $z$, $w$ are the coordinates of an event in frame $S$, we may
consider these quantities as the components of a four-dimensional radius
vector for this event. Thus, if $\mathbf{1}_x$, $\mathbf{1}_y$, $\mathbf{1}_z$, and $\mathbf{1}_w$ represent unit vectors
in the $x$, $y$, $z$, and $w$ directions,


$$ r = \mathbf{1}_x x + \mathbf{1}_y y + \mathbf{1}_z z + \mathbf{1}_w w $$

and

$$ r^2 = r \cdot r = x^2 + y^2 + z^2 + w^2 $$


since $\mathbf{1}_x \cdot \mathbf{1}_x = 1$, $\mathbf{1}_x \cdot \mathbf{1}_y = 0$, etc., just as for ordinary three-dimensional
vectors. The magnitude of the four-vector $r$ is given by


$$ r = \sqrt{x^2 + y^2 + z^2 + w^2} $$


If we wish to find the components of the vector $r$ in frame $S'$ moving
in the $x$ direction with speed $V$ and with times measured from the
instant the origins of the two frames coincide (Secs. 2.1 and 2.7), we
apply the Lorentz transformation, Eqs. (2.4), which can be written in
matrix form


$$ \begin{pmatrix} x' \\ y' \\ z' \\ w' \end{pmatrix} = \begin{pmatrix} \gamma_r & 0 & 0 & i\beta_r\gamma_r \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ -i\beta_r\gamma_r & 0 & 0 & \gamma_r \end{pmatrix} \begin{pmatrix} x \\ y \\ z \\ w \end{pmatrix} \tag{3.2} $$


where $\gamma_r = 1/\sqrt{1 - V^2/c^2}$ and $\beta_r = V/c$. It is left as an exercise
(Prob. 1) to show that Eq. (3.2) is equivalent to Eq. (2.4).
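
A numerical check of Eq. (3.2) can be sketched with ordinary complex arithmetic. The helper names below are hypothetical; the only assumptions are $w = ict$ and the usual form $x' = \gamma_r(x - Vt)$, $t' = \gamma_r(t - Vx/c^2)$ of the Lorentz transformation.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def boost_matrix(V):
    """The 4 x 4 matrix of Eq. (3.2) for relative speed V along x."""
    beta = V / C
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return [
        [gamma, 0, 0, 1j * beta * gamma],
        [0, 1, 0, 0],
        [0, 0, 1, 0],
        [-1j * beta * gamma, 0, 0, gamma],
    ]


def apply(matrix, vec):
    """Multiply a 4 x 4 matrix into a column vector given as a list."""
    return [sum(m * c for m, c in zip(row, vec)) for row in matrix]


def four_vector(x, y, z, t):
    """Radius four-vector (x, y, z, w) with w = ict."""
    return [x, y, z, 1j * C * t]


def squared_magnitude(vec):
    """x^2 + y^2 + z^2 + w^2, formed without complex conjugation."""
    return sum(c * c for c in vec)


if __name__ == "__main__":
    V = 0.8 * C
    r = four_vector(1000.0, 2.0, 3.0, 5e-6)
    r_prime = apply(boost_matrix(V), r)
    print("x' =", r_prime[0].real, " t' =", (r_prime[3] / (1j * C)).real)
    print(squared_magnitude(r), squared_magnitude(r_prime))
```

The two squared magnitudes agree to rounding error, which is the statement that $r \cdot r = -s^2$ is unchanged by the transformation.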

Minkowski discovered another way of looking at the Lorentz
transformation; Eqs. (2.4) are mathematically the same as the equations
for the transformation of a four-vector under a rotation of the $x$ and $w$
axes in the four-dimensional $x$, $y$, $z$, $w$ space through an imaginary
angle $\phi = \tan^{-1} i\beta_r$. For any point $P$ with coordinates $(x, w)$, the
corresponding coordinates $(x', w')$ are, from Fig. 3.1 and analytical
geometry,


$$ x' = x\cos\phi + w\sin\phi \qquad w' = w\cos\phi - x\sin\phi $$

while the Lorentz transformation, Eqs. (2.4) with $w = ict$, gives


$$ x' = \gamma_r x + i\beta_r\gamma_r w \qquad w' = \gamma_r w - i\beta_r\gamma_r x $$

Fig. 3.1 Rotation of the $x$ and $w$ coordinate axes through angle $\phi$.


Comparing these two pairs of equations shows that


$$ \cos\phi = \gamma_r = \frac{1}{\sqrt{1 - V^2/c^2}} \qquad \text{and} \qquad \sin\phi = i\beta_r\gamma_r = \frac{iV}{c\sqrt{1 - V^2/c^2}} $$
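
That an imaginary angle really does reproduce these values can be verified with complex arithmetic. The short sketch below uses Python's standard `cmath` module; the function name `rotation_angle` and the sample value $\beta_r = 0.6$ are ours.

```python
import cmath
import math


def rotation_angle(beta):
    """Imaginary rotation angle phi = arctan(i beta) for relative speed beta = V/c."""
    return cmath.atan(1j * beta)


if __name__ == "__main__":
    beta = 0.6
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    phi = rotation_angle(beta)
    print("phi      =", phi)             # purely imaginary
    print("cos(phi) =", cmath.cos(phi))  # compare with gamma
    print("sin(phi) =", cmath.sin(phi))  # compare with i * beta * gamma
    print("gamma    =", gamma, " i*beta*gamma =", 1j * beta * gamma)
```

Since $\cos^2\phi + \sin^2\phi = \gamma_r^2(1 - \beta_r^2) = 1$, the usual trigonometric identity continues to hold for the imaginary angle.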


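In addition to the radius four-vector many other useful four-vectors
appear in relativistic physics. Here a four-vector is defined as a quantity
whose components transform in the same way as $(x, y, z, w)$ under a
Lorentz transformation. Thus, if $A$ is a four-vector, it must transform
to $A'$ in going from $S$ to $S'$ in the same way as $r$, viz.,


$$ \begin{pmatrix} A_x' \\ A_y' \\ A_z' \\ A_w' \end{pmatrix} = \begin{pmatrix} \gamma_r & 0 & 0 & i\beta_r\gamma_r \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ -i\beta_r\gamma_r & 0 & 0 & \gamma_r \end{pmatrix} \begin{pmatrix} A_x \\ A_y \\ A_z \\ A_w \end{pmatrix} \tag{3.2a} $$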


The magnitude of a four-vector A is a scalar invariant, i.e., the same 
for all inertial frames. 

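If $A$ and $B$ represent four-vectors, $A = \mathbf{1}_x A_x + \mathbf{1}_y A_y + \mathbf{1}_z A_z + \mathbf{1}_w A_w$
and $B = \mathbf{1}_x B_x + \mathbf{1}_y B_y + \mathbf{1}_z B_z + \mathbf{1}_w B_w$, the scalar product is
$A \cdot B = A_xB_x + A_yB_y + A_zB_z + A_wB_w$. However, the cross product $A \times B$
cannot be represented by a vector; rather it is defined as the antisymmetric
world tensor $\Phi$, the components of which are the determinants
of the matrix

$$ \begin{pmatrix} A_x & A_y & A_z & A_w \\ B_x & B_y & B_z & B_w \end{pmatrix} $$

or


$$ A \times B = \Phi = \begin{pmatrix} 0 & \Phi_{xy} & \Phi_{xz} & \Phi_{xw} \\ \Phi_{yx} & 0 & \Phi_{yz} & \Phi_{yw} \\ \Phi_{zx} & \Phi_{zy} & 0 & \Phi_{zw} \\ \Phi_{wx} & \Phi_{wy} & \Phi_{wz} & 0 \end{pmatrix} \tag{3.3} $$


where $\Phi_{xy} = -\Phi_{yx} = A_xB_y - A_yB_x$, etc.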


3.3 Proper Time and the Four-momentum 


One of the postulates of special relativity is that properly formulated 
laws of nature are the same for any two inertial frames; thus they may 
not include anything which refers to the translational velocity of the 
reference frame. If we let f stand for the classical resultant force on a 
body, Newton's second law of motion in the form f = dp/dt does not 
satisfy this postulate. Even when the relativistic change of mass with 
velocity is introduced, the law does not maintain its form under a Lorentz 
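transformation. Applying Newton's second law with $\mathbf{p}$ given by Eq.
(2.10) yields


$$ \frac{d\mathbf{p}}{dt} = \frac{m_0}{\sqrt{1 - v^2/c^2}}\left( \frac{d\mathbf{v}}{dt} + \frac{v\,dv/dt}{c^2 - v^2}\,\mathbf{v} \right) \tag{3.4} $$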


When the direction of the force differs from that of the velocity, the 
acceleration is not in the direction of the force unless the speed is con- 
stant (as it is in the very important case of a particle moving perpendic- 
ular to a uniform magnetic induction B). Further, when the force is 
in the direction of the velocity, the ratio of force to acceleration is given 
by $m_0/(1 - v^2/c^2)^{3/2}$, which is sometimes called the longitudinal mass.
When the force is perpendicular to the velocity, the force/acceleration
ratio is $m_0/(1 - v^2/c^2)^{1/2}$, sometimes called the transverse mass. In the
clumsy formalism using three space coordinates plus time, we have the
fiction that mass, an intrinsic property, depends on reference frame, and
worse, that force has complicated transformation properties. In
four-vector formalism it is possible to remove these difficulties and arrive
at a logical, self-consistent system.

In Sec. 3.4 we shall describe one way to put Newton’s second law 
into a valid four-vector form compatible with the Einstein postulates, 
but first we must introduce the proper time $\tau$, which is time measured in
an inertial reference frame in which the body is instantaneously at rest.
For this special frame $S'$ the spatial coordinates $x'$, $y'$, $z'$ do not change
in an infinitesimal time interval $dt'$, so that by the invariance of intervals
[Eq. (3.1)] we have

$$ c^2\,d\tau^2 = c^2\,dt'^2 = c^2\,dt^2 - (dx)^2 - (dy)^2 - (dz)^2 $$


or


$$ d\tau = dt\left\{ 1 - \frac{1}{c^2}\left[ \left(\frac{dx}{dt}\right)^2 + \left(\frac{dy}{dt}\right)^2 + \left(\frac{dz}{dt}\right)^2 \right] \right\}^{1/2} = dt\sqrt{1 - \frac{v^2}{c^2}} \tag{3.5} $$


where $x$, $y$, $z$, $t$ are measured in the inertial reference frame $S$ of observer
$O$. Since $c\,d\tau$ is an interval, the proper-time element $d\tau$ is the same for
observers in all inertial frames. As in ordinary three-dimensional space,
a quantity which has the same value in all inertial frames is called a
scalar; thus we say that $d\tau$ is a Lorentz scalar. When the elapsed time
in $S$ is $t_2 - t_1$, the corresponding proper-time duration is


$$ \tau_2 - \tau_1 = \int_{t_1}^{t_2} \sqrt{1 - \frac{v^2}{c^2}}\;dt \tag{3.5a} $$


where v is a function of time t for the accelerated object. From Eq. 
(3.5a) it can be seen that the proper-time interim for an accelerated 
body is always less than the corresponding interim in any inertial frame 
from which the motion is observed. 
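
Equation (3.5a) lends itself to direct numerical integration. The sketch below is an illustration under assumptions of ours: the speed law $v(t) = at/\sqrt{1 + (at/c)^2}$ is that of a body acted on by a constant force (it is the result quoted in Prob. 14 of Chap. 2 with $a = F/m_0$), for which the integral has the closed form $(c/a)\sinh^{-1}(at/c)$; the midpoint rule and the function names are arbitrary choices.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def proper_time(speed, t1, t2, steps=10_000):
    """Midpoint-rule estimate of Eq. (3.5a) for a given speed function v(t)."""
    h = (t2 - t1) / steps
    total = 0.0
    for k in range(steps):
        t = t1 + (k + 0.5) * h
        total += math.sqrt(1.0 - (speed(t) / C) ** 2) * h
    return total


def constant_force_speed(a):
    """Speed of a body starting from rest under a constant force, with a = F/m0."""
    def speed(t):
        return a * t / math.sqrt(1.0 + (a * t / C) ** 2)
    return speed


if __name__ == "__main__":
    a = 9.81        # m/s^2, an illustrative value
    T = 3.15576e7   # one year of time in frame S, in seconds
    tau = proper_time(constant_force_speed(a), 0.0, T)
    exact = (C / a) * math.asinh(a * T / C)
    print(f"elapsed time in S : {T:.6e} s")
    print(f"proper time       : {tau:.6e} s (closed form {exact:.6e} s)")
```

The proper time comes out smaller than the elapsed time in $S$, as the remark following Eq. (3.5a) requires.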

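We now define the four-velocity $u$ of the body as the derivative
of its four-vector radius $r$ with respect to proper time $\tau$:


$$ u = \frac{dr}{d\tau} = \mathbf{1}_x\frac{dx}{d\tau} + \mathbf{1}_y\frac{dy}{d\tau} + \mathbf{1}_z\frac{dz}{d\tau} + \mathbf{1}_w\frac{dw}{d\tau} = \frac{\mathbf{1}_x v_x + \mathbf{1}_y v_y + \mathbf{1}_z v_z + \mathbf{1}_w ic}{\sqrt{1 - v^2/c^2}} \tag{3.6} $$


where $v^2$ is the square of the ordinary velocity. Thus $u_x = \gamma v_x$; in
general the spatial components of the four-velocity are $\gamma$ times the
components of the ordinary velocity, while the fourth component is $i\gamma c$.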

The four-momentum $P$ of any particle is the product of the
four-velocity $u$ and the proper mass $m_0$. Note that the adjective proper
means relative to an inertial frame in which the body is instantaneously
at rest. The proper mass $m_0$, the proper length $L_0$ of an object, and the
proper time $\tau$ are all Lorentz scalars, whence comes their usefulness.
From the definition


$$ P = m_0u = m_0\gamma(\mathbf{1}_x v_x + \mathbf{1}_y v_y + \mathbf{1}_z v_z + \mathbf{1}_w ic) \tag{3.7} $$

we observe that the spatial components are the ordinary momentum
components. The fourth component is $im_0\gamma c = i(mc^2)/c = iE/c$, where
$E$ is the total energy [Eq. (2.13)] of the body. Thus the time component
of the momentum four-vector is the energy multiplied by $\sqrt{-1}/c$. When
one transforms from one’s rest frame $S$ to a moving frame $S'$, the
four-momentum transforms as a four-vector, so that, by Eq. (3.2a),


$$ p_x' = \gamma_r\left( p_x - \frac{VE}{c^2} \right) \qquad p_y' = p_y \qquad p_z' = p_z \tag{3.8} $$

$$ E' = \gamma_r(E - Vp_x) $$


The four-momentum has a magnitude $(P \cdot P)^{1/2} = im_0c$. Indeed,
the dot product of any two four-vectors is a scalar invariant. Further,
$P \cdot P = p^2 - E^2/c^2 = -m_0^2c^2$, which is Eq. (2.15) in a different form.
While the rest mass is the same in every inertial frame, the momentum
$p$ and the total energy $E$ are different.

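The four-momentum concept can be applied to find the equations
for the electromagnetic doppler effect. Consider a photon of energy $h\nu$
and momentum $\mathbf{1}(h\nu/c)$, where $\mathbf{1}$ is a unit vector in the propagation
direction in frame $S$. Let $\theta$ be the angle between $\mathbf{1}$ and the $x$ axis and
let $\mathbf{1}$ lie in the $xy$ plane. In frame $S'$, moving with velocity $V$ along
the $x$ axis of $S$, the components of the four-momentum of the photon
are given by


$$ \begin{pmatrix} p_x' \\ p_y' \\ p_z' \\ iE'/c \end{pmatrix} = \frac{h\nu'}{c}\begin{pmatrix} \cos\theta' \\ \sin\theta' \\ 0 \\ i \end{pmatrix} = \begin{pmatrix} \gamma_r & 0 & 0 & i\beta_r\gamma_r \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ -i\beta_r\gamma_r & 0 & 0 & \gamma_r \end{pmatrix} \frac{h\nu}{c}\begin{pmatrix} \cos\theta \\ \sin\theta \\ 0 \\ i \end{pmatrix} \tag{3.9} $$

where $\gamma_r = 1/\sqrt{1 - V^2/c^2}$ and $\beta_r = V/c$, from which

$$ \nu'\cos\theta' = \gamma_r\nu\left( \cos\theta - \frac{V}{c} \right) \tag{3.10a} $$

$$ \nu'\sin\theta' = \nu\sin\theta \tag{3.10b} $$

and

$$ \nu' = \gamma_r\nu\left( 1 - \frac{V}{c}\cos\theta \right) \tag{3.11a} $$

$$ \tan\theta' = \frac{\sin\theta}{\gamma_r(\cos\theta - V/c)} \tag{3.11b} $$

In general, $\theta' \neq \theta$ and, unlike the case of acoustic waves, there is
a doppler shift for $\theta = 90°$. This transverse doppler shift arises from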
the different rates at which clocks run in S’ and S as seen by an observer 
in either frame. 
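
The doppler relations can be checked by transforming the photon's momentum and energy directly with Eqs. (3.8). The sketch below works in units with $h = c = 1$, an assumption made only to keep the arithmetic transparent, so that $E = \nu$ and $|\mathbf{p}| = \nu$; the function names are ours.

```python
import math


def boost_photon(nu, theta, beta):
    """Apply Eqs. (3.8) to a photon; return (nu', theta') in frame S'.

    Units with h = c = 1 are assumed, so E = nu and |p| = nu.
    """
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    px, py, energy = nu * math.cos(theta), nu * math.sin(theta), nu
    px_prime = gamma * (px - beta * energy)
    py_prime = py
    energy_prime = gamma * (energy - beta * px)
    return energy_prime, math.atan2(py_prime, px_prime)


def doppler_frequency(nu, theta, beta):
    """Eq. (3.11a): nu' = gamma nu (1 - (V/c) cos theta)."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return gamma * nu * (1.0 - beta * math.cos(theta))


if __name__ == "__main__":
    beta = 0.6
    for degrees in (0.0, 90.0, 180.0):
        theta = math.radians(degrees)
        nu_prime, theta_prime = boost_photon(1.0, theta, beta)
        print(
            f"theta = {degrees:5.1f} deg  nu'/nu = {nu_prime:.4f}  "
            f"(Eq. 3.11a: {doppler_frequency(1.0, theta, beta):.4f})  "
            f"theta' = {math.degrees(theta_prime):7.2f} deg"
        )
```

At $\theta = 90°$ the frequency ratio is $\gamma_r$ rather than unity, which is the transverse doppler shift described above.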


3.4 Relativistic Mechanics 


In classical mechanics the conservation of energy and the conservation of 
momentum are key laws. In relativistic mechanics the energy of a 
particle may be looked upon as the time component of the momentum 
four-vector multiplied by $c/\sqrt{-1}$. Conservation of momentum and
conservation of energy can be regarded as the spatial and temporal
aspects of a single conservation of four-momentum. In the absence
of an external resultant force on a particle, $p_x$, $p_y$, $p_z$, and $E$ are all
constants of the motion. When $N$ particles interact or decay in an isolated
system, the laws of conservation of both momentum and energy are
implicit in the relation


$$ \left( \sum_{i=1}^{N_i} P_i \right)_{\text{initial}} = \left( \sum_{i=1}^{N_f} P_i \right)_{\text{final}} \tag{3.12} $$


where $P_i$ is the four-momentum for the $i$th particle.

The rest mass $M_0$ of a system of particles, such as an atom, can
be defined as the total energy of the system in the center-of-mass
reference frame divided by $c^2$. Then, for a system of noninteracting
particles, i.e., freely moving with no potential energy of interaction, the
rest mass $M_0$ is greater than the sum of the rest masses of the particles
by $1/c^2$ times the kinetic energy of the particles in the center-of-mass
frame. However, when there are attractive forces between the particles,
the negative potential energy arising from the interaction may bind the
particles into a composite body with $M_0 < \sum_i (m_0)_i$. For an atom the
difference $\Delta M = M_0 - \sum_i (m_0)_i$ is called the mass defect, and $\Delta Mc^2$ is
the binding energy of the atom. When $\Delta M$ is negative for every possible
way in which an atom can be broken up, the atom is stable. When $\Delta M$
is positive for some form of breakup, it is energetically possible for the
atom to decay spontaneously. This criterion is useful in atomic and
nuclear physics.

While the statement that “mass is a form of energy” is frequently 
made and successfully applied in many problems, it is ambiguous in 
some cases. For example, when a number of uranium 236 atoms fission
in a nuclear explosion, the total rest mass of the system (fission products,
kinetic energies of all particles including photons and neutrinos, etc.)
is unchanged, but the sum of the rest masses of the individual constituents
is less by about 0.1 percent than the original rest mass. An
unambiguous statement of the relationship between rest mass and energy
is available from the four-momentum vector for the system, which is the
resultant of the four-momenta of the constituents. Here the energy $E$
is proportional to the time component of the momentum four-vector
(this component is $iE/c$), while the rest mass of the system is a measure
of the magnitude of the four-vector (the magnitude is $iM_0c$). We may
summarize as follows. 

In any reaction between two or more bodies, the resultant momen- 
tum-energy four-vector of the system is conserved, and so the ordinary laws 
of momentum and energy conservation coalesce into a single relativistic 
law. Further, for any particle the invariant magnitude of $P$ is $im_0c$,
where $m_0$ is the rest mass of the particle. For particles with zero rest
mass, such as photons or neutrinos, p = E/c. The relativistic con- 
servation of momentum-energy is valid for both elastic and inelastic 
collisions. 
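
A completely inelastic collision gives a simple illustration of this summary. In the sketch below two equal bodies approach with opposite velocities and stick together; the rest mass of the composite is obtained from the magnitude of the resultant four-momentum. Units with $c = 1$, motion along $x$ only, and the function names are all assumptions made for illustration.

```python
import math


def four_momentum(m0, v, c=1.0):
    """Return (p_x, E) for a body of rest mass m0 moving along x with velocity v."""
    gamma = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    return (gamma * m0 * v, gamma * m0 * c**2)


def boost(p, E, V, c=1.0):
    """Eqs. (3.8): momentum and energy in a frame S' moving with velocity V along x."""
    g = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return (g * (p - V * E / c**2), g * (E - V * p))


def total(momenta):
    """Resultant of several (p_x, E) pairs."""
    return (sum(p for p, _ in momenta), sum(E for _, E in momenta))


def invariant_mass(p, E, c=1.0):
    """Rest mass from the invariant E^2 - p^2 c^2 = M0^2 c^4."""
    return math.sqrt(E**2 - (p * c) ** 2) / c**2


if __name__ == "__main__":
    bodies = [four_momentum(1.0, 0.6), four_momentum(1.0, -0.6)]
    p, E = total(bodies)
    print("in S : p =", p, " E =", E, " M0 =", invariant_mass(p, E))
    p2, E2 = total([boost(pb, Eb, 0.5) for pb, Eb in bodies])
    print("in S': p =", p2, " E =", E2, " M0 =", invariant_mass(p2, E2))
```

The composite rest mass exceeds the sum of the two rest masses by the initial kinetic energy divided by $c^2$, and it has the same value in both frames even though the momentum and energy do not.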

As we have seen in Sec. 3.3, certain ambiguities arise when one 
considers Newton’s second law in moving inertial frames. Minkowski 
devised a formulation of this law which is consistent with the postulates 
of special relativity by equating what we call the Minkowski force four- 
vector F to the derivative of the four-momentum with respect to proper 
time 


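$$ \begin{aligned} F = \frac{dP}{d\tau} &= \mathbf{1}_x\frac{dp_x}{d\tau} + \mathbf{1}_y\frac{dp_y}{d\tau} + \mathbf{1}_z\frac{dp_z}{d\tau} + \mathbf{1}_w\frac{dp_w}{d\tau} \\ &= \mathbf{1}_x\gamma\frac{dp_x}{dt} + \mathbf{1}_y\gamma\frac{dp_y}{dt} + \mathbf{1}_z\gamma\frac{dp_z}{dt} + \mathbf{1}_w\gamma\frac{i}{c}\frac{dE}{dt} \\ &= \gamma\frac{d\mathbf{p}}{dt} + \mathbf{1}_w\gamma\frac{i}{c}\frac{dE}{dt} \end{aligned} \tag{3.13} $$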


The spatial part of the Minkowski force is equal to $\gamma\,d\mathbf{p}/dt$, where $\mathbf{p}$
is the ordinary momentum. By the classical definition the ordinary
force is $\mathbf{f} = d\mathbf{p}/dt$, so that the spatial part of the Minkowski force is
greater than the ordinary force by the factor $\gamma$ and its spatial
components are


$$ F_x = \frac{f_x}{\sqrt{1 - v^2/c^2}} \qquad F_y = \frac{f_y}{\sqrt{1 - v^2/c^2}} \qquad F_z = \frac{f_z}{\sqrt{1 - v^2/c^2}} \tag{3.14} $$

The time component of the Minkowski force is


$$ F_w = \frac{i}{c}\frac{dE}{dt}\frac{1}{\sqrt{1 - v^2/c^2}} $$


The time rate of change of energy of the particle is equal to the rate at
which the force $\mathbf{f}$ does work on the particle, and this is the power $\mathbf{f} \cdot \mathbf{v}$
delivered by the force. Thus the time component of $F$ becomes


$$ F_w = \frac{i}{c}\frac{dE}{dt}\gamma = \frac{i}{c}\frac{\mathbf{f} \cdot \mathbf{v}}{\sqrt{1 - v^2/c^2}} \tag{3.15} $$


where $\mathbf{f}$ and $\mathbf{v}$ are the classical force and velocity, respectively, and the
Minkowski force may be written


$$ F = \gamma\mathbf{f} + \mathbf{1}_w\,i\gamma\,\mathbf{f} \cdot \mathbf{v}/c $$
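
One consequence of this form, not stated explicitly above, is that the Minkowski force is orthogonal to the four-velocity: since $P \cdot P = -m_0^2c^2$ is constant, $F \cdot u = 0$. The sketch below checks this numerically from Eqs. (3.6), (3.14), and (3.15), in units with $c = 1$ and with arbitrary sample values of $\mathbf{f}$ and $\mathbf{v}$ chosen by us.

```python
import math

C = 1.0  # units with c = 1, an assumption made for simplicity


def gamma_factor(v):
    """gamma = 1/sqrt(1 - v^2/c^2) for an ordinary velocity v = (vx, vy, vz)."""
    return 1.0 / math.sqrt(1.0 - sum(c * c for c in v) / C**2)


def four_velocity(v):
    """Eq. (3.6): u = gamma (vx, vy, vz, ic)."""
    g = gamma_factor(v)
    return [g * v[0], g * v[1], g * v[2], 1j * g * C]


def minkowski_force(f, v):
    """Spatial part gamma f, Eq. (3.14); time part i gamma (f . v)/c, Eq. (3.15)."""
    g = gamma_factor(v)
    power = sum(a * b for a, b in zip(f, v))  # f . v
    return [g * f[0], g * f[1], g * f[2], 1j * g * power / C]


def dot4(a, b):
    """Scalar product of two four-vectors (no complex conjugation)."""
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    f = (1.0, 2.0, 3.0)    # ordinary force, arbitrary sample values
    v = (0.3, -0.4, 0.5)   # ordinary velocity in units of c
    F, u = minkowski_force(f, v), four_velocity(v)
    print("F . u =", dot4(F, u))
    print("u . u =", dot4(u, u), "(should equal -c^2)")
```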


3.5 Relativity and Electromagnetism 


Contrary to the situation in mechanics, a review of the laws of the elec- 
tromagnetic field shows them to be in harmony with relativity as they 
stand. This might perhaps have been expected in view of the fact 
that the theory of relativity actually developed out of experiments in 
that part of the field of electromagnetism which is called optics. Never- 
theless there are many aspects of electromagnetic theory to which it is 
fruitful to apply relativity theory. The advantages of using four-space 
notation turn out to be great. 

Let us postulate that the charge $q$ of a particle is a scalar invariant.
Consider a uniform charge distribution at rest in frame $S$. The charge
density in a small cube $\Delta x\,\Delta y\,\Delta z$ is $\rho_0 = \sum q_i/(\Delta x\,\Delta y\,\Delta z)$, where the sum
is carried out over all charges in the cube. An observer $O'$ in a moving
frame $S'$ agrees on the charge enclosed in the cube, but by Eq. (2.5),
$\Delta x' = \Delta x\sqrt{1 - V^2/c^2}$, although $\Delta y' = \Delta y$ and $\Delta z' = \Delta z$. Thus $O'$ finds
the charge density to be $\rho' = \rho_0/\sqrt{1 - V^2/c^2} = \gamma_r\rho_0$; because of the
Lorentz-FitzGerald contraction of the volume element, $\rho'$ exceeds $\rho_0$.

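Next consider a uniform charge distribution of density $\rho$ moving
with velocity $\mathbf{v}$ relative to frame $S$. The current density is then
$\mathbf{J} = \sum q_i\mathbf{v}/(\Delta x\,\Delta y\,\Delta z) = \rho\mathbf{v} = \gamma\rho_0\mathbf{v}$, where $\gamma = 1/\sqrt{1 - v^2/c^2}$. Clearly,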
in S’ the current density J’ is different, both because v’ differs from v 
and because of space contraction. 

Although neither $\rho$ nor $\mathbf{J}$ is invariant under Lorentz transformation,
we may introduce a current-density four-vector in the following way.
Let $u$ be the four-velocity of the charges in the $S$ frame and $\rho_0$ be the
proper charge density, viz., that in a frame in which the charges are at
rest. Then the four-vector $J = \rho_0u$ is, by Eq. (3.6),


$$ \begin{aligned} J &= \mathbf{1}_x\gamma\rho_0v_x + \mathbf{1}_y\gamma\rho_0v_y + \mathbf{1}_z\gamma\rho_0v_z + \mathbf{1}_w i\gamma c\rho_0 \\ &= \mathbf{1}_x J_x + \mathbf{1}_y J_y + \mathbf{1}_z J_z + \mathbf{1}_w J_w \end{aligned} \tag{3.16} $$


where $\gamma = 1/\sqrt{1 - v^2/c^2}$. The spatial components of $J$ correspond to
the components of the current density, while the fourth component $J_w$
is $ic\gamma\rho_0 = ic\rho$.
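
The properties of the current-density four-vector can be illustrated numerically. The sketch below builds $J = \rho_0u$ from Eq. (3.16), recovers the charge density $\rho = \gamma\rho_0$ from the fourth component, and evaluates the invariant $J \cdot J = -c^2\rho_0^2$; the function names and sample values are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def four_current(rho0, v):
    """Eq. (3.16): J = rho0 u = gamma rho0 (vx, vy, vz, ic)."""
    gamma = 1.0 / math.sqrt(1.0 - sum(c * c for c in v) / C**2)
    return [gamma * rho0 * v[0], gamma * rho0 * v[1], gamma * rho0 * v[2],
            1j * gamma * C * rho0]


def charge_density(J):
    """Charge density rho recovered from the fourth component J_w = ic rho."""
    return (J[3] / (1j * C)).real


def dot4(a, b):
    """Scalar product of two four-vectors (no complex conjugation)."""
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    rho0 = 2.0  # proper charge density in C/m^3, an arbitrary sample value
    J = four_current(rho0, (0.6 * C, 0.0, 0.0))
    print("rho / rho0 =", charge_density(J) / rho0)
    print("J . J / (c rho0)^2 =", (dot4(J, J) / (C * rho0) ** 2).real)
```

The magnitude of $J$ depends only on the proper density $\rho_0$, just as the magnitude of the four-momentum depends only on the rest mass.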

The equation of continuity in electromagnetism, which follows from 
the conservation of charge, is 


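$$ \nabla \cdot \mathbf{J} + \frac{\partial\rho}{\partial t} = 0 \tag{3.17} $$


where the vector operator del is $\nabla = \mathbf{1}_x\,\partial/\partial x + \mathbf{1}_y\,\partial/\partial y + \mathbf{1}_z\,\partial/\partial z$ in
cartesian coordinates. In relativity it is convenient to introduce the
four-vector operator $\Box$ defined by


$$ \Box = \mathbf{1}_x\frac{\partial}{\partial x} + \mathbf{1}_y\frac{\partial}{\partial y} + \mathbf{1}_z\frac{\partial}{\partial z} + \mathbf{1}_w\frac{\partial}{\partial w} \tag{3.18} $$


In this language the equation of continuity becomes simply


$$ \Box \cdot J = 0 \tag{3.17a} $$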


3.6 Maxwell’s Equations and the Four-potential¹


The ordinary force $\mathbf{f}$ acting on a charge $q$ moving with velocity $\mathbf{v}$ is
given by the Lorentz relation


$$ \mathbf{f} = q[\mathbf{E} + (\mathbf{v} \times \mathbf{B})] \tag{3.19} $$


where the electric intensity $\mathbf{E}$ is the force per unit charge when the
charge is at rest relative to the reference frame. The difference
$\mathbf{f} - q\mathbf{E} = q(\mathbf{v} \times \mathbf{B})$ is the force due to the magnetic induction $\mathbf{B}$.
The electric intensity $\mathbf{E}$ and the magnetic induction $\mathbf{B}$ are thus defined
for a particular reference frame; the distinction between electric and
magnetic fields is in part a relative one, depending on the frame of
reference. For example, as we saw in Sec. 2.2, charges at rest relative
to the $S$ frame produce only an electrostatic field, but to an observer
in the $S'$ frame, these same charges constitute current elements with
an associated magnetic field.


¹ Familiarity with Maxwell’s equations is assumed in Secs. 3.6 and 3.7.

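The electric displacement $\mathbf{D}$ is defined by the relation
$\mathbf{D} = \epsilon_0\mathbf{E} + \mathbf{P}$, where $\mathbf{P}$ is the polarization (the electric dipole moment
per unit volume) and $\epsilon_0$ is the permittivity of free space. Similarly,
the magnetizing field $\mathbf{H}$ is defined by $\mathbf{H} = \mathbf{B}/\mu_0 - \mathbf{M}$, where $\mu_0$ is
the permeability of free space and $\mathbf{M}$ is the magnetization (magnetic
moment per unit volume). If $\mathbf{J}$ represents current density and $\rho$
charge density, these quantities are related through Maxwell’s
equations:


$$ \nabla \times \mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t} \tag{3.20a} $$

$$ \nabla \cdot \mathbf{D} = \rho \tag{3.20b} $$

$$ \nabla \times \mathbf{H} = \mathbf{J} + \frac{\partial\mathbf{D}}{\partial t} \tag{3.20c} $$

$$ \nabla \cdot \mathbf{B} = 0 \tag{3.20d} $$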


In view of Eq. (3.20d), B can be expressed as the curl of a vector 
potential A; 


B=VxA (3.21) 

When Eq. (3.21) is substituted into Eq. (3.20a) to find E, one obtains 
oA 

Eis aye (3.22) 


where the constant of integration is written as minus the gradient of
a scalar electrostatic potential φ, determined by the distribution of
charges in space. To reduce the complications we shall confine this
discussion to free space, in which case we can find the electrostatic
potential at any point P by making this point the origin of an S
coordinate system and evaluating the integral

$$\phi(t) = \frac{1}{4\pi\epsilon_0}\int_{\text{space}} \frac{\rho(r,\, t - r/c)}{r}\, dU \tag{3.23}$$

where dU is a volume element. To obtain φ at time t we must evaluate
ρ at time t − r/c.
Similarly, the sources of the vector potential A are currents,
and A is given by

$$\mathbf{A}(t) = \int_{\text{space}} \frac{\mu_0\,\mathbf{J}(r,\, t - r/c)}{4\pi r}\, dU \tag{3.24}$$

Since in free space c² = 1/μ₀ε₀, Eq. (3.23) can be put in the form

$$\frac{i\phi}{c} = \int_{\text{space}} \frac{\mu_0\, ic\rho(r,\, t - r/c)}{4\pi r}\, dU = \int_{\text{space}} \frac{\mu_0\, J_w(r,\, t - r/c)}{4\pi r}\, dU$$

which represents the time component A_w of a four-potential

$$\mathsf{A} = \mathbf{i}A_x + \mathbf{j}A_y + \mathbf{k}A_z + \mathbf{l}\,\frac{i\phi}{c} = \mathbf{A} + \mathbf{l}\,\frac{i\phi}{c} \tag{3.25}$$


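In free space P and M are zero, and so D = ε₀E and B = μ₀H.
Then, taking the divergence of Eq. (3.22) yields, with the aid of Eq.
(3.20b),

$$-\nabla\cdot\mathbf{E} = \nabla^2\phi + \nabla\cdot\frac{\partial\mathbf{A}}{\partial t} = -\frac{\rho}{\epsilon_0} \tag{3.26}$$

while Eqs. (3.20) to (3.22) can be combined to give

$$\nabla^2\mathbf{A} - \nabla(\nabla\cdot\mathbf{A}) - \mu_0\epsilon_0\frac{\partial(\nabla\phi)}{\partial t} - \mu_0\epsilon_0\frac{\partial^2\mathbf{A}}{\partial t^2} = -\mu_0\mathbf{J} \tag{3.27}$$
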
Now A is not completely defined by Eq. (3.21), since an infinity of 
vectors may have the same curl. We are further free to require that 
its divergence have some specified value; in this case we follow Lorentz 
in imposing the condition 


$$\nabla\cdot\mathbf{A} + \mu_0\epsilon_0\frac{\partial\phi}{\partial t} = 0 \tag{3.28}$$


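Since both A and φ are continuous functions of x, y, z, and t, we may
exchange the order of differentiation. This, plus imposition of the
Lorentz condition and use of c² = 1/μ₀ε₀, permits us to write Eqs.
(3.27) and (3.26) in the form

$$\nabla^2\mathbf{A} - \frac{1}{c^2}\frac{\partial^2\mathbf{A}}{\partial t^2} = -\mu_0\mathbf{J} \tag{3.27a}$$

$$\nabla^2\phi - \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = -\frac{\rho}{\epsilon_0} = -\mu_0 c^2\rho \tag{3.26a}$$

With

$$\Box^2 = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} + \frac{\partial^2}{\partial w^2} = \nabla^2 - \frac{1}{c^2}\frac{\partial^2}{\partial t^2}$$

these equations become

$$\Box^2\mathbf{A} = -\mu_0\mathbf{J} \qquad \Box^2\phi = -\frac{\rho}{\epsilon_0} = -\mu_0 c^2\rho$$

or

$$\Box^2\mathsf{A} = -\mu_0\mathsf{J} \tag{3.29}$$

in four-vector notation, where $\mathsf{A}$ and $\mathsf{J}$ are the four-potential and the four-current density. In terms of the four-potential the Lorentz
condition (3.28) is simply $\Box \cdot \mathsf{A} = 0$.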


3.7 The Field Tensor 


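If one knows the components of the four-potential in one frame, one
can make a Lorentz transformation to a moving frame and, from the
potentials, find the electric and magnetic intensities. However, often
one knows the fields in one frame and would like to find the fields in
another frame without going through the potentials. This can be
done through the field tensor f_ik, given by

$$
f_{ik} = \Box \times \mathsf{A} =
\begin{pmatrix}
0 & \frac{\partial A_y}{\partial x} - \frac{\partial A_x}{\partial y} & \frac{\partial A_z}{\partial x} - \frac{\partial A_x}{\partial z} & \frac{\partial A_w}{\partial x} - \frac{\partial A_x}{\partial w} \\
\frac{\partial A_x}{\partial y} - \frac{\partial A_y}{\partial x} & 0 & \frac{\partial A_z}{\partial y} - \frac{\partial A_y}{\partial z} & \frac{\partial A_w}{\partial y} - \frac{\partial A_y}{\partial w} \\
\frac{\partial A_x}{\partial z} - \frac{\partial A_z}{\partial x} & \frac{\partial A_y}{\partial z} - \frac{\partial A_z}{\partial y} & 0 & \frac{\partial A_w}{\partial z} - \frac{\partial A_z}{\partial w} \\
\frac{\partial A_x}{\partial w} - \frac{\partial A_w}{\partial x} & \frac{\partial A_y}{\partial w} - \frac{\partial A_w}{\partial y} & \frac{\partial A_z}{\partial w} - \frac{\partial A_w}{\partial z} & 0
\end{pmatrix}
=
\begin{pmatrix}
0 & B_z & -B_y & -\frac{iE_x}{c} \\
-B_z & 0 & B_x & -\frac{iE_y}{c} \\
B_y & -B_x & 0 & -\frac{iE_z}{c} \\
\frac{iE_x}{c} & \frac{iE_y}{c} & \frac{iE_z}{c} & 0
\end{pmatrix}
\tag{3.30}
$$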

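The field tensor is an antisymmetric second-rank tensor. To
transform such a tensor from frame S to frame S′ is somewhat more
involved than to transform a vector. A four-vector, for example,
transforms according to

$$V'_i = \sum_{k=1}^{4} c_{ik}V_k$$

where c_ik is the direction cosine between the i axis in the S′ frame and
the k axis in the S frame and V_k is respectively the x, y, z, or w
component of V for k = 1, 2, 3, or 4. The terms of a second-rank tensor
have two indices and transform according to the equation

$$T'_{ik} = \sum_{j=1}^{4}\sum_{l=1}^{4} c_{ij}c_{kl}T_{jl} \tag{3.31}$$

In matrix notation,

$$
\begin{pmatrix}
T'_{11} & T'_{12} & T'_{13} & T'_{14} \\
T'_{21} & T'_{22} & T'_{23} & T'_{24} \\
T'_{31} & T'_{32} & T'_{33} & T'_{34} \\
T'_{41} & T'_{42} & T'_{43} & T'_{44}
\end{pmatrix}
=
\begin{pmatrix}
c_{11} & c_{12} & c_{13} & c_{14} \\
c_{21} & c_{22} & c_{23} & c_{24} \\
c_{31} & c_{32} & c_{33} & c_{34} \\
c_{41} & c_{42} & c_{43} & c_{44}
\end{pmatrix}
\begin{pmatrix}
T_{11} & T_{12} & T_{13} & T_{14} \\
T_{21} & T_{22} & T_{23} & T_{24} \\
T_{31} & T_{32} & T_{33} & T_{34} \\
T_{41} & T_{42} & T_{43} & T_{44}
\end{pmatrix}
\begin{pmatrix}
c_{11} & c_{21} & c_{31} & c_{41} \\
c_{12} & c_{22} & c_{32} & c_{42} \\
c_{13} & c_{23} & c_{33} & c_{43} \\
c_{14} & c_{24} & c_{34} & c_{44}
\end{pmatrix}
\tag{3.32}
$$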


as can be verified by multiplying out the matrices and comparing
the result with the expansion of Eq. (3.31). Note that the third
matrix on the right-hand side of the equation is the transpose of the
first, i.e., the matrix formed by interchanging rows and columns so
the element in the ith row and kth column of the original matrix
appears in the kth row and ith column of the transpose matrix.
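The field tensor in the S′ frame is therefore

$$
\left(f'_{ik}\right) =
\begin{pmatrix}
\gamma & 0 & 0 & i\beta\gamma \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
-i\beta\gamma & 0 & 0 & \gamma
\end{pmatrix}
\begin{pmatrix}
0 & B_z & -B_y & -\frac{iE_x}{c} \\
-B_z & 0 & B_x & -\frac{iE_y}{c} \\
B_y & -B_x & 0 & -\frac{iE_z}{c} \\
\frac{iE_x}{c} & \frac{iE_y}{c} & \frac{iE_z}{c} & 0
\end{pmatrix}
\begin{pmatrix}
\gamma & 0 & 0 & -i\beta\gamma \\
0 & 1 & 0 & 0 \\
0 & 0 & 1 & 0 \\
i\beta\gamma & 0 & 0 & \gamma
\end{pmatrix}
\tag{3.33}
$$

where β = V/c and γ = 1/√(1 − β²), from which we obtain

$$
\begin{aligned}
B'_x &= B_x & B'_y &= \gamma\left(B_y + \frac{VE_z}{c^2}\right) & B'_z &= \gamma\left(B_z - \frac{VE_y}{c^2}\right) \\
E'_x &= E_x & E'_y &= \gamma\,(E_y - VB_z) & E'_z &= \gamma\,(E_z + VB_y)
\end{aligned}
\tag{3.34}
$$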
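The matrix product of Eq. (3.33) can be checked numerically against the component form of Eq. (3.34). The sketch below assumes SI units, the index order x, y, z, w with w = ict, and relative motion along the x axis; the function names and the sample field values are hypothetical and chosen only for illustration.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s


def field_tensor(E, B):
    """Build the 4x4 field tensor of Eq. (3.30); index order is x, y, z, w."""
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    k = 1j / C_LIGHT
    return [
        [0, Bz, -By, -k * Ex],
        [-Bz, 0, Bx, -k * Ey],
        [By, -Bx, 0, -k * Ez],
        [k * Ex, k * Ey, k * Ez, 0],
    ]


def lorentz_matrix(V):
    """Direction-cosine matrix c_ik for frame S' moving with speed V along x."""
    beta = V / C_LIGHT
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return [
        [gamma, 0, 0, 1j * beta * gamma],
        [0, 1, 0, 0],
        [0, 0, 1, 0],
        [-1j * beta * gamma, 0, 0, gamma],
    ]


def transform_tensor(c, T):
    """Eq. (3.31): T'_ik = sum over j and l of c_ij * c_kl * T_jl."""
    n = range(4)
    return [
        [sum(c[i][j] * c[k][l] * T[j][l] for j in n for l in n) for k in n]
        for i in n
    ]


def fields_from_tensor(f):
    """Read E and B back out of a field tensor laid out as in Eq. (3.30)."""
    E = tuple((1j * C_LIGHT * f[i][3]).real for i in range(3))
    B = (f[1][2].real, f[2][0].real, f[0][1].real)
    return E, B


def transform_fields(E, B, V):
    """Eq. (3.34), written out component by component."""
    gamma = 1.0 / math.sqrt(1.0 - (V / C_LIGHT) ** 2)
    Ex, Ey, Ez = E
    Bx, By, Bz = B
    E_prime = (Ex, gamma * (Ey - V * Bz), gamma * (Ez + V * By))
    B_prime = (
        Bx,
        gamma * (By + V * Ez / C_LIGHT**2),
        gamma * (Bz - V * Ey / C_LIGHT**2),
    )
    return E_prime, B_prime


# Arbitrary illustrative fields (V/m and T) and a frame speed of 0.6c.
E = (1.0, 2.0, 3.0)
B = (1.0e-8, 2.0e-8, -1.0e-8)
V = 0.6 * C_LIGHT

E_tensor, B_tensor = fields_from_tensor(
    transform_tensor(lorentz_matrix(V), field_tensor(E, B))
)
E_direct, B_direct = transform_fields(E, B, V)
```

Both routes should give the same primed fields to within rounding error, and the scalar product E·B should come out the same in the two frames.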


These equations can be used to ascertain the effect of uniform
motion upon the field of a point charge. Let charge q be moving with
speed v. We take the x axis through the charge and in the direction
of its motion. Consider any point P not on the x axis and let this
point and v establish the xy plane (Fig. 3.2). We choose an S′ frame
in which q is at rest; in this frame the field is purely electrostatic,
and its components are

$$E'_x = \frac{qx'}{4\pi\epsilon_0 r'^3} \qquad E'_y = \frac{qy'}{4\pi\epsilon_0 r'^3} \qquad E'_z = 0 \tag{3.35}$$


Since the S frame is moving with a velocity V = −v relative
to S′, the transformation equations to the S frame corresponding to
Eq. (3.34) are

$$
\begin{aligned}
B_x &= B'_x & B_y &= \gamma\left(B'_y - \frac{vE'_z}{c^2}\right) & B_z &= \gamma\left(B'_z + \frac{vE'_y}{c^2}\right) \\
E_x &= E'_x & E_y &= \gamma\,(E'_y + vB'_z) & E_z &= \gamma\,(E'_z - vB'_y)
\end{aligned}
\tag{3.36}
$$

Fig. 3.2 Charge q at rest in frame S′ moving with speed v along the x
axis of frame S.


Now B′ = 0 in S′, and so B_x = 0 = B_y and E_z = 0. By Eq. (2.4),
x′ = γ(x − vt) and y′ = y. Further, x − vt = r cos θ and y = r sin θ,
and

$$r'^2 = x'^2 + y'^2 = \gamma^2 r^2\cos^2\theta + r^2\sin^2\theta = \gamma^2 r^2\left[\cos^2\theta + \frac{\sin^2\theta}{\gamma^2}\right] = \gamma^2 r^2\left[1 - \frac{v^2\sin^2\theta}{c^2}\right]$$


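Upon making these substitutions in Eqs. (3.36), we obtain

$$
\begin{aligned}
E_x &= \frac{q\,(1 - v^2/c^2)\cos\theta}{4\pi\epsilon_0 r^2\left[1 - (v^2\sin^2\theta)/c^2\right]^{3/2}} \\
E_y &= \frac{q\,(1 - v^2/c^2)\sin\theta}{4\pi\epsilon_0 r^2\left[1 - (v^2\sin^2\theta)/c^2\right]^{3/2}} \\
B_z &= \frac{\mu_0\, qv\,(1 - v^2/c^2)\sin\theta}{4\pi r^2\left[1 - (v^2\sin^2\theta)/c^2\right]^{3/2}}
\end{aligned}
\tag{3.37}
$$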


The fact that E_y/E_x = tan θ shows that the electric field points
radially outward from the instantaneous location of q. In spherical
polar coordinates E is in the r direction and B in the φ direction in
any inertial frame with its origin instantaneously at q. However,
the field of a moving charge is concentrated into the equatorial plane
(Fig. 3.3).


Fig. 3.3 The concentration of the electric flux of a moving charge into
the equatorial plane, with panels labeled β = 0 and β = 0.99. The
inverse-square law applies at all speeds. (From R. B. Leighton,
“Principles of Modern Physics.” Copyright 1959. McGraw-Hill Book
Company. Used by permission.)
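The degree of concentration follows from the angular factor common to E_x and E_y in Eq. (3.37), namely (1 − β²)/[1 − β² sin² θ]^{3/2} with β = v/c. The short sketch below evaluates that factor along the line of motion and in the equatorial plane; the function name and the choice β = 0.99 are illustrative assumptions.

```python
import math


def field_enhancement(theta, beta):
    """Angular factor of |E| in Eq. (3.37), relative to q/(4*pi*eps0*r**2)."""
    return (1.0 - beta**2) / (1.0 - (beta * math.sin(theta)) ** 2) ** 1.5


beta = 0.99  # illustrative speed, v = 0.99c
gamma = 1.0 / math.sqrt(1.0 - beta**2)

along_motion = field_enhancement(0.0, beta)          # reduces to 1/gamma**2
equatorial = field_enhancement(math.pi / 2.0, beta)  # reduces to gamma
ratio = equatorial / along_motion                    # therefore gamma**3
```

At a fixed distance r the field is weakened by 1/γ² ahead of and behind the charge and strengthened by γ at right angles to the motion, so the two directions differ by a factor γ³.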
Many other conclusions can be drawn by transforming fields
and potentials from one frame to another; e.g., one can easily show
that B·H − D·E, A·J − ρφ, and E·B are invariants having the
same values in both S and S′ frames. For plane waves in free space
μ₀H² = ε₀E², so that B·H − D·E = 0 in any inertial frame; thus
plane waves in one frame transform to plane waves in another frame,
although both direction of propagation and frequency are altered, as
we saw in Sec. 3.3.


3.8 General Theory of Relativity 


In considering the bearing of the special theory of relativity upon 
physical laws, we have said nothing about gravitation. After pub- 
lishing the special theory, Einstein took up the problem of harmoniz- 
ing the laws of gravitation with the requirements of that theory. 
Since no physical effect can be transmitted from one place to another 
with a velocity exceeding that of light, it may be assumed that gravitational
effects are propagated with a finite velocity. What, then, is
the law of this propagation? 

At the same time, another idea was active in Einstein’s mind.
In the special theory, only unaccelerated frames of reference had been
compared. Why this limitation? Could not the principle of relativity
be generalized somehow so that frames of all sorts would stand
on an equal footing? 

In studying these questions, Einstein was impressed by the fact 
that gravitational acceleration is exactly the same for all bodies, how- 
ever much they differ in density or in other properties. In this 
respect, gravitational acceleration resembles the relative acceleration 
which appears when a frame of reference is itself subjected to accelera- 
tion, a matter of common experience. Everyone knows how, when 
riding on an elevator, he seems momentarily lighter when the elevator 
is accelerated downward. This effect simulates closely an actual 
change in the force of gravity. By no mechanical experiment, indeed, 
can an apparent gravitational field thus produced be distinguished 
from a true field due to gravitational attraction. 
Eventually Einstein came to the conclusion that in the neighborhood
of any given point, there should be no difference between a
gravitational field due to attracting matter and the “apparent” field
due to acceleration of a frame of reference. This proposition he
adopted as a postulate called the strong principle of equivalence. If
the principle is accepted, it leads to the prediction of a number of
physical effects hitherto unobserved.

Light, for example, had not commonly been supposed to be
subject to gravitational action. But suppose the earth’s gravitational
field were abolished within a laboratory by allowing the whole laboratory
to fall freely. Then, relative to the laboratory, there would be
no gravitational attraction; a ball thrown horizontally would travel
in a straight line relative to the laboratory, not in a parabola. By
the principle of equivalence, therefore, a ray of light projected horizontally
would also appear to travel in a straight line; for conditions
relative to the laboratory are the same as they would be out in space
far from all attracting masses, and there is no doubt that in such locations
rays of light are straight. Relative to the earth, however, the
path of the ray of light would be slightly curved.

The principle of equivalence implies that a uniform gravitational
field can be transformed away in its entirety by a proper choice of the
frame of reference. Any field can be transformed away in the neighborhood
of a single point, but, in general, the choice of frame that
does this varies from point to point. For example, relative to a frame
falling freely in New York there is no gravitational field in New York
but one of double strength in Australia.

There remains the problem as to the law according to which 
gravitating matter determines which frames have the inertial property. 
The law must be such that its consequences agree with those derived
from Newton’s law of gravitation as a first approximation, since this
law describes the motions of the solar system with high accuracy;
and it must also be in harmony with the special theory of relativity.
Einstein surmised that the law could probably be stated most simply
in terms of a formulation that would permit the use not only of any
frame of reference in the ordinary sense but of any sort of generalized
coordinates. With the aid of the mathematician Grossmann he found
out how to write physical laws in a form that is valid for any choice
of space-time coordinates whatever. The method involves the use of
general tensor analysis. Suffice it to say that Einstein found that
among all possible guesses as to the correct law of gravitation one stood
out in contrast to all others as the simplest in mathematical form.
He adopted this law as a tentative hypothesis and then proceeded to
look for predictions based on it which could be tested by experiment.

From the new law of gravitation thus obtained, Einstein deduced 
three novel effects that might be accessible to observation. 

1. The motion of the planets should be very slightly altered. In
particular, the perihelion of the orbit of Mercury should be caused
to precess about the sun at the rate of 43 seconds of arc per century.
Perturbations by the other planets cause an advance of the perihelion
by 5556 seconds of arc per century, but the observed advance
is 5600 seconds of arc per century. Einstein’s theory seemed to have
removed this annoying discrepancy in astronomical theory. However,
in 1961 Dicke and Brans introduced a modified relativity theory
which predicts a 10 percent smaller relativistic precession, and in
1967 Dicke and Goldenberg found evidence of a solar quadrupole
moment large enough to bring the calculated and observed precessional
rates into reasonable agreement.

2. Rays of light passing close to a heavy body should be bent 
toward it. In the case of the sun, the deflection should be inversely 
proportional to the distance of closest approach of the ray to the sun's 
center and for a ray just grazing the sun’s surface should amount to 
1.75 seconds of arc. Stars seen adjacent to the sun during an eclipse
appear to be displaced outward by about 2 seconds of arc (with considerable
scatter in the data).

3. Physical processes in a region of low gravitational potential, 
when compared with similar processes at a point of high potential, 
should be found to take place more slowly. Consequently, atomic vibrations
on the sun are slowed down, and spectral lines observed in the spectrum
of sunlight are shifted toward the red as compared with lines
emitted or absorbed by the same elements on the earth by an amount
Δλ/λ = 2.12 × 10⁻⁶. In 1960 Pound and Rebka showed that 14.4-keV
γ rays from excited Fe⁵⁷ nuclei arising from the β decay of Co⁵⁷
underwent 0.97 ± 0.04 times the predicted frequency shift in moving
through a vertical distance of 22 m at the surface of the earth.
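The magnitudes quoted in these three predictions can be reproduced from the standard weak-field formulas of general relativity, none of which is derived in this section. The sketch below therefore assumes those formulas: a perihelion advance of 6πGM/[c²a(1 − e²)] per orbit, a deflection of 4GM/(c²b) for a ray passing at distance b, and a fractional shift GM/(c²R) for light leaving the solar surface (gh/c² over a height h near the earth). The constants and orbital elements are rounded reference values supplied for illustration, not figures from the text.

```python
import math

# Rounded reference constants (assumed here; none are quoted in the text).
G = 6.674e-11            # gravitational constant, N m^2/kg^2
C_LIGHT = 2.99792458e8   # speed of light, m/s
M_SUN = 1.989e30         # solar mass, kg
R_SUN = 6.957e8          # solar radius, m
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def perihelion_advance(a, e, period_days):
    """Relativistic perihelion advance in seconds of arc per century."""
    per_orbit = 6.0 * math.pi * G * M_SUN / (C_LIGHT**2 * a * (1.0 - e**2))
    orbits_per_century = 36525.0 / period_days
    return per_orbit * orbits_per_century * ARCSEC_PER_RAD


def light_deflection(b):
    """Deflection, in seconds of arc, of a ray passing the sun at distance b."""
    return 4.0 * G * M_SUN / (C_LIGHT**2 * b) * ARCSEC_PER_RAD


def solar_redshift():
    """Fractional shift of light leaving the solar surface (the much
    smaller potential at the earth is neglected)."""
    return G * M_SUN / (C_LIGHT**2 * R_SUN)


def tower_shift(height_m, g=9.81):
    """Fractional frequency shift over a height near the earth's surface."""
    return g * height_m / C_LIGHT**2


# Mercury: semimajor axis (m), eccentricity, period (days); assumed values.
mercury = perihelion_advance(a=5.791e10, e=0.2056, period_days=87.969)
grazing = light_deflection(R_SUN)
sun_shift = solar_redshift()
tower = tower_shift(22.0)
```

With these inputs the perihelion advance, the grazing deflection, and the solar shift come out close to the 43 seconds of arc per century, 1.75 seconds of arc, and 2.12 × 10⁻⁶ quoted above, while the shift over 22 m is of order 10⁻¹⁵, which indicates why so sharp a γ-ray line was needed to detect it.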

Einstein’s theory of space, time, and gravitation, thus arrived
at, is known as the general theory of relativity. According to this
theory, the spatial behavior of matter is not quite euclidean. If a
triangle of astronomical size near a heavy body like the sun were
surveyed by means of rigid rods, with or without the help of light
signals, the angles would not quite add up to 180°; and so on. Einstein’s
great intellectual achievement has been followed by various
proposed modifications, some of which lead to predicted phenomena
so close to those of Einstein that present observational data are not
adequate to decide between theories.


Problems


1. Prove that the matrix equation (3.2) is equivalent to the Lorentz 
space-time transformation, Eq. (2.4). 
2. Prove that the magnitude of a four-vector A is a scalar invariant
for two inertial frames by showing that

$$A_1'^2 + A_2'^2 + A_3'^2 + A_4'^2 = A_1^2 + A_2^2 + A_3^2 + A_4^2$$

3. Show that the magnitude of the momentum four-vector is im₀c.
4. Show that the four-velocity has the dimensions of speed. Prove 
that it transforms as a four-vector by showing its magnitude is a con- 
stant independent of the inertial frame in which it is measured. 
5. Let frame S’ be moving with constant velocity V along the y axis 
of frame S. If corresponding axes of the two frames are parallel, write 
the Lorentz transformation for the radius four-vector. 
6. Make a Lorentz transformation of the four-velocity from a frame S 
to a frame S’ moving with speed V along the z axis of S. Show that the 
results are compatible with the Poincaré velocity transformation of 
Eq. (2.7). 
7. Show that for any system of N freely moving particles, such as the
molecules of gas in a box, the total rest mass is given by

$$M = \sum_{i=1}^{N}\left[(m_0)_i + \frac{K_i}{c^2}\right]$$

where (m₀)_i and K_i are the rest mass and kinetic energy of the ith particle
respectively.
8. A proton of rest mass m is accelerated to a speed v ~ c in a syn- 
chrotron. If it undergoes a perfectly inelastic collision with a nucleus 
at rest of mass M, find the rest mass and the speed of the resulting 
compound nucleus. 

Ans: √(m² + M² + 2γmM); γmv/(γm + M)
9. (a) In frame S two identical balls A and B of rest mass m₀ have equal
and opposite velocities of magnitude v. They collide and coalesce to
form a single particle at rest in S. Apply the conservation of momentum-energy
to the collision and find the rest mass of the system after the
collision.

(b) Repeat the problem in frame S’, in which ball A is at rest, 
and show that the rest mass of the system after the collision is the same 
as in part (a). 

10. Transform the four-vector F in S to F′ in S′ by use of the Lorentz
transformation. Use the result to show that the transformation equations
for ordinary force are

$$f'_x = f_x - \frac{Vv_y}{c^2 - Vv_x}\,f_y - \frac{Vv_z}{c^2 - Vv_x}\,f_z$$

$$f'_y = \frac{\sqrt{1 - V^2/c^2}}{1 - Vv_x/c^2}\,f_y \qquad f'_z = \frac{\sqrt{1 - V^2/c^2}}{1 - Vv_x/c^2}\,f_z$$

11. Show that □ · A = 0 is the Lorentz condition, Eq. (3.28), in four-vector
notation.

12. A frame S₁ moves with velocity V₁ = β₁c with respect to a frame
S. A second frame S₂ moves with velocity V₂ = β₂c relative to S₁.
Find the velocity of S₂ relative to S and show that

$$\beta = \frac{\beta_1 + \beta_2}{1 + \beta_1\beta_2}$$

13. Show that if a photon of energy hν is absorbed by a stationary
nucleus (or other particle) of rest mass M₀, the excited particle has a
mass M₀ + hν/c² and a speed hνc/(hν + M₀c²).

14. An electron of rest energy m₀c² and total energy E strikes an
electron at rest. Show that if E >> m₀c², the maximum energy available
in the zero-momentum frame is √(2m₀c²E).

15. A particle of mass M, initially at rest, decays into two particles
with rest masses m₁ and m₂. Show that the total energy of m₁ is
(M² + m₁² − m₂²)c²/2M.

16. An isolated excited nucleus of mass M emits a γ ray in a transition
to the ground state. If the excited state lies an energy ΔE above the
ground state, show that the energy of the photon is ΔE(1 − ΔE/2Mc²).
(Note that this is a special case of the preceding problem.)


17. Given an arbitrary velocity in frame S such that v_x = v cos θ,
show that in S′ (in which v′_x = v′ cos θ′)

$$\sqrt{1 - \frac{v'^2}{c^2}} = \frac{\sqrt{1 - V^2/c^2}\,\sqrt{1 - v^2/c^2}}{1 - vV(\cos\theta)/c^2}$$

and

$$\tan\theta' = \frac{\sqrt{1 - V^2/c^2}\; v\sin\theta}{v\cos\theta - V}$$

Show further that if v = c,

$$\tan\theta' = \frac{\sqrt{1 - V^2/c^2}\,\sin\theta}{\cos\theta - V/c}$$

which is the relativistic equation for the aberration of light derived by
Einstein in his first paper.
18. A positive pion can be produced through the reaction p + p →
p + n + π⁺ by bombarding protons at rest with high-energy protons.
If the rest energies for p, n, and π⁺ are respectively 938, 939.5, and
135 MeV, find the minimum kinetic energy for the incident protons for
this reaction. Hint: Work first in a frame in which the center of mass
is at rest, the zero-momentum frame.

Ans: 280 MeV
19. Antiprotons are produced by bombarding protons at rest with
high-energy protons through the reaction p + p → p + p + (p + p̄).
In the center-of-mass frame find the minimum kinetic energy of each
proton required for the reaction. In this frame and for this energy what
are the values of β and γ? Transform to the laboratory frame and find
the minimum kinetic energy of the incident proton for the reaction.

Ans: m₀c²; √3/2, 2; 6m₀c²
20. Show that B·H − D·E, A·J − ρφ, and E·B are invariants
under Lorentz transformation. (They are therefore valid world tensors.)
21. A long straight wire of radius r₀ in an evacuated tube bears a current
I along the axis of a hollow cylindrical plate of radius R which is at a
potential V relative to the wire. If the central wire is hot enough to
emit electrons of negligible energy, show that electrons can reach the
cylinder only if I is less than

$$\frac{2\pi\sqrt{(Ve/c)^2 + 2m_0Ve}}{\mu_0 e\,\ln(R/r_0)}$$


22. In a crossed-field velocity selector particles of the desired velocity 
pass through undeflected because the forces due to the electric (qE,) 
and magnetic (qv.B.) fields are equal and opposite. Show that the force 
on a particle of the desired velocity is zero in the frame in which the 
particle is at rest. 

23. Stark showed that the spectral lines of elements are split into 
several components by an electric field. Wien showed that Stark pat- 
terns are observed in the spectrum of hydrogen when a beam of protons 
moves at high speed v perpendicular to a magnetic field B, in an other- 
wise field-free space. Show that the effective electric field in which the 
proton finds itself in its own rest frame has magnitude γvB by finding
the field tensor for this frame. 

24. A long straight wire is electrically neutral in the S frame in which
it bears a current density j = −nev_D, where n is the number of conduction
electrons per unit volume and v_D is the drift velocity of the electrons.
Show that an observer moving with speed V parallel to the wire sees it
charged with a net charge density

$$\rho'_{\text{net}} = \rho'_+ + \rho'_- = \frac{neVv_D}{c^2\sqrt{1 - V^2/c^2}}$$

25. Show that the Lorentz force per unit volume ρ(E + v × B) is the
space part of a four-vector. Determine the fourth component and show
that it is a measure of the power expended by the electric field per unit
volume.


chapter four
The question whether matter is continuous or atomic in nature was
fairly well settled during the nineteenth century. A major part of modern
physics is concerned with the structure and properties of atoms and of
combinations of atoms as gaseous molecules and crystalline solids. Although
classical physics did not achieve a thorough understanding of these topics,
it left a vast heritage of enduring material, some part of which is the subject
of this chapter.


4.1 Chemical Evidence for Atoms 


Early in the nineteenth century chemists placed the atomic hypothesis
on a quantitative basis. It was then known that there are many
elements (substances which cannot be resolved into simpler components
by chemical means) and a much larger number of compounds which
can be broken down into elements.


Whenever a particular compound is formed, the mass of one element required to
combine with a fixed mass of another is always the same.


This great generalization is the law of definite proportions or the law of 
definite composition. In 1803 Dalton proposed that each element con- 
sists of atoms, which are very small, indivisible particles, each atom of a 
given kind being identical. When different kinds of atoms combine, a 
compound is formed. Dalton’s theory implies the law of definite proportions.
Indeed, Dalton predicted that two elements A and B might
form more than one compound and thus presaged the law of multiple
proportions:


When the same two elements form more than one compound, the different quantities
of element A which combine with the same quantity of element B are in the ratio of small
integers.


In 1808 Gay-Lussac showed that when two gases interact to form
a gaseous compound, the ratios of the volumes of the reacting gases and
of the resulting gas are represented by small whole numbers. Three
years later Avogadro made the distinction between atoms and molecules
and advanced his celebrated hypothesis:


Under identical conditions of temperature and pressure equal volumes of all gases 
contain equal numbers of molecules. 


While an atom is the smallest unit of an element which can participate
in a chemical reaction, a molecule is the smallest unit of an element or
compound which exists stably by itself in the gaseous form. Molecules
are said to be monatomic if they are single atoms, e.g., helium, neon;
diatomic if they consist of two atoms, e.g., hydrogen, oxygen, and nitrogen;
triatomic if they are composed of three atoms, e.g., water, carbon
dioxide; and so forth.


For many years the chemical 
igning the number 16.0000 to 
c weight of the lightest element, 
unity. In 1960 to 1961 the 
correctness of Avogadro’s hypothesis that 1 kg at wt samples of all
elements have the same number of atoms, now known to be 6.0225 ×
10²⁶. We call it Avogadro’s number and represent it by N_A; N_A molecules
of a gaseous compound constitute 1 kmole of the substance.


4.2 The Ideal Gas 


In 1738 Daniel Bernoulli explained gas pressure in terms of the collisions 
of molecules with the walls of the container and thus laid the quanti- 
tative foundation for the kinetic theory of gases, which received further 
treatment in the hands of Herapath (1821), Waterston (1843), Krönig
(1856), and Clausius (1857). A highly unsophisticated model of a real gas
is an ideal gas, consisting of a huge number of identical monatomic 
molecules, each occupying a negligible volume. These molecules are in 
constant motion in random directions with a broad range of speeds. 
When a molecule collides with a wall or another molecule, we shall 
assume for the moment that the collisions are elastic. (This restriction 
may be relaxed; see Sec. 4.3). In the molecular chaos postulated, all 
positions in the container and all directions of the velocities are equally 
probable. 

Consider a single molecule of mass m and speed v in an otherwise
empty cubical box with sides of length L, one corner of which serves as
origin for cartesian coordinates with axes along edges of the box. Let
v_x be the x component of the particle velocity. When this molecule
collides with a face parallel to the yz plane, the x component of its momentum
is changed by 2mv_x. A time L/v_x later the molecule strikes the
opposite wall, and in time 2L/v_x it returns to the first face. Hence the
number of collisions per unit time with this face is v_x/2L, and the change
in momentum per second at this face for one molecule is

$$2mv_x\left(\frac{v_x}{2L}\right) = \frac{mv_x^2}{L}$$


Let us now add identical molecules until there are Avogadro’s number
N_A molecules in the box. It can be shown that the resultant change
in momentum per second at the face is just what it would be if the
molecules did not collide with each other; hence

$$\text{Change in momentum/second} = \sum_{i=1}^{N_A}\frac{m}{L}\,(v_x)_i^2 = N_A\,\frac{m}{L}\,(v_x^2)_{av}$$

where (v_x²)_av is the average value of the square of v_x for all N_A molecules.
By Newton’s second law the change in momentum per second is equal
to the average force on the face, which in turn is the product of the
pressure p and the area L² of the face.

$$F = pL^2 = \frac{N_A m\,(v_x^2)_{av}}{L} \tag{4.1}$$

Now L³ is the volume of the box and the volume occupied by 1 kmole.
We call it V_m and rewrite Eq. (4.1)

$$pV_m = N_A m\,(v_x^2)_{av} \tag{4.1a}$$


Similar arguments for faces parallel to the xy and xz planes lead to relations
pV_m = N_A m(v_z²)_av and pV_m = N_A m(v_y²)_av, where p is known experimentally
to be the same at all faces. Therefore, (v_x²)_av = (v_y²)_av = (v_z²)_av,
and since (v²)_av = (v_x²)_av + (v_y²)_av + (v_z²)_av,

$$pV_m = \frac{2N_A}{3}\left[\tfrac{1}{2}m\,(v^2)_{av}\right] \tag{4.2}$$

But, by the general gas law

$$pV_m = RT \tag{4.3}$$

where T is the absolute temperature and R is the universal gas constant,
8314 J/kmole-°K. Comparison of Eqs. (4.2) and (4.3) leads to

$$\tfrac{1}{2}m\,(v^2)_{av} = \frac{3}{2}\frac{R}{N_A}\,T = \tfrac{3}{2}kT \tag{4.4}$$


from which we see that the average kinetic energy of translation for an
ideal gas molecule is directly proportional to absolute temperature. The
ratio k of the universal gas constant to Avogadro’s number is known
as the Boltzmann constant and has the value 1.38 × 10⁻²³ J/molecule-°K.
The energy kT, which will appear frequently in subsequent sections,
has the value 4 × 10⁻²¹ J or 1/40 eV at 290°K, just below room temperature.
Less than 0.02 percent of the molecules in a gas have an energy
greater than 10kT.
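The 0.02 percent figure can be checked with a short calculation. The sketch below assumes that the translational kinetic energies follow the Maxwell-Boltzmann distribution, which is not derived until later in this chapter; for that distribution the fraction of molecules with energy above x·kT is erfc(√x) + 2√(x/π) e⁻ˣ. The function name is illustrative.

```python
import math


def fraction_above(x):
    """Fraction of molecules whose translational kinetic energy exceeds
    x*kT, assuming a Maxwell-Boltzmann energy distribution."""
    return math.erfc(math.sqrt(x)) + 2.0 * math.sqrt(x / math.pi) * math.exp(-x)


fraction_10kT = fraction_above(10.0)      # fraction above 10 kT
percent_10kT = 100.0 * fraction_10kT      # the same, in percent
```

Under this assumption the result is a little below 0.02 percent, consistent with the statement above.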
Equation (4.4), derived above for an ideal gas, is applicable to any
real gas for conditions under which the general gas law, Eq. (4.3), is
obeyed. At room temperature molecules of hydrogen, nitrogen, oxygen,
and carbon dioxide in the atmosphere all have the same average kinetic
energies of translation, so that the root-mean-square speed v_rms = [(v²)_av]^½
of each type is inversely proportional to the square root of the mass of
the molecule. As
the temperature is reduced, attractive forces between molecules become
important and the gas liquefies. (Extension of classical ideas to liquids
and solids led to the conclusion that at 0°K all atomic and molecular
motion would cease, but this extrapolation is not valid. Quantum
mechanics predicts a zero-point motion at 0°K.)
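Equation (4.4) fixes ½m(v²)_av = (3/2)kT for every species, so that v_rms = (3RT/M)^½ for a gas of molecular weight M kg/kmole. The sketch below evaluates kT and v_rms at 290°K using the values of R and N_A quoted in this chapter; the molecular weights are rounded reference values assumed for illustration, and the function names are hypothetical.

```python
import math

R_GAS = 8314.0     # universal gas constant, J/(kmole K), as quoted in the text
N_A = 6.0225e26    # Avogadro's number per kmole, as quoted in the text

# Rounded molecular weights in kg/kmole (assumed reference values).
MOLECULAR_WEIGHT = {"H2": 2.016, "N2": 28.013, "O2": 31.999, "CO2": 44.010}


def thermal_energy(T):
    """kT in joules, with k = R/N_A."""
    return R_GAS / N_A * T


def v_rms(molecular_weight, T):
    """Root-mean-square speed in m/s from Eq. (4.4): sqrt(3RT/M)."""
    return math.sqrt(3.0 * R_GAS * T / molecular_weight)


kT_290 = thermal_energy(290.0)
speeds_290 = {gas: v_rms(M, 290.0) for gas, M in MOLECULAR_WEIGHT.items()}
```

The speeds scale as the inverse square root of the molecular weight, so hydrogen molecules move about four times as fast as oxygen molecules at the same temperature.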


4.3 Degrees of Freedom and the Equipartition of Energy 


Any mechanical system has a certain number of degrees of freedom,
defined as the minimum number of independent coordinates in terms of
which the positions of all the masses composing the system can be
specified. For example, a block sliding in a groove has one degree of
freedom; if it is sliding on ice with no possibility of being set into rotation,
it has two degrees of freedom; if it can also rotate about a vertical axis,
it has three. A monatomic gas molecule behaving as if it had only
energy of translation has three degrees of freedom, corresponding to the
three coordinates x, y, z (or r, θ, φ), that specify its position in space.
An ideal diatomic molecule formed by two mass-point atoms has six
degrees of freedom, which might be chosen as the coordinates of the two
atoms or, alternatively, as the three coordinates of the center of mass,
two angular coordinates to locate the axis of the molecule, and a sixth
to locate the distance of one atom from the center of mass. If the distance
between the atoms is fixed, only five coordinates are needed, and
so there are only five degrees of freedom. In general, a triatomic
molecule has nine degrees of freedom, but this number is reduced to six
if the separations of the atoms are fixed in a plane and to only five if
they are fixed along a line.


Degrees of freedom are additive: the total number of degrees of
freedom of a system of bodies is the sum of the number of degrees of
freedom possessed by all the bodies which make up the system. To
each degree of freedom there corresponds an independent term in the
expression for the kinetic energy of the body in question. For example,
monatomic molecules behave as though they had only translational
motion. The average kinetic energy K_av = (3/2)kT by Eq. (4.4), and there
are three degrees of freedom per molecule; there is on the average ½kT
of energy associated with each degree of freedom. The classical principle
of equipartition of energy states:


For a system in thermal equilibrium the average kinetic energy associated with each
degree of freedom has the same value, ½kT.


This principle, proposed by Clausius in 1857 and used by Maxwell
in 1860, was useful in explaining the specific heats of monatomic gases
and some solids at room temperature, but it failed for other gases and
for all solids at low temperature. Not until the development of quantum
mechanics were these specific-heat anomalies resolved. For an ideal
monatomic gas the energy E_m per kilomole is the product of N_A
and the average kinetic energy per molecule. Hence, by Eq. (4.4),
E_m = N_A(3RT/2N_A) = (3/2)RT. The heat capacity per kilomole at constant
volume is given by (c_v)_kmole = (∂E_m/∂T)_V = (3/2)R. This result is in
excellent agreement with the measured values for monatomic gases; for
example, c_v/R = 1.52 for helium and 1.51 for argon at room temperature.

For a rigid diatomic molecule with five degrees of freedom, equipartition
predicts that E_m = (5/2)RT, so that c_v/R should be 5/2; experimental
values are 2.44 for hydrogen, 2.45 for nitrogen, 2.50 for oxygen. However,
the ratio of 3.02 for chlorine presented a problem; if the two atoms
were rigidly connected, the ratio should be 2.5. If the atoms could
vibrate, it should be 3.5; the addition of this simple harmonic motion
would give six degrees of freedom, and associated with the average
kinetic energy of simple harmonic motion there is an equal average
potential energy which adds another ½kT per molecule. Similar problems
arose in connection with HOH and SO₂, for which c_v/R has the values
3.3 and 3.79, respectively, at room temperature.
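The equipartition predictions and the measured ratios quoted above can be set side by side. In the sketch below each quadratic term in the energy (one per degree of freedom for kinetic energy, plus one for the potential energy of a vibration) is assumed to contribute ½R to the heat capacity per kilomole; the measured values are those given in the text, while the dictionary layout and names are illustrative.

```python
def cv_over_R(quadratic_terms):
    """Classical equipartition: each quadratic energy term adds R/2 to c_v."""
    return quadratic_terms / 2.0


PREDICTED = {
    "monatomic": cv_over_R(3),               # three translational terms
    "rigid diatomic": cv_over_R(5),          # three translational, two rotational
    "vibrating diatomic": cv_over_R(5 + 2),  # plus vibrational kinetic and potential
}

# Measured c_v/R at room temperature, as quoted in the text, paired with the
# rigid-molecule model that equipartition would first suggest.
MEASURED = {
    "helium": ("monatomic", 1.52),
    "argon": ("monatomic", 1.51),
    "hydrogen": ("rigid diatomic", 2.44),
    "nitrogen": ("rigid diatomic", 2.45),
    "oxygen": ("rigid diatomic", 2.50),
    "chlorine": ("rigid diatomic", 3.02),
}

deviation = {
    gas: value - PREDICTED[model] for gas, (model, value) in MEASURED.items()
}
```

All the gases except chlorine fall within a few hundredths of the rigid-molecule prediction; chlorine lies about halfway between the rigid value 2.5 and the vibrating value 3.5, which is the anomaly described above.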

If energy is exchanged between rotational (or vibrational) kinetic
energy and translational energy in collisions of polyatomic molecules,
the assumption of Sec. 4.2 that the collisions are elastic is not justified.
Actually, it is not necessary that collisions be elastic but only that
$(v^2)_{av}$ in Eq. (4.2) remain constant. This condition is satisfied when
the system is in equilibrium, since equilibrium means that such macroscopic
quantities as $(v^2)_{av} = v_{rms}^2$ are stationary in time.

In 1819 Dulong and Petit found that “the product of the atomic
weight and the specific heat is the same for all elementary (solid)
substances.” This product is called the atomic heat; it is numerically equal
to the heat capacity per kilomole. At ordinary temperatures this law
of Dulong and Petit holds roughly for many substances, the heat capacities
for 58 elements ranging from 2.7R to 3.5R with an average
of 3.1R.

A few striking exceptions occur, however, notably boron, with heat
capacity 1.7R; beryllium, 1.9R; diamond (carbon), 0.73R; and silicon,
2.5R. These are all light elements with high melting points. Extended
[...] of these elements increase [...]er observed that the specific
[...] C. Later it was found that [...] substances decrease rapidly if the temperature
is lowered sufficiently, tending toward zero at 0°K. Typical curves
(Fig. 4.1) for elementary substances are all of the same form; i.e., they
can be brought (almost) into coincidence by making a suitable choice
of the temperature scale for each. Such a striking regularity must have
its source in a simple general principle (treated in Sec. 20.7). Classical
theory suggested that in a solid the atoms vibrate about certain mean
positions, behaving like harmonic oscillators with three degrees of freedom.
The heat capacity per kilomole should be the sum of $\tfrac{3}{2}R$ for
kinetic energy plus $\tfrac{3}{2}R$ for the associated potential energy, a total of 3R
in approximate agreement with observations. However, classical theory
offers no explanation for the rapid decrease in the heat capacity at low
temperature.

Fig. 4.1 Variation of the heat capacity per kilomole at constant volume
with temperature. (Axes: $C_V$ in kcal/(kmole)(°K) versus temperature in °K.)


4.4 The Maxwellian Distribution 


From the earliest days of kinetic theory it was recognized that the 
speeds of the gas molecules would vary over a wide range at any instant 
and that the speed of any one molecule would vary over a similar range 
in time as a result of collisions with other molecules. In 1859 Maxwell
in a brief and brilliant paper deduced the distribution of velocities by
the following reasoning.

Consider a gas of $N$ identical molecules, each of which has velocity
components $v_x$, $v_y$, $v_z$. Let the number of molecules for which $v_x$ lies
between $v_x$ and $v_x + dv_x$ be $N f(v_x)\,dv_x$, where $f(v_x)$ is the function to
be determined. Similarly, the number of molecules for which $v_y$ lies
between $v_y$ and $v_y + dv_y$ is $N f(v_y)\,dv_y$ and the number with $v_z$ between
$v_z$ and $v_z + dv_z$ is $N f(v_z)\,dv_z$, where $f$ always represents the same function.

Next Maxwell asserted that the value of $v_x$ does not affect in any
way the value of $v_y$ or $v_z$ since they are mutually perpendicular and
independent. Therefore, the number of molecules with velocity components
lying between $v_x$ and $v_x + dv_x$, $v_y$ and $v_y + dv_y$, and $v_z$ and $v_z + dv_z$
is $N f(v_x) f(v_y) f(v_z)\,dv_x\,dv_y\,dv_z$.

If we represent each velocity by a scaled vector with its tail at
the origin, the number terminating in any volume element $dv_x\,dv_y\,dv_z$
in this velocity space is just the same $N f(v_x) f(v_y) f(v_z)\,dv_x\,dv_y\,dv_z$. But
since the x, y, and z axes are perfectly arbitrary, this number must depend
only on the distance from the origin, and so $f(v_x) f(v_y) f(v_z)$ must be some
function of the speed alone (or of its square). Therefore,

$$ f(v_x)\,f(v_y)\,f(v_z) = \phi(v_x^2 + v_y^2 + v_z^2) = \phi(v^2) $$

Solving this functional equation leads to the results that

$$ f(v_x) = C e^{-A^2 v_x^2} \tag{4.5a} $$

and

$$ \phi(v^2) = D e^{-A^2 v^2} \tag{4.5b} $$

where C, D, and A are positive constants. The negative sign accompanies
$A^2$ since a positive sign would lead to the number of particles in a given
volume of velocity space increasing monotonically with speed.
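The text does not spell out how the functional equation is solved. One standard route, offered here only as a sketch and not necessarily Maxwell's own argument, is to take logarithms and differentiate with respect to one velocity component:

```latex
\begin{align*}
\ln f(v_x) + \ln f(v_y) + \ln f(v_z) &= \ln \phi(v^2),
  \qquad v^2 = v_x^2 + v_y^2 + v_z^2 \\
\frac{f'(v_x)}{f(v_x)} &= 2 v_x\,\frac{\phi'(v^2)}{\phi(v^2)}
  \qquad \text{(differentiating with respect to } v_x\text{)} \\
\frac{1}{2 v_x}\frac{f'(v_x)}{f(v_x)} &= \frac{\phi'(v^2)}{\phi(v^2)}
  = \frac{1}{2 v_y}\frac{f'(v_y)}{f(v_y)} = -A^2 \\
f(v_x) &= C e^{-A^2 v_x^2}, \qquad \phi(v^2) = D e^{-A^2 v^2}, \qquad D = C^3
\end{align*}
```

The third line holds because its left side depends only on $v_x$ and its right side only on $v_y$, so both must equal a constant, written here as $-A^2$.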
In Appendix 4A the constants A, C, and D are evaluated by use
of the conditions that the total number of particles is $N$ and the total
translational kinetic energy is $\tfrac{3}{2}NkT$. Substitution of these values in
Eq. (4.5) leads to the result that the number of molecules having $v_x$
between $v_x$ and $v_x + dv_x$ is given by

$$ N f(v_x)\,dv_x = N \left(\frac{m}{2\pi kT}\right)^{1/2} e^{-m v_x^2/2kT}\,dv_x \tag{4.6} $$

while the number of molecules having a velocity v with components
between $v_x$ and $v_x + dv_x$, $v_y$ and $v_y + dv_y$, and $v_z$ and $v_z + dv_z$ is

$$ N \phi(v^2)\,dv_x\,dv_y\,dv_z = N \left(\frac{m}{2\pi kT}\right)^{3/2} e^{-m v^2/2kT}\,dv_x\,dv_y\,dv_z \tag{4.7} $$

Fig. 4.2 Maxwellian distribution of the x component of molecular
velocity.


To find the number of molecules having speed between $v$ and $v + dv$,
we replace the volume element $dv_x\,dv_y\,dv_z$ in velocity space by the volume
element $4\pi v^2\,dv$ and obtain

$$ n(v)\,dv = N\,4\pi v^2 \phi(v^2)\,dv = N \left(\frac{m}{2\pi kT}\right)^{3/2} e^{-m v^2/2kT}\,4\pi v^2\,dv \tag{4.8} $$

Figures 4.2 and 4.3 show the maxwellian distribution for $v_x$ and $v$,
respectively.

Fig. 4.3 Maxwellian distribution of molecular speeds
in units of $v_m = \sqrt{2kT/m}$. (Vertical axis: $n(v)/N$.)

The average molecular speed $\bar{v}$ is then

$$ \bar{v} = \frac{1}{N}\int_0^\infty v\,n(v)\,dv = \left(\frac{8kT}{\pi m}\right)^{1/2} \tag{4.9} $$

while the most probable speed $v_m$ is given by

$$ v_m = \left(\frac{2kT}{m}\right)^{1/2} \tag{4.9a} $$

Thus at a given temperature

$$ v_m : \bar{v} : v_{rms} = 1 : 1.1284 : 1.2248 $$
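As a numerical illustration of Eqs. (4.8) to (4.9a), the following Python sketch evaluates the three characteristic speeds and checks the average speed by integrating Eq. (4.8) directly. The molecular mass of N2 (28.0134 u), the temperature of 300.15 °K, the physical constants, and all function names are assumptions introduced for this example.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K
AMU = 1.66053907e-27    # atomic mass unit, kg


def characteristic_speeds(mass_kg, temperature_k):
    """Return (v_m, v_avg, v_rms) in m/s for a maxwellian gas."""
    kt_over_m = K_B * temperature_k / mass_kg
    v_m = math.sqrt(2.0 * kt_over_m)               # Eq. (4.9a)
    v_avg = math.sqrt(8.0 * kt_over_m / math.pi)   # Eq. (4.9)
    v_rms = math.sqrt(3.0 * kt_over_m)
    return v_m, v_avg, v_rms


def speed_density(v, mass_kg, temperature_k):
    """n(v)/N from Eq. (4.8)."""
    a = mass_kg / (2.0 * K_B * temperature_k)
    return 4.0 * math.pi * v ** 2 * (a / math.pi) ** 1.5 * math.exp(-a * v ** 2)


def mean_speed_numeric(mass_kg, temperature_k, steps=20000):
    """Midpoint-rule estimate of the average speed, integrating out to 10 v_m."""
    v_m = math.sqrt(2.0 * K_B * temperature_k / mass_kg)
    dv = 10.0 * v_m / steps
    total = 0.0
    for i in range(steps):
        v = (i + 0.5) * dv
        total += v * speed_density(v, mass_kg, temperature_k) * dv
    return total


M_N2 = 28.0134 * AMU   # assumed mass of an N2 molecule
T_ROOM = 300.15        # 27 degrees C

v_m, v_avg, v_rms = characteristic_speeds(M_N2, T_ROOM)
print(f"v_m = {v_m:.0f} m/s, v_avg = {v_avg:.0f} m/s, v_rms = {v_rms:.0f} m/s")
print(f"ratios 1 : {v_avg / v_m:.4f} : {v_rms / v_m:.4f}")
print(f"numerical average speed = {mean_speed_numeric(M_N2, T_ROOM):.0f} m/s")
```

The ratios printed are independent of the gas and of the temperature, since all three speeds scale as $\sqrt{kT/m}$.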


The maxwellian distribution of translational kinetic energies is
readily found from Eq. (4.8) by making use of the fact that the kinetic
energy $K$ of a molecule is $\tfrac{1}{2}mv^2$ and $dK = mv\,dv$. Then the number of
molecules $n(K)\,dK$ with kinetic energies between $K$ and $K + dK$ is
(Fig. 4.4)

$$ n(K)\,dK = \frac{2N}{\sqrt{\pi}\,(kT)^{3/2}}\,\sqrt{K}\,e^{-K/kT}\,dK \tag{4.10} $$

Fig. 4.4 Maxwellian distribution of molecular kinetic energies.
(Horizontal axis: kinetic energy $K$, marked at 0, $kT$, $2kT$, $3kT$.)


4.5 The Boltzmann Distribution


Consider the dynamic equilibrium of an isothermal atmosphere (Boltzmann,
1876) in which the molecules of the gas are subject to a constant
gravitational field, which results in a decrease of pressure and density
with height. Let $n$ represent the number of molecules per unit volume.
If $p$ represents pressure at height $z$, the pressure at a height $z + dz$ is
less than $p$ by the weight of the air in a volume of unit area and height
$dz$, so that

$$ -dp = nmg\,dz \tag{4.11} $$

By Eq. (4.3), $pV_m = RT = N_A kT$, so that

$$ p = \frac{N_A}{V_m}\,kT = nkT \tag{4.12a} $$

$$ dp = kT\,dn \tag{4.12b} $$

Substitution of this result in Eq. (4.11) gives

$$ -kT\,dn = nmg\,dz $$

and

$$ n = n_0 e^{-mgz/kT} \tag{4.13} $$

where $n_0$ is the number of particles per unit volume at $z = 0$. The
choice of the particular height at which $z = 0$ is arbitrary; it need not
be at the bottom of the atmosphere. Equation (4.13) is equally valid
for negative values of $z$ in the isothermal atmosphere.

If Eq. (4.12a) is used to eliminate $n$ from Eq. (4.13), one finds
that, in the isothermal atmosphere, $p$ also varies exponentially with
height:

$$ p = p_0 e^{-mgz/kT} \tag{4.14} $$

Since $mgz$ is the potential energy $P$ of a molecule relative to $z = 0$,
we may write the Boltzmann distribution for the number of molecules
per unit volume as a function of height [Eq. (4.13)] in the form

$$ \frac{n}{n_0} = e^{-mgz/kT} = e^{-P/kT} \tag{4.13a} $$
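Equations (4.13) and (4.14) can be explored with a short Python sketch. The mean molecular mass (29 u), the temperature (288 °K), the value of g, and the function names are illustrative assumptions rather than values taken from the text.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant, J/K
AMU = 1.66053907e-27    # atomic mass unit, kg
G = 9.81                # m/s^2, assumed constant with height


def scale_height(mass_kg, temperature_k, g=G):
    """Height kT/(mg) over which n and p fall by a factor e."""
    return K_B * temperature_k / (mass_kg * g)


def boltzmann_ratio(z_m, mass_kg, temperature_k, g=G):
    """n/n0 = p/p0 = exp(-m g z / kT), Eqs. (4.13) and (4.14)."""
    return math.exp(-mass_kg * g * z_m / (K_B * temperature_k))


# Illustrative inputs (assumptions, not from the text).
M_AIR = 29.0 * AMU
T_AIR = 288.0

H = scale_height(M_AIR, T_AIR)
for z in (0.0, 1000.0, 5000.0):
    print(f"z = {z:6.0f} m   n/n0 = {boltzmann_ratio(z, M_AIR, T_AIR):.3f}")
print(f"scale height kT/mg = {H:.0f} m")
```

Because the exponent is just the potential energy divided by $kT$, the same function applies to any potential energy $P$ once $mgz$ is replaced by $P$.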
4.7 The Sizes of Atoms


We can readily estimate the approximate radii of atoms, even though
an atom has no well-defined boundary. Since most liquids and solids
are difficult to compress, it is not unreasonable as a first approximation
to think of them as assemblages of hard-sphere atoms in contact with one
another. As an example, consider liquid nitrogen, which has a density $\rho$
roughly equal to that of water, 1000 kg/m$^3$. The atomic weight $A$ of
nitrogen is 14, and therefore 14 kg involves $N_A = 6.02 \times 10^{26}$ atoms, so
that the number of atoms per cubic meter is $N_A\rho/A = 4.3 \times 10^{28}$ and
the average volume per atom is $23 \times 10^{-30}$ m$^3$. A little cube of this
volume would have a side of length $(23 \times 10^{-30})^{1/3}$ m = 2.8 Å. By
considering various ways in which the nitrogen atoms might be packed,
we could vary the diameter somewhat, but not by an order of magnitude.
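The arithmetic of this estimate is easy to reproduce. The following Python sketch uses the numbers quoted above; the function name `hard_sphere_estimate` is hypothetical.

```python
N_A = 6.02e26  # atoms per kilomole, as used in the text


def hard_sphere_estimate(density_kg_m3, atomic_weight):
    """Return (atoms per m^3, volume per atom in m^3, cube side in m)."""
    number_density = N_A * density_kg_m3 / atomic_weight
    volume_per_atom = 1.0 / number_density
    cube_side = volume_per_atom ** (1.0 / 3.0)
    return number_density, volume_per_atom, cube_side


# Liquid nitrogen, with the rough density adopted in the text.
n_atoms, vol, side = hard_sphere_estimate(1000.0, 14.0)
print(f"atoms per cubic meter: {n_atoms:.2e}")
print(f"volume per atom:       {vol:.2e} m^3")
print(f"cube side:             {side * 1e10:.2f} angstrom")
```

Changing the assumed density by a factor of 2 changes the cube side by only a factor of $2^{1/3} \approx 1.26$, which is why the estimate is insensitive to the details of packing.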
Similar estimates for other atomic species lead to the conclusion
that atoms have diameters varying roughly from 1 to 4 Å, a remarkably
small range when one remembers that they differ in relative mass from
1 to 250 and in number of electrons from 1 to 100. Nor are the heavy
atoms necessarily among the larger ones. An atom of lithium occupies
a larger volume than one of platinum. In general, the alkali atoms have
the largest radii, while carbon in the diamond lattice involves an unusually
small volume per atom.
Confirming evidence that atomic diameters are of the order of
$10^{-10}$ m comes from measurements of collision probabilities and mean
free paths in gases. A particle of radius $r$ moving along a straight line
with speed $v$ through an ideal gas of molecules with radius $R$ would
collide in 1 s with every molecule in a cylinder of radius $r + R$ and length
$v$. If $n$ represents the number of molecules per unit volume, the number
of collisions per second would be $\pi(r + R)^2 vn$, and the mean free path
(average distance between collisions) $\lambda$ is given in this crude approximation
by

$$ \lambda = \frac{v}{\pi (r + R)^2 vn} = \frac{1}{\pi (r + R)^2 n} \tag{4.21} $$


When a beam of particles from an oven or other source passes
through a small thickness $\Delta x$ of gas, those particles which collide with
gas molecules are scattered out of the beam. The reduction in beam
current $-\Delta I$ is proportional to the product of the incident current $I$
and the effective area $\pi(r + R)^2 n\,\Delta x$ subtended by the molecules,
so that

$$ -\frac{\Delta I}{I} = \pi (r + R)^2 n\,\Delta x \tag{4.22} $$

from which

$$ I = I_0 e^{-\pi (r + R)^2 n x} = I_0 e^{-x/\lambda} \tag{4.23} $$


Measurements of beam attenuation in gases at reduced pressure thus
permit determination of mean free paths. By determining $\lambda$'s for appropriate
combinations of beam particles and scattering molecules, one can
obtain a reasonably self-consistent set of values of molecular radii.

Kinetic theory predicts that the viscosity of a gas is proportional
to the mean free path of the molecules. Viscosity values are also consistent
with the conclusion that atomic radii are of the order of $10^{-10}$ m.

Since mean free paths depend on the number of particles per unit
volume, they are inversely proportional to the pressure so long as the temperature
is held constant. At room temperature and pressure typical mean free
paths for oxygen and nitrogen molecules are about $10^{-7}$ m. Since the
rms speeds are roughly 500 m/s, there are of the order of $10^9$ collisions
per molecule per second.
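The orders of magnitude quoted above follow from Eqs. (4.21) and (4.23). The Python sketch below uses assumed inputs: an effective radius of 1.7 Å for both beam particle and gas molecule, a number density of $2.5 \times 10^{25}$ m$^{-3}$ (roughly an ideal gas at room temperature and 1 atm), and a speed of 500 m/s. The function names are hypothetical.

```python
import math


def mean_free_path(r_m, big_r_m, n_per_m3):
    """Eq. (4.21): lambda = 1 / (pi (r + R)^2 n)."""
    return 1.0 / (math.pi * (r_m + big_r_m) ** 2 * n_per_m3)


def collision_rate(speed_m_s, r_m, big_r_m, n_per_m3):
    """Collisions per second: pi (r + R)^2 v n."""
    return math.pi * (r_m + big_r_m) ** 2 * speed_m_s * n_per_m3


def transmitted_fraction(x_m, r_m, big_r_m, n_per_m3):
    """Eq. (4.23): I / I0 = exp(-x / lambda)."""
    return math.exp(-x_m / mean_free_path(r_m, big_r_m, n_per_m3))


# Illustrative inputs (assumptions, not measurements).
RADIUS = 1.7e-10   # m
N_GAS = 2.5e25     # molecules per m^3
SPEED = 500.0      # m/s

lam = mean_free_path(RADIUS, RADIUS, N_GAS)
rate = collision_rate(SPEED, RADIUS, RADIUS, N_GAS)
print(f"mean free path  = {lam:.2e} m")
print(f"collision rate  = {rate:.2e} per second")
print(f"beam fraction surviving one mean free path = "
      f"{transmitted_fraction(lam, RADIUS, RADIUS, N_GAS):.3f}")
```

Halving the number density (for example, by halving the pressure at fixed temperature) doubles the mean free path in this model.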


Appendix 4A - Integrals Arising in the 
Kinetic Theory of Gases 


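From Eq. (4.5) we infer that the number of molecules with $v_x$ in the
range from $v_x$ to $v_x + dv_x$ is $NCe^{-A^2 v_x^2}\,dv_x$. If we integrate from $v_x = -\infty$
to $v_x = \infty$, we must obtain the total number of molecules $N$, and so

$$ N = \int_{-\infty}^{\infty} NCe^{-A^2 v_x^2}\,dv_x = NC\,\frac{\sqrt{\pi}}{A} \tag{4A.1} $$

hence $C = A/\sqrt{\pi}$. To evaluate the integral above we note that the
indefinite integral $\int e^{-x^2}\,dx$ cannot be written in terms of elementary
functions. However, a simple trick permits the evaluation of the definite
integral. Let

$$ I = \int_{-\infty}^{\infty} e^{-x^2}\,dx = \int_{-\infty}^{\infty} e^{-y^2}\,dy $$

Then

$$ I^2 = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} e^{-(x^2 + y^2)}\,dx\,dy $$

If we convert this integral over the xy plane to one in plane polar
coordinates $r$, $\theta$, we obtain

$$ I^2 = \int_0^{2\pi} d\theta \int_0^{\infty} e^{-r^2} r\,dr = 2\pi \int_0^{\infty} r e^{-r^2}\,dr = 2\pi \left[-\frac{e^{-r^2}}{2}\right]_0^{\infty} = \pi $$

so that

$$ I = \int_{-\infty}^{\infty} e^{-x^2}\,dx = \sqrt{\pi} \qquad \text{and} \qquad \int_{-\infty}^{\infty} e^{-A^2 x^2}\,dx = \frac{\sqrt{\pi}}{A} \tag{4A.2} $$

Since $f(v_x) f(v_y) f(v_z) = \phi(v^2)$, we have

$$ C^3 e^{-A^2 v_x^2} e^{-A^2 v_y^2} e^{-A^2 v_z^2} = De^{-A^2 v^2} $$

and so

$$ D = C^3 = A^3 \pi^{-3/2} $$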


To evaluate A we find the total kinetic energy of translation and
equate it to $\tfrac{3}{2}NkT$, since we know that the average kinetic energy per
molecule is $\tfrac{3}{2}kT$. We have just found above that the number of molecules
with speeds between $v$ and $v + dv$ is $NA^3\pi^{-3/2}\,4\pi v^2 e^{-A^2 v^2}\,dv$. If we multiply
this by the kinetic energy of translation $\tfrac{1}{2}mv^2$ and integrate over all
possible speeds, we have

$$ \int_0^{\infty} N\left(\tfrac{1}{2}mv^2\right) A^3 \pi^{-3/2}\,4\pi v^2 e^{-A^2 v^2}\,dv = 2NmA^3\pi^{-1/2}\,\frac{3\sqrt{\pi}}{8A^5} = \tfrac{3}{2}NkT \tag{4A.3} $$

from which $A = \sqrt{m/2kT}$.
In kinetic theory integrals of the form $I_n = \int_0^{\infty} v^n e^{-A^2 v^2}\,dv$ appear
frequently. As Eq. (4A.2) above shows,

$$ I_0 = \int_0^{\infty} e^{-A^2 v^2}\,dv = \frac{1}{2A}\sqrt{\pi} $$

which is Gauss' probability integral. Similarly,

$$ I_1 = \int_0^{\infty} v e^{-A^2 v^2}\,dv = -\frac{1}{2A^2}\left[e^{-A^2 v^2}\right]_0^{\infty} = \frac{1}{2A^2} $$

Further,

$$ I_2 = -\frac{\partial I_0}{\partial (A^2)} = \frac{\sqrt{\pi}}{4A^3} \qquad I_3 = -\frac{\partial I_1}{\partial (A^2)} = \frac{1}{2A^4} $$

and so forth.
Although $\int e^{-x^2}\,dx$ cannot be written in terms of elementary functions,
the integral $\int_0^w e^{-x^2}\,dx$ appears often in probability
theory. As we saw above, $\int_0^{\infty} e^{-x^2}\,dx = \sqrt{\pi}/2$. It is customary to
define the error function or the probability integral as

$$ \operatorname{erf} w = \frac{2}{\sqrt{\pi}} \int_0^w e^{-x^2}\,dx $$

This integral has been evaluated numerically for various values of $w$
and is widely tabulated.
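The closed forms for $I_0$ through $I_4$ and the definition of the error function can be checked numerically. The Python sketch below uses a simple midpoint rule; the function names, the value $A = 1.3$, and the cutoff at $12/A$ are arbitrary choices made for illustration. `math.erf` is the standard-library error function.

```python
import math


def moment_numeric(n, a, steps=20000):
    """Midpoint-rule estimate of I_n = integral_0^inf v^n exp(-a^2 v^2) dv.

    The upper limit is cut off at 12/a, where the integrand is negligible.
    """
    dv = (12.0 / a) / steps
    total = 0.0
    for i in range(steps):
        v = (i + 0.5) * dv
        total += v ** n * math.exp(-(a * v) ** 2)
    return total * dv


def moment_closed(n, a):
    """Closed forms of I_0 ... I_4 obtained as in Appendix 4A."""
    root_pi = math.sqrt(math.pi)
    closed = {
        0: root_pi / (2.0 * a),
        1: 1.0 / (2.0 * a ** 2),
        2: root_pi / (4.0 * a ** 3),
        3: 1.0 / (2.0 * a ** 4),
        4: 3.0 * root_pi / (8.0 * a ** 5),
    }
    return closed[n]


def erf_numeric(w, steps=20000):
    """erf w = (2/sqrt(pi)) * integral_0^w exp(-x^2) dx, by the midpoint rule."""
    dx = w / steps
    total = sum(math.exp(-((i + 0.5) * dx) ** 2) for i in range(steps))
    return 2.0 / math.sqrt(math.pi) * total * dx


A = 1.3  # arbitrary positive constant for the check
for n in range(5):
    print(f"I_{n}: numeric = {moment_numeric(n, A):.8f}, "
          f"closed form = {moment_closed(n, A):.8f}")
print(f"erf(1): numeric = {erf_numeric(1.0):.8f}, math.erf = {math.erf(1.0):.8f}")
```

$I_4$ is the integral that enters Eq. (4A.3) and fixes $A = \sqrt{m/2kT}$.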


Problems 


1. Show that the most probable velocity of a gas molecule is $(2kT/m)^{1/2}$.
If $v_m$, $\bar{v}$, and $v_{rms}$ stand respectively for the most probable, average, and
root-mean-square velocities, show that for the maxwellian distribution
$v_{rms} : \bar{v} : v_m = \sqrt{1.5} : \sqrt{4/\pi} : 1$.
2. Find the rms speed, the average speed, and the most probable
speed of N$_2$ molecules in the air at 27°C.

Ans: $v_{rms}$ = 517 m/s
3. Derive Eq. (4.10) from Eq. (4.8).
4. Carry out the mathematical steps required to evaluate $\bar{v}$ in Eq. (4.9).
5. The statistical weight for the first excited state of sodium ($3^2P_{1/2}$) is
the same as that of the ground state ($3^2S_{1/2}$), and the energy difference
corresponding to the yellow line ($\lambda$ = 5896 Å) is 2.1 eV. If the surface
temperature of the sun is 6000°K, find the ratio of the number of Na
atoms in this excited state to the number in the ground state.

Ans: 0.018
6. The standard atmosphere of the International Civil Aviation Organization
is roughly that corresponding to average annual values of pressure,
temperature, and density at 40°N in the United States. At mean sea
level it has $p_0$ = 1013 millibars (mb) and density 1.225 kg/m$^3$. At
10 km the ICAO values are $p$ = 264 mb, $\rho$ = 0.413 kg/m$^3$. Find the
corresponding 10-km values for the isothermal atmosphere.

7. Show that the most probable kinetic energy of a gas molecule is
$kT/2$.
8. Given that the average speed of O$_2$ molecules in a container is
450 m/s, the effective radius is 1.7 Å, and the number of molecules per
cubic meter is $3 \times 10^{25}$, find the average time between collisions and
the mean free path.

Ans: $2 \times 10^{-10}$ s; $9 \times 10^{-8}$ m


9. The speed of sound in a gas is given by $V = \sqrt{\gamma p/\rho}$, where $\gamma$ is the
ratio of the specific heat at constant pressure to that at constant volume.
Find the ratio of $V$ to the average speed of a gas molecule. Obtain
numerical values of the ratio for monatomic gases ($\gamma = \tfrac{5}{3}$) and a diatomic
gas for which $\gamma = \tfrac{7}{5}$.


Ans: 0.809; 0.741


10. If $v$ denotes the speed of a molecule, calculate the average value
of $1/v$ for an ideal gas in thermal equilibrium. How does this value
compare with the value of $\bar{v}$?


11. Find the average value of v? for molecules of mass m in a gas at 
temperature T. 


12. The viscosity $\eta$ of oxygen at 20°C and 1 atm pressure is $2.03 \times 10^{-5}$
N-s/m$^2$. In 1860 Maxwell derived the relation $\eta = \tfrac{1}{3}\rho\bar{v}\lambda$ for a gas,
where $\rho$ is the density, $\bar{v}$ the average speed, and $\lambda$ the mean free path.
Calculate the mean free path $\lambda$ and the “diameter” of an oxygen
molecule.

Ans: $1 \times 10^{-7}$ m; $3.3 \times 10^{-10}$ m
13. Compute the rms values of the velocity for molecules of hydrogen, 
oxygen, and carbon dioxide at 300°K. 


14. (a) At 100°C and 1 atm pressure 1 kg of water vapor occupies a
volume of 1.67 m$^3$. Compute the volume available per HOH molecule
on the average.
(b) Given that the densities of ice and water are respectively 900
and 1000 kg/m$^3$, compute the average volume per HOH unit in each
phase.

15. Find the fraction of the molecules which have the y component of
the velocity between 0 and $v_m = \sqrt{2kT/m}$.

Ans: 0.422

16. What fraction of the molecules of a gas have x components of
velocity greater than the average speed $\bar{v}$? Greater than the rms
velocity? Greater than the most probable speed $v_m$?

17. (a) Show that the fraction of molecules with speeds between 0
and $v'$ is given by

$$ \operatorname{erf}(Av') - \frac{2Av'e^{-A^2 v'^2}}{\sqrt{\pi}} $$

where $A = \sqrt{m/2kT}$.
(b) Use this result to find the fraction of molecules with kinetic
energy $K > 10kT$.

Ans: $1.8 \times 10^{-4}$
18. Use the result of the preceding problem to find the fraction of the
molecules of a gas which have speed (a) less than the average speed $\bar{v}$;
(b) less than the rms speed; (c) greater than $2\bar{v}$.

Ans: (a) 0.53; (b) 0.60


chapter five 


The Origin of the Quantum Theory 


Relativity and quantum theory represent the two great conceptual
remodelings of classical physics which were required to bring physics into
the modern era. The quantum theory, first proposed by Planck in 1900,
arose out of the inability of classical physics to explain the experimentally
observed distribution of energy in the spectrum of a blackbody. By assuming
that electromagnetic radiation was emitted by harmonic oscillators, the energy
of which could change only by discrete energy jumps of amount $h\nu$, Planck
was able to bring theory and experiment into accord. In the following 25
years striking confirmations of the quantum hypothesis came from several
areas of physics.


5.1 Thermal Radiation 


A hot body emits radiant energy, the quantity and quality of which depend
on the temperature. Thus, the rate at which an incandescent filament
emits radiation increases rapidly with increase in temperature, and the
emitted light becomes whiter. If this light is dispersed by a prism, a
continuous spectrum is formed. Such radiation is ordinarily emitted in
an appreciable degree only by objects which are dense enough or thick
enough to be opaque.

Radiation falling upon bodies is at least partially absorbed by them. 
At all temperatures bodies are emitting and absorbing thermal radiation; 
if they are neither rising nor falling in temperature, this is because as 
much radiant energy is absorbed each second as is emitted. In order to 
deal with the simplest possible case of thermal equilibrium, consider a cavity 
whose walls and contents are all at a common temperature. An iso- 
thermal enclosure of this sort is of special interest because the field of 
radiation inside it possesses some remarkably simple properties. 

In an isothermal enclosure the stream of radiation in any given direction
must be the same as in any other direction; it must be the same at every
point inside the enclosure; and it must be the same in all enclosures at a
given temperature, irrespective of the materials composing them. Furthermore,
all these statements hold for each spectral component of the radiation
taken separately.

Proof of these statements proceeds by showing that if any one of 
the statements were not true, a device could be constructed that would 
violate the second law of thermodynamics. For example, if the stream 
of radiation traveling west were greater than that traveling north, we 
could introduce two similar absorbers, one facing east and the other 
south. One of these absorbers would then become hotter than the other 
by absorbing radiant energy from the stronger stream. We could, 
therefore, operate a Carnot engine, using the two absorbers as source 
and sink, and so could convert heat continuously into work without 
causing other changes in the system, in violation of the second law. 
Radiation of a particular wavelength can be tested similarly by using
selective absorbers.

The radiation field in an enclosure has important relations with the
energy emitted by the walls or other physical bodies. An ideal black
surface has the property that it absorbs completely all radiation falling
upon it. Radiation leaving a black surface consists entirely of radiation
emitted by it. Hence the stream of radiation emitted by any black surface
or body in any direction is the same as the stream of radiation that travels in
one direction in an isothermal enclosure at the same temperature. The total
energy density produced just in front of a black surface by radiation
emitted by that surface alone is just half as great as the density of energy
in an enclosure at the same temperature, since the radiation emitted
by the body is confined to a hemisphere of directions, whereas in the
enclosure radiation is traveling in all directions.

Blackbody radiation is a phenomenon of great interest from the
theoretical standpoint, because its properties have a universal character,
being independent of the properties of any particular material substance.
Several questions press at once for an answer. How does the
energy density in blackbody radiation vary with the temperature? And
what is the spectral distribution of the radiation? Furthermore, we
wish to understand how this particular distribution is brought into existence
by the atomic processes.

Concerning the first two questions, it was found possible, during
the last century, to obtain further information from thermodynamics
without making any assumption as to the atomic process. The method
consisted in considering the effect of expanding or contracting an isothermal
enclosure and taking account of the work done on the walls by
the radiation in consequence of radiation pressure. Appendix 5A summarizes
many important classical developments in radiation theory.


5.2 Early Radiation Laws 


In 1879 Stefan found empirically that the power emitted per unit area
by a blackbody is proportional to the fourth power of the absolute
temperature, or

$$ R_B = \sigma T^4 \tag{5.1} $$

where $R_B$ = blackbody radiant emittance = power radiated per unit area
$\sigma$ = Stefan's constant = $0.56686 \times 10^{-7}$ W/m$^2$-°K$^4$
$T$ = absolute temperature
Five years later Boltzmann deduced the Stefan-Boltzmann law, Eq. (5.1),
from thermodynamical considerations (Sec. 5A.2).

Not only the quantity but also the quality of radiation emitted
by a blackbody changes as the temperature increases. The distribution
of energy in the blackbody spectrum is shown in Fig. 5.1. We define
the monochromatic emissive power $e_\lambda$ thus: the radiant power emitted per
unit area in the spectral range $\lambda$ to $\lambda + d\lambda$ is $e_\lambda\,d\lambda$; clearly, the radiant
emittance $R = \int_0^{\infty} e_\lambda\,d\lambda$. From Fig. 5.1 one sees that $e_\lambda$ is a function
of both wavelength and temperature. In 1893 Wien showed (Sec. 5A.5)
that for any blackbody

$$ e_\lambda = T^5 f(\lambda T) = \lambda^{-5} F(\lambda T) \tag{5.2} $$

where $F(\lambda T) = (\lambda T)^5 f(\lambda T)$. Thus $e_\lambda/T^5$ is the same for all blackbodies,
as is shown in Fig. 5.2, where the data of Fig. 5.1, obtained in 1899 by
Lummer and Pringsheim, are presented in a different form. A single
curve serves to represent blackbody radiation at all temperatures. It
follows that if $\lambda_m$ is the wavelength at which $e_\lambda$ is maximum,

$$ \lambda_m T = \text{const} = 2.898 \times 10^{-3}\ \text{m-°K} \tag{5.3} $$

for all temperatures, a special case of Wien's displacement law.
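Equations (5.1) and (5.3) lend themselves to a quick numerical illustration. The Python sketch below uses the constants quoted in the text and the three temperatures shown in the legend of Fig. 5.2; the function names are hypothetical.

```python
SIGMA = 0.56686e-7     # Stefan's constant from Eq. (5.1), W/(m^2 K^4)
WIEN_CONST = 2.898e-3  # constant of Eq. (5.3), m K


def radiant_emittance(temperature_k):
    """Eq. (5.1): power radiated per unit area by a blackbody, W/m^2."""
    return SIGMA * temperature_k ** 4


def peak_wavelength(temperature_k):
    """Eq. (5.3): wavelength at which e_lambda is maximum, in meters."""
    return WIEN_CONST / temperature_k


for t in (1259.0, 1449.0, 1646.0):
    print(f"T = {t:6.0f} K   R_B = {radiant_emittance(t):.3e} W/m^2   "
          f"lambda_m = {peak_wavelength(t) * 1e6:.3f} micron")
```

Raising the temperature from 1259 to 1646 °K roughly triples the emitted power while moving the peak toward shorter wavelengths, which is the combined change in quantity and quality described above.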

Thus, by reasoning based on thermodynamics, the problem of
blackbody radiation is reduced to the determination of a single unknown
function, $f(\lambda T)$ or $F(\lambda T)$. All attempts to obtain the correct form for this
function from classical theory failed.

Two of the formulas proposed on a classical basis nevertheless
deserve mention. Wien derived a formula based on special assumptions
regarding emission and absorption of radiation; he found that the energy
density $U_\lambda\,d\lambda$ for wavelengths between $\lambda$ and $\lambda + d\lambda$ is

$$ U_\lambda\,d\lambda = \frac{4}{c}\,e_\lambda\,d\lambda = C_1 \lambda^{-5} e^{-C_2/\lambda T}\,d\lambda \tag{5.4} $$

where $c$ is the speed of light in free space. By proper choice of the
undetermined constants $C_1$ and $C_2$ Wien's radiation law can be made to
fit the data of Fig. 5.2 except at higher values of $\lambda T$, where it predicts
too low a value. In 1900 Rayleigh applied the principle of equipartition
of energy to blackbody radiation. This led, with a contribution from
Jeans, to a formula for $e_\lambda$ which fitted the high-$\lambda T$ part of the $e_\lambda$ curve
but rose to infinity with decreasing $\lambda T$. The reasoning employed by
Rayleigh and Jeans, discussed below, has applications in modern physics.

Fig. 5.1 Distribution of energy in constant wavelength interval $\Delta\lambda$ for the
blackbody spectrum at four Kelvin temperatures. (Axes: energy in arbitrary
units versus wavelength in $\mu$.)

Fig. 5.2 Experimental verification of the Wien displacement law for
blackbody radiation. (Horizontal axis: $\lambda T$; data shown for T = 1646°K,
1449°K, and 1259°K.)

5.3 Degrees of Freedom in an Enclosure 


A typical vibrating system, such as a violin string or organ pipe, is
capable of many modes of vibration. We are interested in the number
of modes within a given frequency range for a cavity, but it is instructive
to apply the method first to a one-dimensional case. Let a string of
great length $L$ be stretched between two fixed points A and B. Standing
waves can be set up only for those frequencies which give a whole number
of loops between A and B. Since each loop corresponds to one-half a
wavelength, standing waves occur only for those wavelengths $\lambda$ for which

$$ \lambda = \frac{2L}{n_x} \tag{5.5} $$

where $n_x$ is an integer.

We wish to know how many of these modes have wavelengths in
the range between $\lambda$ and $\lambda + \Delta\lambda$. If we decrease $n_x$ by some integer,
so that the change $\Delta n_x$ is negative, the value of $\lambda$ given by Eq. (5.5)
increases to $\lambda + \Delta\lambda$, where

$$ \frac{2L}{\lambda + \Delta\lambda} = n_x + \Delta n_x $$

When $L$ is very great and $\Delta\lambda$ is small, we may write $\Delta n_x = -2L\,\Delta\lambda/\lambda^2$,
where the magnitude of $\Delta n_x$ now represents the number of modes of vibration
with wavelengths in the range $\Delta\lambda$.

Now, associated with each mode there are two degrees of freedom,
since the vibrations are transverse and any point on the string is free
to move in a plane at right angles to the string. The total number of
degrees of freedom per unit length of string in the wavelength range
between $\lambda$ and $\lambda + \Delta\lambda$ is therefore

$$ \frac{2\,|\Delta n_x|}{L} = \frac{4\,\Delta\lambda}{\lambda^2} \tag{5.6} $$

In two dimensions the situation is more complex but involves
identical principles. Let us first discuss the system of waves in a square
of side $L$ (Fig. 5.3a). A set of waves moving initially in direction $OM_1$
will after reflection move in the direction $M_1M_2$. After reflection at $M_2$,
the direction of propagation is $M_2M_3$, which is parallel but opposite to
$OM_1$, etc. For this group of waves only four directions of motion are
possible, $\pm OM_1$ and $\pm M_1M_2$.

Fig. 5.3 (a) Reflection of waves inside a square. (b) Standing waves in
the xy plane of a cube.

The four wave trains thus formed combine to produce a set of
standing waves only if in both the x and y directions there is a whole
number of half waves. Consider a cube of side $L$ with perfectly reflecting
walls in which there are electromagnetic waves polarized with the electric
intensity in the z direction. Since the z component $E_z$ of the electric
intensity must be zero at the bounding planes when $x = 0$ or $L$ and
$y = 0$ or $L$, the allowed solutions of the wave equation are of the form

$$ E_z = f(z,t)\,\sin\frac{n_x \pi x}{L}\,\sin\frac{n_y \pi y}{L} $$

where $n_x$ and $n_y$ are integers; this is the equation for a standing wave.
The condition that there be nodes at the walls of Fig. 5.3b puts a limitation
on the allowed wavelengths; specifically, the projection of any side
along the propagation direction must be an integral number of half
wavelengths. In Fig. 5.3b the heavy arrow represents one of the propagation
directions, which makes angle $\alpha = 37°$ with the x axis; for this
particular case there are four half wavelengths associated with the x
direction and three with y.

From the figure it is clear that $\lambda_x = \lambda/\cos\alpha$ and

$$ \lambda_y = \frac{\lambda}{\cos\beta} = \frac{\lambda}{\sin\alpha} $$

Since $\sin(n_x \pi x/L) = \sin(2\pi x/\lambda_x)$, we have $n_x\pi/L = (2\pi\cos\alpha)/\lambda$; similarly
$n_y\pi/L = (2\pi\cos\beta)/\lambda$. If we solve for $n_x$ and $n_y$, square each,
and add, we have

$$ n_x^2 + n_y^2 = \frac{4L^2}{\lambda^2}(\cos^2\alpha + \cos^2\beta) = \frac{4L^2}{\lambda^2}(\cos^2\alpha + \sin^2\alpha) = \frac{4L^2}{\lambda^2} $$


This method of constructing standing waves can be extended to a
three-dimensional cube, for which eight sets of waves are involved.
The condition $n_z\pi/L = (2\pi\cos\gamma)/\lambda$ must be applied, and since
$\cos^2\alpha + \cos^2\beta + \cos^2\gamma = 1$, we find

$$ n_x^2 + n_y^2 + n_z^2 = \frac{4L^2}{\lambda^2} \tag{5.7} $$

From this equation we can find the number of possible wavelengths
which exceed a given value $\lambda_m$. This number is equal to the number
of possible combinations of the positive integers $n_x$, $n_y$, $n_z$ which make
the left-hand member of Eq. (5.7) less than $4L^2/\lambda_m^2$. To find this
number, imagine each set $n_x$, $n_y$, $n_z$ represented by a point on the
three-dimensional plot of Fig. 5.4. These points lie at the corners of cubic
cells of unit length. The number of values of $n_x$, $n_y$, $n_z$ which give
$\lambda > \lambda_m$ is therefore just the number of points inside a sphere of radius
$2L/\lambda_m$ divided by 8 (since we are interested only in the all-positive
octant), which is

$$ n = \frac{4\pi}{3}\,\frac{8L^3}{\lambda_m^3}\cdot\frac{1}{8} = \frac{4\pi}{3}\,\frac{L^3}{\lambda_m^3} $$

Differentiation of this equation gives the number of wavelengths
in a range from $\lambda$ to $\lambda + d\lambda$; thus

$$ dn = \frac{4\pi\,d\lambda}{\lambda^4}\,(\text{vol of cube}) \tag{5.8} $$

where both $dn$ and $d\lambda$ are taken as positive for convenience. For
electromagnetic waves there are two polarizations per allowed wavelength,
and so we finally obtain for the number of degrees of freedom per
unit volume for standing waves in the enclosure with wavelength from
$\lambda$ to $\lambda + d\lambda$ or frequency from $\nu$ to $\nu + d\nu$

$$ g(\lambda)\,d\lambda = g(\nu)\,d\nu = \frac{8\pi\,d\lambda}{\lambda^4} = \frac{8\pi\nu^2}{c^3}\,d\nu \tag{5.9} $$

Fig. 5.4 The Rayleigh scheme for counting the
number of allowed values of $n_x$, $n_y$, $n_z$ for which
$n_x^2 + n_y^2 + n_z^2$ is less than $r^2$.

5.4 The Rayleigh-Jeans Radiation Law 


In an isothermal enclosure, the radiation is constant both in quantity
and in spectral characteristics, so that any energy absorbed is reradiated.
The result is the same as if all the energy were reflected at the walls.
Rayleigh and Jeans assumed that oscillators in the wall absorbed and
emitted radiation constantly, with each oscillator having its own characteristic
frequency. For continuous operation of any given oscillator,
standing waves must be set up in the enclosure. However, for any
enclosure of reasonable size the differences between neighboring frequencies
are so small that the radiation appears to be continuous.

The principle of equipartition of energy requires assigning to these
oscillators $\tfrac{1}{2}kT$ of kinetic energy per degree of freedom plus another $\tfrac{1}{2}kT$
for potential energy. Assignment of an average energy of $kT$ to each
mode of vibration leads to an energy density $U_\lambda\,d\lambda$ for waves with wavelength
between $\lambda$ and $\lambda + d\lambda$ given by

$$ U_\lambda\,d\lambda = kT\,g(\lambda)\,d\lambda = 8\pi kT\lambda^{-4}\,d\lambda \tag{5.10} $$

Equation (5.10) is the Rayleigh-Jeans formula for blackbody radiation.
It will be noted that it contains no new constants. At long wavelengths
this formula agrees with observation, but near the maximum in the
spectrum and at short wavelengths it gives much too large values.
Furthermore, it assigns no maximum at all to $U_\lambda$. In fact, the energy
density $U$ in the enclosure would be infinite; for there is no lower limit
to the possible wavelengths, and $U = \int_0^{\infty} U_\lambda\,d\lambda \to \infty$. In reality, the
radiation density in an isothermal enclosure is ordinarily much less than
the energy density due to thermal agitation of the molecules of a solid
body; e.g., at 1000°K $U$ is about $7.5 \times 10^{-4}$ J/m$^3$, as compared with a
total density on the order of $10^9$ J/m$^3$ in iron. At temperatures above
10°°K the comparison would be reversed. Infinite energy in the electro- 
magnetic field would make the specific heats of all material bodies 
infinite. 
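The divergence can be made concrete by integrating Eq. (5.10) only down to a short-wavelength cutoff $\lambda_{min}$, which gives $8\pi kT/(3\lambda_{min}^3)$. The Python sketch below evaluates this for a few cutoffs at 1000 °K; the cutoff values and function names are arbitrary choices for illustration.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def rayleigh_jeans(lam_m, temperature_k):
    """U_lambda of Eq. (5.10), in J/m^3 per meter of wavelength."""
    return 8.0 * math.pi * K_B * temperature_k / lam_m ** 4


def energy_above(lam_min_m, temperature_k):
    """Closed-form integral of Eq. (5.10) from lam_min to infinity, J/m^3."""
    return 8.0 * math.pi * K_B * temperature_k / (3.0 * lam_min_m ** 3)


T = 1000.0
for lam_min in (1e-5, 1e-6, 1e-7):
    print(f"cutoff {lam_min:.0e} m -> U = {energy_above(lam_min, T):.3e} J/m^3")
```

With a cutoff of only $10^{-6}$ m the Rayleigh-Jeans energy density already far exceeds the value of about $7.5 \times 10^{-4}$ J/m$^3$ quoted for 1000 °K, and it grows by a factor of 1000 for each tenfold reduction in the cutoff.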
5.5 Planck’s Investigation of Blackbody Radiation


The correct blackbody formula was discovered in 1900 by Max Planck. 
By introducing a radical innovation quite at variance with previous con- 
cepts, he found a formula in complete agreement with experiment. This 
was the birth of the quantum theory. 

The experiments of Hertz on electromagnetic waves had seemed to 
give final confirmation to the electromagnetic theory of light, and this 
convinced Planck that the key to the blackbody spectrum would be found 
in the laws governing the absorption and emission of radiation by electric 
oscillators. We may imagine that the walls of an isothermal enclosure 
contain oscillators of all frequencies, essentially similar to the hertzian 
oscillator, and that the emission and absorption of radiation by the walls 
are caused by these oscillators. 

Investigation by means of electromagnetic theory led Planck to the 
conclusion that an oscillator, in the long run, would affect only radiation 
of the same frequency as that of the oscillator itself. In the state of 
equilibrium, there would be a definite ratio between the density of radia- 
tion of any frequency v and the average energy of the oscillators of that 
frequency. The problem of the blackbody spectrum was thus reduced 
to the problem of the average energy of an oscillator at a given tempera- 
ture. If he then assumed the classical value kT for this average energy, 
as derived from the equipartition of energy, he was led to the Rayleigh- 
Jeans formula. 

Planck, however, did not accept the principle of the equipartition of 
energy for oscillators. On the basis of a different assumption he was led 
at first to Wien’s formula for the radiation density, Eq. (5.4). In 1900 
new measurements of the blackbody spectrum by Lummer and Pringsheim 
and by Rubens and Kurlbaum showed definitely that Wien’s formula 
is not correct. Planck discovered an empirical modification of Wien’s 
formula that fitted the observations. Then he sought to modify the 
statistical theory of the distribution of energy among a set of oscillators 
so that the theory would lead to his new formula. He succeeded in doing 
so only after making a new assumption that broke drastically with classical 
principles. To facilitate understanding his assumption, a brief descrip- 
tion of the statistical theory will be given. 


5.6 Distribution of Energy among Oscillators in 
Thermal Equilibrium 


Consider a linear oscillator of mass m that vibrates in simple harmonic
motion; let its displacement be x and its momentum $p = m\,dx/dt$. The
potential energy of the oscillator can be written in the form
$\tfrac12 bx^2$, b denoting the force constant; its kinetic energy is
$\tfrac12 m(dx/dt)^2 = p^2/2m$. Thus its total energy is

$$\epsilon = \frac{p^2}{2m} + \frac12 bx^2 \tag{5.11}$$

and by elementary theory its frequency of vibration $\nu$ has the value

$$\nu = \frac{1}{2\pi}\sqrt{\frac{b}{m}} \tag{5.12}$$


When a large number of such oscillators are in thermal equilibrium, 
the energy of an individual oscillator varies widely but the energies of 
the entire group are distributed in energy according to the Maxwell- 
Boltzmann distribution (Sec. 4.6). By a contraction of Eq. (4.19) to 
two phase-space dimensions we find the number of oscillators dN with 
coordinates between x and x + dx and momentum between p and p + dp
to be

$$dN = NAe^{-\epsilon/kT}\,dx\,dp \tag{5.13}$$

where A is a proportionality constant such that $\int dN = N$.


Since we are interested only in the energies of the oscillators, it is 
convenient to throw the formula into a different form. Let us construct 
a plot on which x and p are taken as cartesian coordinates (Fig. 5.5). 
Each oscillator is represented by a point on this plot which moves about 
as the oscillator vibrates. So long as the oscillator is free from disturbing 
forces, x and p change in such a way that the energy remains constant; 
hence the representative point moves along an ellipse given by Eq. (5.11) 
with a fixed value of $\epsilon$. Let two such ellipses be drawn for slightly
different energies $\epsilon$ and $\epsilon + \Delta\epsilon$. Throughout the
elliptical ring between these ellipses, if $\Delta\epsilon$ is very small,
the quantity $e^{-\epsilon/kT}$ is sensibly constant; hence, in a state of
thermal equilibrium, the number of oscillators represented by points in
this ring is, by Eq. (5.13),

$$\Delta N = NAe^{-\epsilon/kT}\iint dx\,dp \tag{5.14}$$

The integral $\iint dx\,dp$ is the area of the elliptical ring, and its value
is easily found in terms of $\Delta\epsilon$. We first note that the area
inside an ellipse whose semiaxes are $x_m$ and $p_m$ is

$$S = \pi p_m x_m \tag{5.15}$$

Fig. 5.5 The momentum-coordinate plane for a harmonic oscillator.


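Here $x_m$ and $p_m$ also represent the maximum values of x and of p,
respectively, during a vibration of an oscillator having an energy
$\epsilon$; substituting in (5.11) first $x = x_m$ and p = 0 and then x = 0
and $p = p_m$, we find

$$x_m = \sqrt{\frac{2\epsilon}{b}} \qquad p_m = \sqrt{2m\epsilon} \qquad S = 2\pi\epsilon\sqrt{\frac{m}{b}} = \frac{\epsilon}{\nu} \tag{5.16}$$

by (5.12). The area of a ring corresponding to an increment $\Delta\epsilon$
is thus $\iint dx\,dp = \Delta S = \Delta\epsilon/\nu$. Hence (5.14) can be
written in the form

$$\Delta N = NA_1 e^{-\epsilon/kT}\,\Delta\epsilon \tag{5.17}$$

where $A_1 = A/\nu$.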

The average energy of the oscillators can now be found by summing
their individual energies and dividing by N. Ordinarily this would
be done by means of integrals, but for our present purpose we employ
discrete sums. Let ellipses centered at the origin be drawn so as to
divide the xp plane into rings of equal area h; and let these rings, of
which the innermost is actually an elliptical area, be numbered 0, 1,
2, . . . from the center outward, as in Fig. 5.5. Then the total area
inside ring number $\tau$ is $S = \tau h$; and, by Eq. (5.16), the energy of
an oscillator represented by a point on the inner boundary of ring number
$\tau$ is

$$\epsilon = S\nu = \tau h\nu$$

The number of oscillators $N_\tau$ on ring $\tau$ can thus be written, from
(5.17),

$$N_\tau = N_0 e^{-\tau h\nu/kT}$$

where $N_0$ replaces $NA_1\,\Delta\epsilon$. The total energy E of all
oscillators is then, approximately,

$$E = \sum_\tau \tau h\nu N_\tau = N_0 h\nu e^{-h\nu/kT}\left(1 + 2e^{-h\nu/kT} + 3e^{-2h\nu/kT} + \cdots\right) = N_0 h\nu e^{-h\nu/kT}\left(1 - e^{-h\nu/kT}\right)^{-2}$$

since by the binomial theorem the last series is of the form
$1 + 2x + 3x^2 + \cdots = (1 - x)^{-2}$. In a similar way it is found that

$$N = \sum_\tau N_\tau = N_0\left(1 + e^{-h\nu/kT} + \cdots\right) = N_0\left(1 - e^{-h\nu/kT}\right)^{-1}$$

For the average energy per oscillator we have then, finally,

$$\bar\epsilon = \frac{E}{N} = \frac{h\nu}{e^{h\nu/kT} - 1} \tag{5.18}$$

This expression represents also the energy of a particular oscillator
averaged over any length of time that is not too short. For, being similar,
the oscillators will all have the same average energy in the long run, and
this average must be E/N.

In classical theory it is now necessary to let h → 0. The approximations
made then disappear. Using the series $e^x = 1 + x + x^2/2 + \cdots$,
we find that, to the first order in h,

$$\bar\epsilon = \frac{h\nu}{e^{h\nu/kT} - 1} \approx \frac{h\nu}{h\nu/kT} = kT$$

Hence, in the limit as h → 0, $\bar\epsilon = kT$. This is the same value for
the average energy of a harmonic oscillator that was deduced from the
equipartition of energy in Sec. 5.4, and it leads to the incorrect
Rayleigh-Jeans formula.
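
The ring sums of this section are easy to verify numerically. The following Python sketch (an illustration, not part of the original derivation) evaluates the discrete average energy directly, compares it with the closed form of Eq. (5.18), and shows the approach to the classical value kT as the ring area h shrinks. The function names and the truncation of the sum at a finite number of rings are choices made here for illustration.

```python
import math


def mean_energy_discrete(h_nu, k_t, rings=5000):
    """Average energy from the ring sums: sum(tau*h_nu*N_tau) / sum(N_tau).

    N_tau is proportional to exp(-tau*h_nu/kT); the common factor N_0 cancels.
    The sum is truncated after `rings` terms, which is ample for the
    values of h_nu/kT used below.
    """
    weights = [math.exp(-tau * h_nu / k_t) for tau in range(rings)]
    total_energy = sum(tau * h_nu * w for tau, w in enumerate(weights))
    return total_energy / sum(weights)


def mean_energy_closed_form(h_nu, k_t):
    """Closed form of Eq. (5.18): h*nu / (exp(h*nu/kT) - 1)."""
    return h_nu / math.expm1(h_nu / k_t)


k_t = 1.0  # energies are measured in units of kT
for h_nu in (5.0, 1.0, 0.1, 0.01):
    discrete = mean_energy_discrete(h_nu, k_t)
    closed = mean_energy_closed_form(h_nu, k_t)
    print(f"h*nu/kT = {h_nu:5.2f}   sum: {discrete:.6f}   Eq. (5.18): {closed:.6f}")
```

For small $h\nu/kT$ both columns approach 1, that is, $\bar\epsilon \to kT$; for large $h\nu/kT$ the average energy falls far below kT.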


5.7 Planck’s Quantum Hypothesis 


The new assumption introduced by Planck was equivalent to keeping h
finite in the preceding formulas. In the first formulation of the new
theory, Planck assumed that the oscillators associated with a given ring
all have the energy proper to the inner boundary of that ring. Then
Eq. (5.18) for $\bar\epsilon$ holds exactly. According to this assumption,
the energy of an oscillator cannot vary continuously but must take on one
of the discrete set of values 0, $h\nu$, $2h\nu$, . . . , $\tau h\nu$, . . . .
The actual original form of Planck’s assumption was that the energy of the
oscillator must always be an integral multiple of a certain quantity
$\epsilon_0$, but he then showed that for oscillators of different
frequencies, $\epsilon_0$ must be proportional to $\nu$ if the radiation law
is to harmonize with the Wien displacement law. Thus he assumed that
$\epsilon_0 = h\nu$, where h is a constant of proportionality.
The connection between h and areas on the xp plane for the oscillator,
as described above, was recognized by Planck later.

It must be emphasized that the assumption of a discrete set of 
possible energy values, or energy levels, for an oscillator was completely 
at variance with classical ideas. According to this assumption, if the 
energies of a large number of oscillators were measured, some might be 
found to have zero energy, some $h\nu$ each, others $2h\nu$, and so on. But
not a single oscillator would be found which had energy, say, $1.73h\nu$.
When the energy of a given oscillator changes, therefore, it must change 
suddenly and discontinuously. According to the older conceptions, the 
interchange of energy between two systems, e.g., between one gas molecule 
and another or between radiation and oscillators, is a continuous process, 
and the energy of an oscillator would likewise vary continuously. Such 
continuity of energy values is demanded by classical physics. For exam- 
ple, the electric and magnetic vectors in a light wave may have any values 
whatsoever, from zero up; and, accordingly, the wave may have any 
intensity, from zero up. The emission and absorption of this energy by 
the walls of an enclosure should, likewise, be a perfectly continuous 
process. 

The problem of the absorption and emission of radiation presented
serious difficulties for the new theory. If the energy of an oscillator
can vary only discontinuously, the absorption and emission of radiation
must be discontinuous processes. As long as the oscillator remains in
one of its quantum states, as we now call them, with its energy equal to
one of the allowed discrete values, it cannot be emitting or absorbing
radiation according to the laws of classical physics, for then the
conservation of energy would be violated. This is contrary to classical
electromagnetic theory, which requires an isolated, accelerated electric
charge to radiate energy.

According to Planck’s new theory, emission of radiation occurs only 
when the oscillator “jumps” from one energy level to another; if it jumps 
down to the next lower energy level, the energy $h\nu$ that it loses is emitted
in the form of a short pulse of radiation. Absorption was also assumed at
first to be discontinuous. An oscillator, Planck assumed, can absorb a
quantum $h\nu$ of radiant energy and jump up to its next higher energy
level. This assumption met with special difficulties, however. For the 
quantum of radiant energy emitted by an oscillator, according to the 
classical wave theory, would spread out over an ever expanding wavefront, 
and it is hard to see how another oscillator could ever gather this energy 
together again so as to absorb it all and thereby acquire the energy for 
an upward quantum jump. Absorption ought, therefore, on Planck’s 
theory, to be impossible. 

To avoid this difficulty, Planck later modified his theory so as to 
allow the oscillators to absorb in a continuous manner, only the process of 
emission being discontinuous. The energy of an oscillator could then 
take on all values, as in classical theory; but every time the energy 
passed one of the critical values $\tau h\nu$, there was assumed to exist a
certain chance that the oscillator would jump down to a lower energy level,
emitting its excess energy as a quantum of radiation. This came to be
known as the second form of Planck’s quantum theory. It can be
shown that in this form of the theory, the oscillators are evenly
distributed over each ring on the xp plane, instead of being all on the
inner boundary, and the mean energy of all oscillators is, instead of the
value given by Eq. (5.18),

$$\bar\epsilon = \frac{h\nu}{e^{h\nu/kT} - 1} + \frac12 h\nu \tag{5.19}$$


Having read thus far, the student may perhaps have reached a state 
of confusion as to what were the essential assumptions of Planck’s quan- 
tum theory! This confusion can be no worse than that which existed 
in the minds of most physicists in the year, say, 1911. The situation was 
made still more puzzling by the success of Einstein’s theory of the photo- 
electric effect (Sec. 6.7); for Einstein assumed not only that radiation 
came in quantized spurts but that each spurt was closely concentrated 
in space, contrary to the wave theory. Confusion usually reigns while 
important physical advances are being made; it is only afterward that a 
clear-cut logical path can be laid down leading straight to the goal. 

One of the aims of this book will be to show how the theory gradually 
became clarified. It will be found that the following two new ideas 
introduced by Planck have been retained permanently and form a part 
of modern wave mechanics. 


1. An oscillator or any similar physical system has a discrete set of possible
energy values or levels; energies intermediate between these allowed values
never occur.

2. The emission and absorption of radiation are associated with transitions, or
jumps, between two of these levels, the energy thereby lost or gained by the
oscillator being emitted or absorbed, respectively, as a quantum of radiant
energy, of magnitude $h\nu$, $\nu$ being the frequency of the radiation.

The difficulties with the theory of electromagnetic waves were overcome
when the theory of wave mechanics was applied to the electromagnetic
field itself.

It should be emphasized that Planck’s revolutionary assumptions 
were not based upon an extension of the ordinary lines of reasoning of 
classical physics. Quite the contrary; they represented an empirical 
modification of classical ideas made in order to bring the theoretical 
deductions into harmony with experiment. Had the magnitude of the 
quantum of energy turned out to be not $h\nu$ but something independent
of the frequency, the new theory might well have taken the form of a 
simple atomicity of energy, similar to the atomicity of electricity repre- 
sented by the electronic charge. Such is not the case, however. Rather, 
it is the new universal constant h that represents the essentially new 
element introduced into physics by the quantum theory. We shall find 
h playing an important part in a wide variety of atomic phenomena. 


5.8 Planck’s Radiation Law 


Planck derived his new radiation formula by considering the interaction 
between the radiation inside an isothermal enclosure and electric oscil- 
lators which he imagined to exist in the walls of the enclosure. A more 
direct and equally satisfactory procedure is to combine Planck’s new 
expression for the mean energy of an oscillator with the analysis of the 
electromagnetic field by the method of Rayleigh and Jeans, in which the 
various modes of oscillation of the field inside an enclosure are treated
as if they were oscillators. In Sec. 5.3, Eq. (5.9), we found that there
would be $8\pi\,d\lambda/\lambda^4$ such modes of oscillation or degrees of
freedom per unit volume in the wavelength range $\lambda$ to
$\lambda + d\lambda$. If we multiply this number by $\bar\epsilon$ as given by
Eq. (5.18), we obtain

$$U_\lambda\,d\lambda = \frac{8\pi\,d\lambda}{\lambda^4}\,\frac{h\nu}{e^{h\nu/kT} - 1}$$

Let us substitute here $\nu = c/\lambda$, c being the speed of light in vacuum.
Thus we obtain, as Planck’s new radiation law,

$$U_\lambda = \frac{8\pi ch}{\lambda^5}\,\frac{1}{e^{ch/\lambda kT} - 1} \tag{5.20}$$

[Strictly speaking, we should have used for $\bar\epsilon$ the value given by
Eq. (5.19), which agrees with the value obtained from wave mechanics. The
effect of this change would be to add in U a term independent of
temperature. Since only changes in U are perceptible, this term would be
without physical effect.]
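
To see how Planck's law, Eq. (5.20), departs from the Rayleigh-Jeans formula, Eq. (5.10), the two can be evaluated side by side. The Python sketch below is illustrative only; the function names and the sample wavelengths are choices made here, and the temperature of 1600°K is picked to match the data discussed in this section.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J*s
C = 2.99792458e8     # speed of light, m/s


def planck_u_lambda(wavelength_m, temperature_k):
    """Eq. (5.20): energy density per unit wavelength, in J/m^4."""
    x = H * C / (wavelength_m * K_B * temperature_k)
    return 8 * math.pi * C * H / wavelength_m**5 / math.expm1(x)


def rayleigh_jeans_u_lambda(wavelength_m, temperature_k):
    """Eq. (5.10): 8*pi*k*T / lambda**4, in J/m^4."""
    return 8 * math.pi * K_B * temperature_k / wavelength_m**4


T = 1600.0  # temperature in kelvins
for lam_um in (0.5, 2.0, 10.0, 100.0, 1000.0):
    lam = lam_um * 1e-6  # micrometers to meters
    ratio = planck_u_lambda(lam, T) / rayleigh_jeans_u_lambda(lam, T)
    print(f"{lam_um:8.1f} um   Planck / Rayleigh-Jeans = {ratio:.3e}")
```

The ratio is $x/(e^x - 1)$ with $x = ch/\lambda kT$: it tends to unity at long wavelengths and becomes vanishingly small at short ones, which is how the divergence of the classical formula is removed.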

Planck’s formula reduces to Wien’s formula near one end of the 
spectrum and to the Rayleigh-Jeans formula near the other end. In 
Fig. 5.6 is shown a comparison between the several spectral-energy dis- 
tribution formulas and the experimental data. The circles show observa- 
tions by Coblentz on the energy distribution in the spectrum of a black- 
body at 1600°K. The reason that the curve for Planck’s formula drops 
below the Rayleigh-Jeans formula is the failure of the classical principle 
of the equipartition of energy. The high-frequency modes of oscillation 
of the electromagnetic field in the enclosure, which should all have mean 
energy kT according to classical theory, remain almost entirely in their 
lowest quantum states and so contribute little to the observable density 
of radiant energy. 

Fig. 5.6 Comparison of the three radiation laws with experiment at 1600°K.
(Horizontal axis: wavelength, μ.)

Planck’s formula has the form that we found to be required by the
principles of thermodynamics (Sec. 5.2); for Planck’s formula can be
written

$$U_\lambda = \frac{f(\lambda T)}{\lambda^5} \qquad f(\lambda T) = \frac{8\pi ch}{e^{ch/k\lambda T} - 1} \tag{5.20a}$$


From the formula in this form, both the Stefan-Boltzmann formula and 
the Wien displacement law follow as mathematical consequences. 

The value of the new constant h can be determined by comparing
the observed values of $U_\lambda$ with the formula, in which the values of all
other quantities are known. In his original paper (1901) Planck obtained
in this way the value $h = 6.55 \times 10^{-34}$ J-s. The accuracy of thermal
data is relatively low, however, and better methods yield the value

$$h = 6.6256 \times 10^{-34}\ \text{J-s}$$

Fig. 5.7 Spectral distribution $U_\nu$ of blackbody radiation normalized to
make U unity (left scale) and fraction of energy radiated with
x (= $h\nu/kT$) less than value given on abscissa (right scale).

It is often desirable to write the Planck law in terms of frequency
rather than $\lambda$. In this case the energy density $U_\nu\,d\nu$ in the
range between $\nu$ and $\nu + d\nu$ is

$$U_\nu\,d\nu = \frac{8\pi h\nu^3\,d\nu}{c^3\left(e^{h\nu/kT} - 1\right)} \tag{5.21}$$

The total energy density U is

$$U = \int_0^\infty U_\lambda\,d\lambda = \frac{8\pi k^4T^4}{c^3h^3}\int_0^\infty \frac{x^3\,dx}{e^x - 1} = \frac{8\pi^5k^4}{15c^3h^3}\,T^4 \tag{5.22}$$


where $x = ch/\lambda kT$ and the integral has the value $\pi^4/15$. The normalized
spectral distribution of a blackbody is shown in Fig. 5.7 along with a 
curve showing the fraction of the energy radiated at x less than a pre- 
scribed value. 

As is shown in Sec. 5A.1, Eqs. (5A.4) and (5A.5), the power radiated
per unit area, total, or in a given $\lambda$ or $\nu$ interval, is just c/4
times the appropriate energy density. Thus the power radiated per unit area
is $R_B = cU/4$, and we have recovered Eq. (5.1) with the Stefan constant
expressed in terms of $\pi$, k, c, and h.
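
The value of the integral in Eq. (5.22) and the resulting Stefan constant can be confirmed numerically. The following Python sketch is a check under stated assumptions: a simple midpoint rule stands in for the exact integration, the upper limit is cut off at x = 60 (beyond which the integrand is negligible), and the names are chosen here for illustration.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J*s
C = 2.99792458e8     # speed of light, m/s


def planck_integral(upper=60.0, steps=200_000):
    """Midpoint-rule estimate of the integral of x**3/(exp(x) - 1) on [0, upper]."""
    dx = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * dx
        total += x**3 / math.expm1(x)
    return total * dx


integral = planck_integral()
# Radiation constant a in U = a*T**4, from Eq. (5.22).
a = 8 * math.pi * K_B**4 / (C**3 * H**3) * integral
# Stefan constant from the relation R = c*U/4.
sigma = a * C / 4

print(f"integral  = {integral:.6f}   (pi^4/15 = {math.pi**4 / 15:.6f})")
print(f"a         = {a:.4e} J m^-3 K^-4")
print(f"sigma     = {sigma:.4e} W m^-2 K^-4")
```

With present-day values of the constants this gives a Stefan constant of about $5.67 \times 10^{-8}$ W/m²-K⁴.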


Appendix 5A - Classical Radiation Theory 


5A.1 Pressure and Energy Flux Due to Isotropic Radiation 


Suppose a stream of radiation in a vacuum falls normally on the surface 
of a body. Then, if w is the mean energy density in the oncoming waves,
they carry also w/c units of momentum per unit volume (Sec. 2.12). 
Thus the waves bring up to each unit area of the surface, along with cw J 
of energy, w units of momentum per second, the momentum as a vector 
being directed normally toward the surface. If the waves are absorbed 
by the surface, it receives this momentum and experiences, therefore, a 
pressure equal to w. 

Suppose, next, that the radiation is incident at an angle $\theta$. Then the
energy that crosses a unit area drawn perpendicular to the rays (PQ in
Fig. 5A.1) is received by a larger area of magnitude $1/\cos\theta$ on the
surface PR. Furthermore, the component of the momentum normal to
the surface is less than in the case of normal incidence in proportion to
$\cos\theta$. Thus the momentum in the direction of the normal that is
delivered to unit area of the surface per second is decreased by the
obliqueness of incidence in the ratio $\cos^2\theta$, and the resulting
pressure, if the radiation is entirely absorbed, is

$$p = w\cos^2\theta$$

The same expression holds for the pressure caused by the emission of a
beam at an angle $\theta$ or for the additional pressure caused by the
occurrence of a reflected beam. If an incident beam is specularly reflected
from a surface at the angle of incidence $\theta$, the total pressure on the
surface is $2w\cos^2\theta$.

Finally, let radiation be streaming toward a surface and also away 
from it with equal intensities in all directions, as in an isothermal enclo- 
sure. Such a distribution of radiation is equivalent to a large number of 
beams of plane waves, all of equal intensity, with their directions of 
propagation distributed equally in direction. Let there be N beams in
all, and let the energy density due to any one of them be w. Then the
total energy density U just in front of the surface and the pressure p
on it are, respectively,

$$U = Nw \tag{5A.1a}$$

$$p = \sum w\cos^2\theta = w\sum\cos^2\theta \tag{5A.1b}$$


To find this latter sum, imagine lines drawn outward from a point O 
on the surface to represent the various directions of the beams, whether 
moving toward the surface or away from it, and then about O as center 
draw a hemispherical surface of unit radius with its base on the surface 
(see Fig. 5A.2, where only two of the lines are shown). From the hemi- 
sphere cut out a ring-shaped element of area QS by means of two cones 
of semiangle $\theta$ and $\theta + d\theta$, drawn from O as apex and with the
normal OP as axis. The edge of this element is a circle of perimeter
$2\pi\sin\theta$, and its width is $d\theta$; hence its area is
$2\pi\sin\theta\,d\theta$, whereas the area of the whole hemisphere is $2\pi$.
Now the lines of approach of the N beams of radiation, if drawn through O,
will cut the hemisphere in points equally distributed over its surface.
Hence, if we let dN denote the number of these lines that pass through the
ring-shaped element, dN will be to N in the ratio of the area of the ring
to the area of the hemisphere, so that

$$\frac{dN}{N} = \frac{2\pi\sin\theta\,d\theta}{2\pi} = \sin\theta\,d\theta$$

The value of $\cos^2\theta$ is the same for all the dN beams. Hence their
contribution to $\sum\cos^2\theta$ is $\cos^2\theta\,dN$ or, from the last
equation, $N\cos^2\theta\sin\theta\,d\theta$. Thus

$$\sum\cos^2\theta = \int\cos^2\theta\,dN = N\int_0^{\pi/2}\cos^2\theta\sin\theta\,d\theta = \tfrac13 N \tag{5A.2}$$

(The limit is $\pi/2$ because directions all around the normal OP are included
in the ring.) For the pressure we thus obtain, from Eq. (5A.1), $p = \tfrac13 Nw$,
or

$$p = \tfrac13 U \tag{5A.3}$$
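
The integral in Eq. (5A.2) is stated without working. One way to evaluate it, offered here as a supplementary step rather than as the route taken in the original, is the substitution $u = \cos\theta$:

```latex
% Substitution u = cos(theta), du = -sin(theta) d(theta);
% the limits theta = 0 and theta = pi/2 become u = 1 and u = 0.
\begin{aligned}
\int_0^{\pi/2} \cos^2\theta \,\sin\theta \, d\theta
  &= -\int_1^0 u^2 \, du
   = \int_0^1 u^2 \, du
   = \frac{1}{3}, \\
p &= w \sum \cos^2\theta = \tfrac{1}{3} N w = \tfrac{1}{3} U .
\end{aligned}
```

The factor of one-third is thus the mean value of $\cos^2\theta$ over directions distributed uniformly on the hemisphere.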


Thus the pressure on the walls of an isothermal enclosure equals one- 
third of the radiant-energy density at any interior point. 

It will be useful to calculate also the total energy brought up to the
surface. Since half of the N waves are moving toward the surface and
each wave delivers energy $cw\cos\theta$, the total energy brought up to unit
area per second is

$$\tfrac12 N\int_0^{\pi/2} cw\cos\theta\sin\theta\,d\theta = \tfrac14 cwN = \tfrac14 cU$$

by (5A.1a). In an isothermal enclosure an equal amount of energy is
carried away from the surface. It can be shown in the same way that
in an isothermal enclosure in which the energy density is U, energy
$\tfrac14 cU$ per second passes in each direction across unit area of any
imaginary surface drawn inside the enclosure.

Fig. 5A.2

Furthermore, the energy emitted per second by unit area of any
black surface, which is its radiant emittance R, is related to the energy
density U in an isothermal enclosure having the same temperature by
the equation

$$R = \tfrac14 cU \tag{5A.4}$$

The same relation holds for each wavelength separately. Let the energy
emitted from the blackbody with a range of wavelengths $d\lambda$ be denoted
by $e_\lambda\,d\lambda$ (so that $R = \int_0^\infty e_\lambda\,d\lambda$) and
let the density of radiant energy in the enclosure within the same range of
wavelengths be similarly represented by $U_\lambda\,d\lambda$ (so that
$U = \int_0^\infty U_\lambda\,d\lambda$). Then

$$e_\lambda = \tfrac14 cU_\lambda \tag{5A.5}$$


5A.2 The Stefan-Boltzmann Law


In 1884, Boltzmann deduced a theoretical law for the variation of the 
total intensity of blackbody radiation with temperature. For this pur- 
pose, he applied the laws of the Carnot cycle to an engine in which the 
radiation played the part of the working substance. 

Fig. 5A.3 Boltzmann's radiation engine, panels (a) to (d).

Fig. 5A.4 The p-v diagram for the Carnot cycle of the radiation engine.

The ideal Carnot engine consists of an evacuated cylinder with walls
impervious to heat, a piston likewise impervious to heat and moving
without friction, and a base through which heat may enter or leave.
Let the walls, piston, and base be perfectly reflecting except for a small
opening O in the base, which can be covered at will by a perfectly
reflecting cover. Let this cylinder be placed with the opening O uncovered
and opposite an opening in an evacuated isothermal enclosure $B_1$, which
is maintained at temperature $T_1$ (Fig. 5A.3). Then the cylinder will fill
up with radiation entering it through O from $B_1$ until there is the same
density $U_1$ of radiation in the cylinder as there is in $B_1$, at which time
radiation will be passing at the same rate from O to $B_1$ as from $B_1$ to O.
We may now consider the following cycle of events:


1. Starting with the piston in the initial position $P_1$ (Fig. 5A.3a), the
initial volume of the cylinder being $v_1$ and the initial pressure due to the
radiation $p_1 = \tfrac13 U_1$, we cause the piston to move upward, slowly,
until position $P_2$ is reached, the volume increasing to $v_2$. During this
process the radiation density within the cylinder remains constant at $U_1$.
To keep it constant, additional radiation must enter the opening O from the
enclosure $B_1$, for two reasons:

a. Work $W_1$ is done by the radiation on the piston. If $T_1$ remains
constant, so do $U_1$ and $p_1$, and

$$W_1 = p_1(v_2 - v_1) = \tfrac13 U_1(v_2 - v_1)$$

b. The volume of the cylinder has increased by $v_2 - v_1$, which requires
an additional influx of energy equal to $U_1(v_2 - v_1)$. Thus the total
influx $H_1$ of radiation from $B_1$ must be

$$H_1 = \tfrac43 U_1(v_2 - v_1) \tag{5A.6}$$

This isothermal process is represented on the p-v diagram (Fig. 5A.4) by
the horizontal line $P_1P_2$. The energy $H_1$ is equivalent to heat supplied
to the space within the cylinder, just as in an ordinary Carnot cycle the
first isothermal expansion is accompanied by an absorption of heat. An amount
of heat equal to $H_1$ must also be supplied from external sources to $B_1$ in
order to keep the temperature of $B_1$ constant.

2. When the piston has reached $P_2$, the perfectly reflecting cover is placed
over the opening O (Fig. 5A.3b), thereby effecting complete thermal isolation
of the interior of the cylinder, and a further expansion to position $P_3$ is
made. External work is done, as before, on the piston, the energy required for
this external work being supplied by the radiation. Partly because of this
work and partly because of the increase in volume, the energy density of the
radiation within the cylinder must decrease from $U_1$ to some smaller value
$U_2$. The pressure, likewise, has decreased. This is obviously an adiabatic
process. It is represented in Fig. 5A.4 by the line $P_2P_3$.

The new energy density $U_2$ is now equal to the energy density in an
enclosure at a certain new temperature $T_2$. If the expansion during this
second process was very small, we may represent the change in temperature
$T_1 - T_2$ by dT and the corresponding change in energy density $U_1 - U_2$
by dU. Since $p = \tfrac13 U$, we have then

$$dp = \tfrac13\,dU \tag{5A.7}$$

dp representing the change in radiation pressure.

3. The engine is now placed opposite a second isothermal enclosure $B_2$
(Fig. 5A.3c) at temperature $T_2$, the slide is removed from the opening O, and
the piston is moved, by the application of suitable external force, from $P_3$
to $P_4$. On account of this compression, there is a tendency for the density
of radiation within the cylinder to rise and for radiation to pass through O
into $B_2$. The compression is supposed to take place so slowly, however, that
the radiation density remains constant at a value only infinitesimally in
excess of $U_2$. During this second isothermal process, radiant energy in
amount $H_2$ leaves the engine.

4. The piston having reached a suitable point $P_4$, the opening O is closed,
and the radiation is then compressed adiabatically until the initial position
$P_1$ is reached.


The net external work done during this cycle is represented by the
area $P_1P_2P_3P_4$ of Fig. 5A.4. If we assume the change of pressure to
have been very small, this area equals $(v_2 - v_1)\,dp$. Calling the net
external work dW, we have, therefore,

$$dW = (v_2 - v_1)\,dp = \tfrac13(v_2 - v_1)\,dU$$

by (5A.7). Hence, by the usual rule for a Carnot cycle,

$$\frac{dW}{H_1} = \frac{dT}{T_1}$$

and, using the value found for dW and also Eq. (5A.6), we have

$$\frac{dU}{U_1} = 4\,\frac{dT}{T_1}$$

Thus, dropping the subscript,

$$\frac{dU}{U} = 4\,\frac{dT}{T}$$

Integrated, this equation gives log U = 4 log T + const or

$$U = aT^4 \tag{5A.8}$$

where a is a constant, not yet known. From Eq. (5A.4) we have then
also, for the emissive power or radiant emittance

$$R = \sigma T^4 \tag{5A.9a}$$

$$\sigma = \tfrac14 ac \tag{5A.9b}$$

Thus both the energy density of the radiation within an isothermal enclosure
and the total emissive power of a blackbody are proportional to the fourth
power of the absolute temperature T. This is the Stefan-Boltzmann law.
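
The step from the differential relation dU/U = 4 dT/T to the fourth-power law can also be checked numerically. The Python sketch below integrates the relation with a standard fourth-order Runge-Kutta scheme, which is a numerical stand-in chosen here and not part of Boltzmann's argument; the function name and the temperatures are likewise illustrative.

```python
def integrate_energy_density(u_start, t_start, t_end, steps=10_000):
    """Integrate dU/dT = 4*U/T from t_start to t_end (classical RK4)."""

    def slope(t, u):
        return 4.0 * u / t

    h = (t_end - t_start) / steps
    t, u = t_start, u_start
    for _ in range(steps):
        k1 = slope(t, u)
        k2 = slope(t + h / 2, u + h * k1 / 2)
        k3 = slope(t + h / 2, u + h * k2 / 2)
        k4 = slope(t + h, u + h * k3)
        u += h * (k1 + 2 * k2 + 2 * k3 + k4) / 6
        t += h
    return u


# Doubling the absolute temperature should multiply U by 2**4.
ratio = integrate_energy_density(1.0, 300.0, 600.0)
print(f"U(600 K) / U(300 K) = {ratio:.6f}")
```

The computed ratio agrees with $2^4 = 16$ to within the accuracy of the integration, as Eq. (5A.8) requires.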


5A.3 Reflection from a Moving Mirror 


In the preceding discussion no attention was paid to the spectral distri- 
bution of the radiation. The question presents itself, however, whether 
the same law can be applied also to the separate wavelengths. In order 
to investigate this question, we need to know what happens to the 
spectral distribution of a beam of radiation when it is reflected from a 
mirror that is in motion, such as the piston in the ideal apparatus that 
was described in the last section. 

Let us consider first the effect of such motion upon a monochromatic 
beam. For this purpose we employ Huygens’ principle. In Fig. 5A.5a, 
MM represents a mirror moving with a component of velocity V perpen- 
dicular to its plane. AB and DE represent parts of two incident waves 
which are one wavelength, or a distance $\lambda$, apart and are falling on the
mirror at an angle of incidence $\theta$; CB and FE are parts of the same
waves which have been reflected and are now leaving the mirror at an
angle of reflection $\theta'$, a distance $\lambda'$ apart. It is obvious from
the figure that $\lambda = BE\sin\theta$, $\lambda' = BE\sin\theta'$, whence

$$\frac{\lambda'}{\lambda} = \frac{\sin\theta'}{\sin\theta} \tag{5A.10}$$

Fig. 5A.5 Reflection from a moving mirror.

To obtain a second relation between $\lambda'$ and $\theta'$, consider two
successive positions of the same wave. In Fig. 5A.5b, the part AB of a wave
is just beginning to fall on the mirror, whose instantaneous position is
MM. The same portion of the wave at a later instant, after it has been
reflected at the angle $\theta'$, is shown by A′B′, the point A being now in
contact at A′ with the mirror, which is in the new position M′M′. While
A traveled along the ray AA′, B traversed BB′; hence AA′ = BB′.
The angles BAA′ and BB′A′, being angles between ray and wave, are
right angles. It follows, therefore, by similar triangles, that angle
B′A′B = ABA′, or

$$\theta' - \epsilon = \theta + \epsilon \tag{5A.11}$$

where $\epsilon$ is the angle QBA′. Furthermore, while A went from A to A′ at
the speed c of light, the mirror moved from MM to M′M′ at the speed V.
Hence, if A′Q is a perpendicular dropped from A′ onto MM,

$$\frac{V}{c} = \frac{A'Q}{AA'} = \frac{BA'\sin\epsilon}{BA'\sin(\theta + \epsilon)} = \frac{\sin\epsilon}{\sin(\theta + \epsilon)} \tag{5A.12}$$

By eliminating $\epsilon$ between the last two equations, the equation

$$\tan\tfrac12\theta' = \frac{c + V}{c - V}\,\tan\tfrac12\theta \tag{5A.13}$$

is obtained, expressing the law of reflection from a moving mirror.
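
The elimination that leads to Eq. (5A.13) can be verified numerically. In the Python sketch below, the angle $\epsilon$ is found by solving Eq. (5A.12) with a simple fixed-point iteration (a numerical device chosen here, valid for moderate V/c), the reflected angle follows from Eq. (5A.11), and the result is compared with the closed form. The symbol `beta` for V/c and the function names are assumptions introduced for illustration; the sign convention for V is that of Fig. 5A.5.

```python
import math


def reflection_angle_closed_form(theta, beta):
    """Eq. (5A.13) with beta = V/c: tan(theta'/2) = (1+beta)/(1-beta) * tan(theta/2)."""
    return 2 * math.atan((1 + beta) / (1 - beta) * math.tan(theta / 2))


def reflection_angle_iterative(theta, beta, iterations=200):
    """Solve sin(eps) = beta*sin(theta + eps), Eq. (5A.12), by fixed-point
    iteration, then apply theta' = theta + 2*eps from Eq. (5A.11)."""
    eps = 0.0
    for _ in range(iterations):
        eps = math.asin(beta * math.sin(theta + eps))
    return theta + 2 * eps


beta = 0.1
for degrees in (10.0, 30.0, 60.0, 85.0):
    theta = math.radians(degrees)
    closed = math.degrees(reflection_angle_closed_form(theta, beta))
    iterated = math.degrees(reflection_angle_iterative(theta, beta))
    print(f"theta = {degrees:5.1f}   Eq. (5A.13): {closed:.6f}   iteration: {iterated:.6f}")
```

The two columns agree, and for V = 0 both reduce to the ordinary law of reflection, $\theta' = \theta$.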
Finally, from Eq. (5A.10), for the change of wavelength $\Delta\lambda$ caused
by reflection, we find

$$\frac{\Delta\lambda}{\lambda} = \frac{\lambda' - \lambda}{\lambda} = \frac{\sin\theta' - \sin\theta}{\sin\theta}$$

We need the value of $\Delta\lambda$, however, only for an infinitesimal value
of V and hence of $\theta' - \theta$. For such a value

$$\sin\theta' - \sin\theta = (\theta' - \theta)\,\frac{d}{d\theta}\sin\theta = (\theta' - \theta)\cos\theta$$

Also, from Eqs. (5A.11) and (5A.12), in which $\epsilon$ is an infinitesimal,

$$\theta' - \theta = 2\epsilon = 2\sin\epsilon = 2\,\frac{V}{c}\,\sin(\theta + \epsilon)$$

to the first order; and here $\sin(\theta + \epsilon)$ may be replaced by
$\sin\theta$. We thus find, to the first order in V,

$$\Delta\lambda = 2\,\frac{V}{c}\,\lambda\cos\theta \tag{5A.14}$$
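
How good the first-order result of Eq. (5A.14) is can be judged by comparing it with the exact shift obtained from Eqs. (5A.10) and (5A.13). The Python sketch below does this for a few mirror speeds; `beta` again denotes V/c, and the names and sample values are illustrative assumptions.

```python
import math


def fractional_shift_exact(theta, beta):
    """(lambda' - lambda)/lambda from Eq. (5A.10), with theta' from Eq. (5A.13)."""
    theta_prime = 2 * math.atan((1 + beta) / (1 - beta) * math.tan(theta / 2))
    return math.sin(theta_prime) / math.sin(theta) - 1


def fractional_shift_first_order(theta, beta):
    """Eq. (5A.14) divided by lambda: 2*(V/c)*cos(theta)."""
    return 2 * beta * math.cos(theta)


theta = math.radians(60.0)
for beta in (1e-2, 1e-3, 1e-5):
    exact = fractional_shift_exact(theta, beta)
    approx = fractional_shift_first_order(theta, beta)
    print(f"V/c = {beta:.0e}   exact: {exact:.6e}   first order: {approx:.6e}")
```

The difference between the two columns shrinks as the square of V/c, so the first-order formula is fully adequate for the slow piston motion considered in this appendix.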


5A.4 Effect of an Adiabatic Expansion upon Blackbody Radiation


Returning now to the sequence of operations described in Sec. 5A.2, let
us consider the effect of the adiabatic process (step 2) upon the spectral
distribution of the radiation. In this process, blackbody radiation
initially at temperature $T_1$, imprisoned in a cylinder with perfectly
reflecting walls, is slowly expanded from an initial energy density $U_1$ to a
new energy density U. Let the former restriction to a small expansion be
dropped. The change in direction of the rays that is produced by the
moving piston, according to Eq. (5A.13), will tend to make the radiation
no longer isotropic. We can obviate this inconvenient effect, however,
by letting part of the walls of the cylinder reflect perfectly but diffusely.
A surface of magnesium oxide does this very well. Then, if the expansion
is made very slowly, because all rays (except a negligible few) strike the
diffusing surface repeatedly, the radiation will be kept effectively
isotropic; and the pressure on the piston, according to Eq. (5A.3), will be
at all times equal to $\frac{1}{3}U$.

Let the cylinder have a cross section A and a (variable) length l.
Then, if U is the energy density at any moment, when the piston moves
outward a distance dl, work $p\,dV = \frac{1}{3}UA\,dl$ is done on it by the force
due to radiation pressure. This work is done at the expense of the
enclosed energy, the total amount of which is lAU. Hence

$$\tfrac{1}{3}UA\,dl = -d(lAU) = -AU\,dl - Al\,dU$$

$$\frac{dU}{U} = -\frac{4}{3}\,\frac{dl}{l}$$

and, after integration,

$$\log U = -\tfrac{4}{3}\log l + \text{const} \tag{5A.15a}$$

$$U = C\,l^{-4/3} \tag{5A.15b}$$

C denoting a constant.
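
The integration can be checked numerically. The sketch below steps the energy balance dU/dl = -(4/3)U/l forward with a simple midpoint rule and compares the result with the closed form U = C l^(-4/3); the step count and the sample lengths are arbitrary illustrative values.

```python
def expand(U1, l1, l2, steps=100_000):
    """Integrate dU/dl = -(4/3) U / l from l1 to l2 with midpoint steps."""
    U, l = U1, l1
    dl = (l2 - l1) / steps
    for _ in range(steps):
        k1 = -(4.0 / 3.0) * U / l                 # slope at the start of the step
        U_mid = U + 0.5 * dl * k1                 # estimate at the midpoint
        k2 = -(4.0 / 3.0) * U_mid / (l + 0.5 * dl)
        U += dl * k2
        l += dl
    return U

def closed_form(U1, l1, l2):
    """U = C l^(-4/3), with C fixed by the initial state."""
    return U1 * (l2 / l1) ** (-4.0 / 3.0)

if __name__ == "__main__":
    # Lengthening the cylinder eightfold should reduce U by a factor of 16.
    print(expand(1.0, 1.0, 8.0), closed_form(1.0, 1.0, 8.0))
```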

Now it can be shown by thermodynamic reasoning that an expansion
of the type considered here cannot destroy the blackbody property of
the radiation. For, at a certain instant, suppose that the expansion has
reduced the total energy density to $U_2$, and let $T_2$ be the temperature of
an enclosure in which the density has this same value. Suppose that in
the cylinder there were more radiation per unit volume of wavelengths
near some value $\lambda'$ than at the same wavelengths in the enclosure and
less radiation near some other wavelength $\lambda''$. It would then be possible
to cause a little radiation to pass from the cylinder into a second
enclosure at a temperature $T_2'$ slightly above $T_2$ by covering the opening in
the base of the cylinder with a plate transmitting wavelengths near $\lambda'$
but reflecting all others and putting the cylinder into communication
with the second enclosure through this opening. In a similar way,
enough radiation near $\lambda''$ could be passed into the cylinder from an
enclosure at a slightly lower temperature $T_2''$ to restore the total energy
to $U_2$. Then the radiation could be compressed back to $U_1$, the changes
in U and l and the amount of the work done being just the reverse of these
quantities during the expansion. Finally, putting the cylinder again into
communication with the enclosure at $T_1$, we could allow the spectral
distribution to be restored to that proper to a blackbody at $T_1$ but without
any net transfer of energy between cylinder and enclosure, since the total
energy density has already been restored to that corresponding to $T_1$.
Thus we should have performed a cyclic operation, the only effect of
which is to transfer heat energy from an enclosure at $T_2''$ to one at a
higher temperature $T_2'$. But this is inconsistent with the second law of
thermodynamics.

Thus black radiation must remain black during any slow adiabatic
expansion or compression. Its density and temperature, however,
decrease. In the case under discussion, the last equation, in
combination with Eq. (5A.8), gives

$$T \propto \frac{1}{l^{1/3}} \tag{5A.16}$$

To determine the effect on the spectral distribution, we must find
the average rate at which wavelengths are increased. Suppose, first,
that the walls of the cylinder and the piston reflect specularly. Then
any ray preserves its angle of inclination $\theta$ to the axis of the cylinder in
spite of repeated reflections (see Fig. 5A.6) and has, therefore, a constant
component of velocity $c\cos\theta$ perpendicular to the piston. The ray
strikes the piston $(c\cos\theta)/2l$ times a second, and its wavelength,
according to (5A.14), increases each time by $(2V\lambda\cos\theta)/c$, V being the speed
of the piston. Thus its wavelength increases at a rate

$$\frac{d\lambda}{dt} = \frac{c\cos\theta}{2l}\,\frac{2V\lambda\cos\theta}{c} = \frac{V\lambda}{l}\cos^2\theta$$

If rays of wavelength $\lambda$ are equally distributed in direction, the average
value of $\cos^2\theta$ for all of them is $\frac{1}{3}$, according to Eq. (5A.2); hence the
average value of $d\lambda/dt$ for these rays is

$$\frac{d\lambda}{dt} = \frac{1}{3}\,\frac{V\lambda}{l} = \frac{\lambda}{3l}\,\frac{dl}{dt}$$

since $V = dl/dt$.

Fig. 5A.6

To simplify the calculation, suppose now that there is a diffusely
reflecting spot on the walls. Then all the rays will take turns moving
in the various directions, and all rays of wavelength $\lambda$ will undergo the
average change of wavelength just calculated. (This conclusion can be
confirmed by a more complete analysis for which we have no space.)
The last equation can be integrated thus:

$$\frac{d\lambda}{\lambda} = \frac{1}{3}\,\frac{dl}{l} \tag{5A.17a}$$

$$\log\lambda = \tfrac{1}{3}\log l + \text{const} \tag{5A.17b}$$

$$\lambda \propto l^{1/3} \propto \frac{1}{T} \tag{5A.17c}$$

by (5A.16). Thus we have the conclusion that each spectral component
of blackbody radiation must change in wavelength in such a way that

$$\lambda \propto \frac{1}{T}$$
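
The two ingredients of this result, the directional average of cos² θ and the integration of the averaged rate, can be sketched numerically as follows. The quadrature and step sizes are illustrative choices; the names are not from the text.

```python
import math

def mean_cos2(n=10_000):
    """Average of cos^2(theta) over isotropically distributed directions.

    For isotropic rays the weight of directions between theta and
    theta + d(theta) is sin(theta) d(theta); a midpoint rule is used.
    """
    dtheta = math.pi / n
    num = den = 0.0
    for i in range(n):
        theta = (i + 0.5) * dtheta
        w = math.sin(theta)
        num += math.cos(theta) ** 2 * w
        den += w
    return num / den

def stretch_wavelength(lam1, l1, l2, steps=20_000):
    """Integrate d(lambda)/dl = <cos^2> * lambda / l with midpoint steps."""
    f = mean_cos2()
    lam, l = lam1, l1
    dl = (l2 - l1) / steps
    for _ in range(steps):
        lam_mid = lam + 0.5 * dl * f * lam / l
        lam += dl * f * lam_mid / (l + 0.5 * dl)
        l += dl
    return lam

if __name__ == "__main__":
    # An eightfold increase in l should double each wavelength, while
    # Eq. (5A.16) halves T, so the product lambda*T is unchanged.
    print(mean_cos2(), stretch_wavelength(1.0, 1.0, 8.0))
```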
5A.5 The Wien Displacement Law


Now let us fix our attention on a particular spectral range from $\lambda_1$ to
$\lambda_1 + d\lambda_1$, containing energy $U_{\lambda 1}\,d\lambda_1$ per unit volume in an enclosure at
temperature $T_1$. An adiabatic expansion which lowers the temperature
to $T_2$ changes the limits of this range, according to the result just reached,
to $\lambda_2$ and $\lambda_2 + d\lambda_2$, where

$$\frac{\lambda_2 + d\lambda_2}{\lambda_1 + d\lambda_1} = \frac{\lambda_2}{\lambda_1} = \frac{T_1}{T_2} \tag{5A.18a}$$

Therefore

$$\frac{d\lambda_2}{d\lambda_1} = \frac{T_1}{T_2} \tag{5A.18b}$$

The energy in $d\lambda_1$ is at the same time decreased, and in the same ratio as
the total energy is; for in the argument that led up to Eq. (5A.15), we
might have started with only the radiation in $d\lambda_1$ present in the cylinder.
Hence, $U_{\lambda 2}$ being the new value of $U_{\lambda 1}$,

$$\frac{U_{\lambda 2}\,d\lambda_2}{U_{\lambda 1}\,d\lambda_1} = \frac{U_2}{U_1} = \frac{T_2^4}{T_1^4}$$

and using (5A.18b),

$$\frac{U_{\lambda 2}}{U_{\lambda 1}} = \frac{e_{\lambda 2}}{e_{\lambda 1}} = \frac{T_2^5}{T_1^5} \tag{5A.19}$$

Here $e_\lambda$ is the emissive power of a blackbody, given in terms of $U_\lambda$ by
Eq. (5A.5).

Thus, if the values of $U_\lambda$ or $e_\lambda$ at two different temperatures are
compared not at the same wavelength but at wavelengths inversely
proportional to T, then their values will be found to be proportional to the
fifth power of the absolute temperature. This conclusion is known as
the Wien displacement law. It can also be expressed by saying that at a
wavelength varied so that the product $\lambda T$ is held constant, $U_\lambda/T^5$ or
$e_\lambda/T^5$ has the same value at all temperatures. Thus if either of these
ratios is plotted against $\lambda T$ as abscissa, a single curve valid at all
temperatures will be obtained (Fig. 5.2).
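
The scaling can be illustrated with Planck's formula. The sketch below assumes the standard wavelength form of Planck's law (taken here to be what Eq. (5.20) expresses; the formula itself is not reproduced in this section) and rounded values of h, c, and k. It evaluates U_λ/T⁵ at two temperatures for the same value of the product λT.

```python
import math

H = 6.626e-34  # Planck constant, J s (rounded, assumed value)
C = 2.998e8    # speed of light, m/s (rounded, assumed value)
K = 1.381e-23  # Boltzmann constant, J/K (rounded, assumed value)

def planck_u_lambda(lam, T):
    """Spectral energy density per unit wavelength (standard Planck form)."""
    return 8.0 * math.pi * H * C / lam**5 / math.expm1(H * C / (lam * K * T))

def scaled_density(lam_T, T):
    """U_lambda / T^5 evaluated at the wavelength for which lambda*T = lam_T."""
    return planck_u_lambda(lam_T / T, T) / T**5

if __name__ == "__main__":
    lam_T = 2.9e-3  # m K, an illustrative value of the product
    for T in (1000.0, 3000.0, 6000.0):
        print(T, scaled_density(lam_T, T))
```

The printed ratios should agree to within rounding, which is the single-curve property described above.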


Problems


1. (a) Estimate the surface temperature of the sun, assuming it is
essentially a blackbody of radius $7 \times 10^8$ m which is $1.5 \times 10^{11}$ m from
the earth, from the fact that the solar constant (power per unit area on a
surface normal to the sun's rays) is 1400 W/m².

(b) Estimate the surface temperature of the sun from the fact that
$\lambda_m$ = 4900 Å.


2. Show that Planck's radiation law, Eq. (5.20), reduces to Wien's law,
Eq. (5.4), when $h\nu \gg kT$ and to the Rayleigh-Jeans law, Eq. (5.10),
when $h\nu \ll kT$.

3. If the fireball of a nuclear weapon can be approximated at some
instant to be a blackbody of 0.5 m radius with a surface temperature
of $10^7$ °K, find (a) the total electromagnetic power radiated, (b) the
wavelength at which maximum energy is radiated, and (c) the value of kT
in electron volts.

Ans: (a) $1.8 \times 10^{21}$ W; (b) 2.9 Å; (c) 0.86 keV

4. From Fig. 5.7, assuming that the sun is a blackbody with $\lambda_m$ = 4900 Å,
estimate the fractions of the solar radiation in the visible (3500 to 7000 Å),
the infrared, and the ultraviolet parts of the spectrum.

Ans: 0.43, 0.50, 0.07

5. Show that if $\lambda_m$ represents the wavelength at which maximum energy
is radiated in wavelength interval $d\lambda$, then $\lambda_m T = ch/4.965k$, which is
equivalent to Eq. (5.3).

6. A very bright star of spectral class B0 has an effective surface
temperature of 20,000°K. Find the wavelength at which $e_\lambda$ is maximum,
assuming the radiation is approximately blackbody. How does $e_\lambda$ at
5000 Å for this star compare with that of the sun at this wavelength
if the surface temperature of the sun is 6000°K?

7. The intensity of radiation per unit wavelength interval from a
fireball sphere of radius 2 m has been measured. When the resulting
curve of intensity as a function of wavelength is plotted, it has its
maximum at 289.8 Å and is closely approximated by the blackbody curve.
Find the temperature and the total power radiated by the sphere at
the instant in question.

8. Show that the maximum of the $U_\nu$ curve (Fig. 5.7) in terms of
$x = h\nu/kT$ occurs for $\nu = 2.82kT/h$.


9. Find the value of the Stefan constant $\sigma$ in terms of k, c, and h by
the use of Eq. (5.22).

Ans: $2\pi^5 k^4/15c^2h^3$

10. Planck originally computed his value of h and the value of the
Boltzmann constant k by using experimental results of Rubens and
Kurlbaum, which gave $U = (7.6 \times 10^{-16}$ J/m³-°K⁴$)\,T^4$ and

$$\lambda_m T = 2.9 \times 10^{-3} \text{ m-°K}$$

Find h and k from these data.

11. Estimate from Fig. 5.7 the fraction of the energy radiated by a
blackbody which is in the visible region (3500 to 7000 Å) for temperatures
of 1000, 2000, and 3000°K.

12. Show with the aid of Eq. (5A.5) that both the Stefan-Boltzmann
formula and Wien's displacement law in the form of Eq. (5.3) follow as
mathematical consequences of Planck's formula, Eq. (5.20a).
13. If in a blackbody cavity the number of photons per unit volume
with wavelength between $\lambda$ and $\lambda + d\lambda$ is $N_\lambda\,d\lambda$, find the wavelength
at which $N_\lambda$ is maximum in terms of h, k, and T.

14. Show that in a blackbody cavity the number of photons per unit
volume with frequency between $\nu$ and $\nu + d\nu$ is

$$N_\nu\,d\nu = \frac{8\pi\nu^2\,d\nu}{c^3\,(e^{h\nu/kT} - 1)}$$

and verify that $N_\nu$ has its maximum for $\nu = 1.59kT/h$.

15. Using the result of the preceding problem, show that the number
of photons per unit volume in a blackbody cavity is

$$N = \frac{8\pi k^3T^3}{c^3h^3}\int_0^\infty \frac{x^2\,dx}{e^x - 1} = \frac{8\pi k^3T^3}{c^3h^3}\,(2.404)$$

where $x = h\nu/kT$. Verify that the integral has the value $2\zeta(3)$, where
$\zeta(n) = 1 + 1/2^n + 1/3^n + \cdots$ [$\zeta(3) = 1.202$].

16. Using the results of the preceding problem, verify that the total 
number of photons emitted per second per unit area by a blackbody is 


and that the average energy per photon is 2.70kT. 

17. The kinetic temperature kT of a hot plasma or a nuclear weapon
is sometimes used in preference to the Kelvin temperature. For a 2-keV
weapon estimate (a) the Kelvin temperature, (b) the wavelength at
which $e_\lambda$ is maximum and the energy of photons of this wavelength,
(c) the average energy per photon (see Prob. 16), (d) the photon energy
for which $U_\nu$ is maximum (see Prob. 8), and (e) the photon energy for
which the number of photons per unit frequency range is maximum
(see Prob. 14).

Ans: (a) 23,200,000°K; (b) 1.25 Å, 9920 eV; (c) 5400 eV; (d) 5640 eV;
(e) 3180 eV

18. Derive Eq. (5.22) from (5.21), showing that

$$\int_0^\infty \frac{x^3\,dx}{e^x - 1} = 6\zeta(4)$$

where $\zeta(n) = 1 + 1/2^n + 1/3^n + \cdots$.
[Accept the fact that $\zeta(4) = \pi^4/90$.]


chapter six 


Electrons and the Photoelectric Effect 


During the nineteenth century the atomic hypothesis was strongly
buttressed, and the electromagnetic nature of light was established.
Concurrent with these theoretical strides came great advances in experimental
chemistry and electricity. The development of induction coils,
electromagnets, and vacuum pumps paved the way for experimental work in
electrical discharge through rarefied gases, which culminated in the
discovery of the electron, of x-rays, and of radioactivity in the concluding
decade. Suddenly it became clear that atoms have structure, even though the
word "atom" originally meant "uncuttable."


6.1 Electricity in Matter 


The atomicity of electricity was suggested by the laws of electrolysis.
Faraday found that when the same quantity of electricity deposits
univalent ions of different kinds, the amounts deposited as measured by
weight are proportional to the ionic weights in the chemical sense. By
the ionic weight is meant here the sum of the atomic weights of atoms,
one or more, composing the ion. It follows that the same quantity F
of electricity suffices to deposit a gram ion of any kind of univalent ion,
i.e., a mass of the ions equal in number of grams to the ionic weight.
The value of this quantity F, called the faraday, is 96,487 C.

The number of ions in a gram ion or of atoms in a gram atom is in 
all cases the same. It follows that all univalent ions carry the same unit 
of charge, usually denoted by e. For this natural unit Stoney proposed 
the name electron in 1891. 
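
The argument fixes the unit of charge once the number of ions in a gram ion is known. As a minimal sketch, taking the faraday from the text and assuming a rounded modern value for the number of ions per gram ion (this number is not given in the passage), the charge per univalent ion comes out as follows.

```python
FARADAY = 96487.0    # C per gram ion, the value quoted in the text
AVOGADRO = 6.022e23  # ions per gram ion (assumed modern value, not from the text)

def charge_per_univalent_ion(faraday=FARADAY, avogadro=AVOGADRO):
    """Charge carried by one univalent ion, e = F / N."""
    return faraday / avogadro

if __name__ == "__main__":
    print(charge_per_univalent_ion())  # in coulombs
```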

An independent line of thought which likewise suggested the
presence of definite electrical charges in matter was the treatment of
dispersion in developing the electromagnetic theory of light. L. Lorenz
suggested in 1880 that refractive media may contain small charged
particles which can vibrate with a natural frequency $\nu_0$ about a fixed
equilibrium position. By assuming one or more sets of such particles, with
different natural frequencies, it was possible to account completely for
the phenomena of dispersion. But the data on dispersion could not be
made to furnish any clue to the magnitude of the charges on these
particles or even to their sign.

In the middle of the eighteenth century it had been found that air
in the neighborhood of hot bodies is an electrical conductor. In a
systematic investigation begun in 1880 Elster and Geitel showed that at a
red heat charged bodies lose their electrification. In 1883, in the course
of his development of the incandescent lamp, Edison worked with some
evacuated bulbs containing three electrodes connected to the outside.
After a carbon filament was attached to two of these electrodes and
brought to incandescence, Edison observed that when the third electrode
was made positive relative to the filament, there was a current in the
circuit connected to the third electrode. When the latter was made
negative, no current was established. Thus Edison discovered
thermionic emission several years before the existence of the electron was
established. (Thermionic emission is discussed further in Secs. 6.8
and 21.8.)


6.2 Discovery of Photoelectricity 


Still another suggestion that atoms might be divisible came from the 
work of Heinrich Hertz, who first produced from laboratory circuits the 
electromagnetic waves predicted by Maxwell. Hertz generated the 
waves with an oscillating circuit containing a spark gap P and detected
them by means of sparks at another gap S in a suitably tuned circuit.
In 1887, to facilitate observation, he enclosed S in a black box and found 
that S must be made shorter to allow sparks to pass. Even a plate of 
glass interposed between P and S would cause this effect. Hertz con- 
cluded that ultraviolet light coming from spark P facilitated the passage 
of sparks at S when no obstacle was interposed. He found that light 
from another spark was equally effective but that in all cases the light 
must fall on the terminals themselves. This discovery of the photo- 
electric effect was accidental in the sense that it was unplanned. While 
investigating one physical phenomenon, Hertz discovered quite a dif- 
ferent one whose existence he had not even suspected. 

Hertz's discovery at once attracted numerous investigators.
Hallwachs found that a freshly polished zinc plate, insulated and connected
to an electroscope, when charged negatively and illuminated by
ultraviolet light, would lose its charge, but there was no effect if the charge
was positive. He observed, by using an electrometer instead of an
electroscope, that a neutral insulated plate, when illuminated, acquired
a small positive potential, i.e., lost a negative charge. Stoletow devised
an arrangement, shown diagrammatically in Fig. 6.1, for producing a
continuous photoelectric current. P is a photoelectrically sensitive plate,
say a polished zinc plate, connected to the negative terminal of a battery
B of several cells. S is a wire grating or gauze connected to the positive
terminal of the battery through a very sensitive galvanometer G. When
ultraviolet light falls upon P, a continuous current is observed in G,
indicating that a negative charge passes from P to S. No current exists
if the battery is reversed.

Fig. 6.1 Stoletow's arrangement.

Elster and Geitel showed that there is a close relation between the
contact-potential series of metals and the photoelectric effect: the more
electropositive the metal, the longer the wavelength to which it responds
photoelectrically. The alkali metals, sodium, potassium, and rubidium,
were found to be sensitive even to light of the visible spectrum. That
the charge is carried by negatively electrified particles was clearly
indicated by experiments of Elster and Geitel, who showed that a transverse
magnetic field diminishes the photoelectric current if the phenomenon
takes place in a vacuum. It was early found that the effect persisted
even to the highest attainable vacuum and was quite independent of
the "degree" of the vacuum after a certain low pressure had been reached.
This seemed to indicate that the gas molecules themselves in the region
between P and S could not be acting as carriers of the charge. The
suggestion was made that, perhaps, under the influence of light,
negatively charged ions of the cathode became detached and moved to the
anode. This suggestion was rendered untenable by an experiment by
Lenard, in which a clean platinum wire acted as anode and a sodium
amalgam as cathode, both being in an atmosphere of hydrogen. The
photoelectric current was allowed to flow until about $3 \times 10^{-5}$ C had
passed through the circuit. If the carriers of the charge were ions of
sodium, each atom could hardly be expected to carry a larger charge
than it carries in electrolysis. There should, therefore, have been
deposited on the platinum wire at least $0.7 \times 10^{-5}$ mg of sodium, a
quantity sufficient to be detectable by the well-known flame test. On
removing the wire from the bulb, however, no trace of sodium could be
detected.
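
Lenard's estimate follows from the electrolysis relation of Sec. 6.1: a charge Q carried by univalent ions of ionic weight M deposits a mass QM/F. The sketch below assumes a transferred charge of 3 × 10⁻⁵ C (the reading that reproduces the quoted 0.7 × 10⁻⁵ mg) and an ionic weight of 22.99 for sodium, which is not stated in the text.

```python
def deposited_mass_mg(charge_c, ionic_weight_g=22.99, faraday=96487.0):
    """Mass in mg deposited by univalent ions carrying a total charge charge_c.

    ionic_weight_g is the mass in grams of one gram ion (sodium assumed);
    faraday is the charge in coulombs needed to deposit one gram ion.
    """
    grams = charge_c / faraday * ionic_weight_g
    return grams * 1.0e3

if __name__ == "__main__":
    print(deposited_mass_mg(3.0e-5))  # mg of sodium, assumed charge of 3e-5 C
```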
If the photoelectric current is carried neither by the gas surrounding
the cathode nor by ions from the cathode, what are the carriers? The
answer to this question came through the convergence¹ of a number of
different lines of evidence, which finally culminated in the discovery of
the electron.


¹ Collectively these developments illustrate many characteristics of the growth
of modern physics:

1. A number of seemingly unrelated lines of research frequently converge to
provide an explanation of, or a theory for, a group of phenomena not hitherto
understood.
2. The explanation or theory thus evolved is then found to bear directly on other
branches of physics and often on other sciences.
3. Thus, the methods of physics are both synthetic and analytic, a fortunate
circumstance which makes it possible for the physicist to comprehend physics
as a whole in spite of the vast increase, particularly in recent years, of factual
knowledge.
4. These discoveries, of both fact and theory, are the sine qua non of applied
physics and of much of industry; witness, as a single example, the wide use of
the various kinds of photo- and AE tubes.
5. This application of science to industry very frequently reacts to provide the
research man with improved tools. Electron tubes, manufactured primarily for
industrial purposes, are a boon to the physicist in his research laboratory.


6.3 The Zeeman Effect


In 1862 Faraday, looking for a possible effect of a magnetic field upon a 
light source, placed a sodium flame between the poles of a strong electro- 
magnet and examined the D lines by a spectroscope. He was unable to 
detect any change in the appearance of the lines. 

Faraday’s failure to observe the effect that he expected was due 
to the inadequate resolving power of his apparatus. For, in 1896, 
Zeeman, repeating Faraday’s experiment with the improved techniques 
then available, discovered that spectral lines are split up into components 
when the source emitting the lines is placed in a very strong magnetic 
field; furthermore, following a suggestion by H. A. Lorentz, he found that 
these components are polarized. 

The simplest case, shown in Fig. 6.2, involves what is known as
the normal Zeeman effect. A single line of wavelength $\lambda_0$ in the absence
of a magnetic field is split into three components with wavelengths
$\lambda_0 - \Delta\lambda$, $\lambda_0$, and $\lambda_0 + \Delta\lambda$ when viewed at right angles to the magnetic
induction B; the lines of wavelengths $\lambda_0 \pm \Delta\lambda$ are plane-polarized with
the electric vector perpendicular to B, while the center line has the same
wavelength as the original and is plane-polarized with the electric vector
parallel to B. If one pole piece is drilled longitudinally so that one can
view the source in a direction parallel to B, only the two components
with wavelengths $\lambda_0 \pm \Delta\lambda$ are seen; they are circularly polarized, as
shown in Fig. 6.2.

Fig. 6.2 The normal Zeeman effect; below are shown the lines and
polarizations as viewed perpendicular to and along B.

A short description will be given of the classical Zeeman-Lorentz
theory in spite of the fact that classical theory is not applicable to such
atomic phenomena. Nevertheless it gives a simple, concrete picture
which assists the memory and yields several correct and significant
predictions. Classically, a charge performing simple harmonic motion
radiates light. Assume, then, that there is within an atom a particle
of charge q and mass m bound to its equilibrium position by a force F
proportional to its displacement; $\mathbf{F} = -b\mathbf{r}$. The cartesian components
of F are $F_x = -bx$, $F_y = -by$, $F_z = -bz$. Hence in the absence of a
magnetic field each component of the displacement varies harmonically
with the same frequency

$$\nu_0 = \frac{1}{2\pi}\sqrt{\frac{b}{m}} \tag{6.1}$$

although perhaps with different amplitudes and phases.

Now let a magnetic field be applied. The component of the motion 
of the particle parallel to the field is unaffected. This gives a line of the 
original wavelength polarized with the electric vector parallel to the 
applied B when observed at right angles to B but no line at all when 
observed along B—precisely as observed. 

In general any simple harmonic vibration can be regarded as the 
sum of two superposed circular motions executed with the same fre- 
quency but in opposite directions. Application of a magnetic field per- 
pendicular to the original vibration causes the frequencies of these two 
circular motions to be different, since the magnetic field introduces an 
additional force $q(\mathbf{v} \times \mathbf{B})$ on the particle. Viewed perpendicular to B,
radiation from the circular motion would be plane-polarized with the
electric vector perpendicular to B. For steady circular motion

$$\frac{mv^2}{r} = br \pm qvB \tag{6.2}$$

Putting $v = 2\pi r\nu$ and $b = 4\pi^2 m\nu_0^2$ from Eq. (6.1) into this equation and
solving for $\nu$ yields

$$\nu = \pm\frac{qB}{4\pi m} + \sqrt{\nu_0^2 + \left(\frac{qB}{4\pi m}\right)^2}$$

In all cases of interest $\nu_0 \gg qB/4\pi m$, so that the frequencies of the two
circular motions are, to a sufficient approximation,

$$\nu = \nu_0 \pm \frac{qB}{4\pi m} \tag{6.3}$$


For a positive particle the slower rotation (longer $\lambda$) occurs in a
clockwise direction as seen by an observer looking at the source through
the hole in the pole piece, whereas Zeeman found counterclockwise
rotation for the line of lower frequency. Therefore the charge on the
radiating particle is negative. The ratio of charge to mass for the particle
can be computed from the separation between the outer lines, which is
given by $\Delta\nu = Bq/2\pi m$, and so

$$\frac{q}{m} = \frac{2\pi}{B}\,\Delta\nu \tag{6.4}$$

Zeeman's first observation indicated that q/m was of the order of
$10^{11}$ C/kg. Later, working with higher resolving power, he found
q/m = $1.6 \times 10^{11}$ C/kg, compared with the modern value of
$1.759 \times 10^{11}$ C/kg for the electron.
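
To give a feeling for the magnitudes, the sketch below evaluates the separation between the outer lines and inverts Eq. (6.4), and also compares the exact frequencies of the two circular motions with the approximation of Eq. (6.3). The field of 1 T and the line frequency of about 5.09 × 10¹⁴ Hz (roughly that of the sodium D lines) are illustrative assumptions, not values from the text.

```python
import math

def outer_line_separation_hz(q_over_m, B):
    """Separation between the two outer lines, Delta nu = B q / (2 pi m)."""
    return B * q_over_m / (2.0 * math.pi)

def charge_to_mass(delta_nu, B):
    """Eq. (6.4): q/m = 2 pi Delta nu / B."""
    return 2.0 * math.pi * delta_nu / B

def exact_frequencies(nu0, q_over_m, B):
    """Exact frequencies nu = -s + root and +s + root, with s = qB/(4 pi m)."""
    s = q_over_m * B / (4.0 * math.pi)
    root = math.sqrt(nu0**2 + s**2)
    return root - s, root + s

if __name__ == "__main__":
    q_over_m = 1.759e11  # C/kg, modern electron value quoted in the text
    B = 1.0              # tesla, assumed field
    nu0 = 5.09e14        # Hz, assumed line frequency
    print(outer_line_separation_hz(q_over_m, B))
    print(exact_frequencies(nu0, q_over_m, B))
```

For these assumed values the splitting is a few parts in 10⁵ of the line frequency, which is consistent with the remark that a high resolving power was needed to observe it.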

Later experiments have shown that the Zeeman effect is much 
more complicated than this picture suggests. Nevertheless the work of 
Zeeman and Lorentz provided the first evidence that spectral lines were 
radiated by negative charges, yielded a reasonably good value for e/m 
(the charge on the electron is —e), and stimulated the accumulation of 
data from which a better understanding could be achieved. Very often 
more than three components are observed, and the theory above fails 
completely to explain these more complicated patterns; indeed, it does 
not even explain why only vibrations parallel and perpendicular to B 
should be considered! (The Zeeman effect is discussed further in Chap. 
18.) 


6.4 Cathode Rays and the Electron 


Fig. 6.3 Tube for studying electrical discharge at low pressure.

Before 1897, many studies had been made of that beautiful phenomenon,
the discharge of electricity through rarefied gases. Let the discharge
from an induction coil or an electrostatic machine pass between the
negative terminal C (cathode) and the positive terminal A (anode) sealed
into a glass tube (Fig. 6.3) which is being exhausted through the side
tube T. At a very low pressure, there appears around the cathode a
dark space, known as the Crookes dark space, which, with further decrease
in pressure, grows longer, i.e., extends farther toward D, until finally it
reaches the glass walls of the tube, which then have a fluorescent glow.
If screens pierced with holes $S_1$ and $S_2$ are introduced, the glow is confined
to a spot on the end of the tube at D, thus indicating that some things
called cathode rays are proceeding from the cathode and causing the glass
to fluoresce. By suitably curving the cathode, these rays can be focused
on a piece of platinum foil within the tube, heating it to incandescence.
That cathode rays bear negative charge was inferred from their
deflections in electric and magnetic fields; in 1895 Perrin caught the rays in an
insulated chamber connected to an electroscope and confirmed that
they carry negative charge.

In 1897 J. J. Thomson measured the ratio of charge to mass for
cathode rays with the apparatus of Fig. 6.4. A highly evacuated tube
contained a cathode C and an anode $A_1$ having a small slit through which
the cathode rays passed. A second slit $A_2$ electrically connected to $A_1$
helped define a sharp beam of cathode rays, which produced a small
fluorescent patch at O in the absence of fields beyond $A_2$. But when a
potential difference V was applied between plates $P_1$ and $P_2$, the beam
was deflected to point O'. By the use of a pair of Helmholtz coils, not
shown, a uniform magnetic field out of the plane of the paper was
produced throughout the volume between $P_1$ and $P_2$. Two observations
were then made. With a given electric intensity E between the plates
the magnetic induction B was adjusted until the beam struck at the
original position O. Then the downward force eE due to the electric
field was balanced by the force $e\mathbf{v} \times \mathbf{B}$ due to the magnetic field. Hence
eE = evB and

$$v = \frac{E}{B} \tag{6.5}$$

Fig. 6.4 (a) Thomson's apparatus for determining e/m for electrons.
(b) Deflection of an electron by the electrostatic field (not to scale).


Thus the first observation serves to measure the velocity v of the particles;
the crossed electric and magnetic fields serve as a velocity selector.

The magnetic field is then removed, and the deflection OO' caused
by the electrostatic field alone is measured. From this the deflection
$y_1$ of the particles as they pass over the distance l between the plates can
be determined. This deflection results from a uniform acceleration
$a_y = eE/m$ during a time l/v; hence,

$$y_1 = \frac{1}{2}\,\frac{eE}{m}\left(\frac{l}{v}\right)^2 \tag{6.6}$$

from which e/m can be computed.
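
Combining Eqs. (6.5) and (6.6) gives e/m = 2y₁E/(B²l²). The following minimal sketch carries out that reduction; the field strengths, plate length, and function names are hypothetical illustration values, not Thomson's data.

```python
def beam_speed(E, B):
    """Eq. (6.5): speed selected by the crossed fields, v = E/B."""
    return E / B

def deflection_between_plates(e_over_m, E, B, l):
    """Eq. (6.6): y1 = (1/2)(eE/m)(l/v)^2 with v taken from Eq. (6.5)."""
    v = beam_speed(E, B)
    return 0.5 * e_over_m * E * (l / v) ** 2

def charge_to_mass(y1, E, B, l):
    """Eqs. (6.5) and (6.6) solved for e/m: e/m = 2 y1 E / (B l)^2."""
    return 2.0 * y1 * E / (B * l) ** 2

if __name__ == "__main__":
    E = 2.0e4   # V/m, hypothetical electric intensity
    B = 5.0e-4  # T, hypothetical magnetic induction
    l = 0.05    # m, hypothetical plate length
    y1 = deflection_between_plates(1.759e11, E, B, l)
    print(beam_speed(E, B), y1, charge_to_mass(y1, E, B, l))
```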

Thomson found that e/m determined in this way was independent
of the kind of gas in the tube (air, H₂, or CO₂) and likewise independent
of the electrodes (Al, Fe, or Pt). His final determination was
e/m = $1.7 \times 10^{11}$ C/kg, a value almost identical with the value found by
Zeeman for the particles taking part in light emission. It is very much
greater than e/m for hydrogen atoms in electrolysis.


6.5 The Electronic Charge 


Shortly before Thomson's work the study of the charges on gaseous ions
such as are produced in a gas by x-rays had begun. To measure their
charges, Townsend, working in Thomson's laboratory, utilized the clouds
that form about the ions in saturated air. By observing the rate of
fall of the cloud and applying Stokes's law for the free fall of spheres
through a viscous medium, he was able to determine the size of the
droplets; from a measurement of the total amount of water in the cloud,
he could then calculate the number of droplets it had contained. He
assumed that each droplet contained just one ion; hence, having measured
also the total charge on the cloud, he was able to calculate the charge
on a single ion. For this charge he obtained a value of about $3 \times 10^{-10}$
electrostatic unit. A repetition of the measurements with some
modifications by Thomson gave the value $6.5 \times 10^{-10}$ esu ($2.2 \times 10^{-19}$ C).

Thomson assumed that the charge on the gaseous ions was the same 
as that on his cathode particles. It followed, then, that the cathode par- 
ticles must be previously unknown particles of extremely small mass, 
which he called corpuscles or primordial atoms. For many years English 
writers stuck to the name corpuscle for these particles, using the word 
electron in Stoney’s original sense to denote the amount of charge 
carried by a corpuscle or a univalent ion; but others, including Lorentz, 
called the corpuscles themselves electrons, and this usage ultimately 
became well established. It was generally assumed that electrons are 
a constituent part of all atoms and are responsible for the emission of 
light by them, thus accounting for the fact that the ratio e/m had been 
found to be the same for the vibrating particles causing the Zeeman 
effect as for the cathode rays. Thus was made possible the explanation 
of a number of more or less diverse phenomena on the basis of a single 
concept. 

A more reliable method of measuring ionic charges was developed 
by Millikan in 1909. He found that tiny droplets of oil in ionized air, 
viewed under a microscope, would frequently pick up charges and could 
then be held suspended, or accelerated upward or downward, by applying 
a suitable electric field. When uncharged, the droplets fall at a slow, 
uniform rate, their weight being balanced by the drag due to the viscosity 
of the surrounding air. By observing their rate of fall, Millikan was 
able to determine the size and weight of the droplets. According to 
Stokes’s law, a sphere of radius a moving at a steady slow speed vo through 
a fluid whose coefficient of viscosity is 7 experiences a resisting force 
F = 6xnav. If we equate this force to the weight of a droplet, which is 
$ra’pg in terms of the density p of the oil, we have 


$7a%pg = brnavo (6.7) 


Now suppose the drop picks up a charge $q$ and that a vertical electric
field $E$ is present. Then a force $qE$ is added to the weight, and we have


$$\tfrac{4}{3}\pi a^3 \rho g + qE = 6\pi\eta a v_1 \tag{6.8}$$

$v_1$ being the new velocity of steady fall. From these equations we find


$$qE = 6\pi\eta a (v_1 - v_0)$$

and inserting in this equation the value of $a$ given by (6.7) and solving
for $q$, we find


$$q = \frac{6\pi\eta (v_1 - v_0)}{E} \left( \frac{9\eta v_0}{2\rho g} \right)^{1/2} \tag{6.9}$$


All quantities in this last expression being known, q can be calculated. 
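
As a hedged illustration of Eqs. (6.7) to (6.9), the following Python sketch recovers the charge on a droplet from its two steady fall speeds. The numerical inputs (viscosity of air, oil density, field strength, droplet radius) are assumed round values chosen for illustration, not Millikan's data, and the function names are hypothetical.

```python
import math

G = 9.81  # m/s^2, assumed value of g


def droplet_radius(v0, eta, rho, g=G):
    """Radius a from Eq. (6.7): (4/3) pi a^3 rho g = 6 pi eta a v0."""
    return math.sqrt(9.0 * eta * v0 / (2.0 * rho * g))


def droplet_charge(v0, v1, field, eta, rho, g=G):
    """Charge q from Eq. (6.9), i.e. q E = 6 pi eta a (v1 - v0)."""
    a = droplet_radius(v0, eta, rho, g)
    return 6.0 * math.pi * eta * a * (v1 - v0) / field


# Assumed illustrative inputs in SI units (not measured data)
eta_air = 1.8e-5          # viscosity of air, Pa s
rho_oil = 900.0           # density of the oil, kg/m^3
field = 1.0e5             # vertical electric field, V/m
a_true = 1.0e-6           # droplet radius, m
q_true = 3 * 1.602e-19    # three elementary charges, C

# Forward model: the fall speeds implied by Eqs. (6.7) and (6.8)
v0 = 2.0 * a_true**2 * rho_oil * G / (9.0 * eta_air)
v1 = v0 + q_true * field / (6.0 * math.pi * eta_air * a_true)

# Inverse step: what the experimenter computes from the observed speeds
q = droplet_charge(v0, v1, field, eta_air, rho_oil)
print(f"v0 = {v0:.3e} m/s, v1 = {v1:.3e} m/s, q = {q:.3e} C")
```

The round trip (choose $a$ and $q$, compute the speeds, then recover $q$) only checks the algebra; it says nothing about experimental uncertainties.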

Millikan found that the charges so calculated from his observa- 
tions were all multiples of a smallest charge, assumed to be the charge 
on the electron. The value of this smallest charge he reported to be 
$-e$, where $e = 4.774 \times 10^{-10}$ esu $= 1.591 \times 10^{-19}$ C.

This value of e was accepted for the next 20 years. In 1928, how- 
ever, Bäcklin pointed out that x-ray wavelengths as measured by means 
of a ruled grating could be made to agree with the same wavelengths 
measured by means of a calcite crystal provided Millikan’s value of e 
was assumed to be 0.4 percent too small. The discrepancy was later 
traced to Millikan’s having used too low a value of the viscosity of air, 
which appears as $\eta$ in Eq. (6.9). Improved measurements gave a value
of $\eta$ nearly 0.5 percent greater than Millikan’s, resulting in the value
$1.602 \times 10^{-19}$ C for $e$ as calculated from the oil-drop observations.
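
The size of this correction can be checked with a short calculation. In Eq. (6.9) the charge is proportional to the 3/2 power of the viscosity when the measured speeds, the field, and the oil density are held fixed, so a 0.5 percent change in viscosity shifts $e$ by about 0.75 percent. The sketch below assumes exactly 0.5 percent, whereas the text says "nearly"; the variable names are illustrative.

```python
E_MILLIKAN = 1.591e-19   # C, Millikan's reported value quoted in the text
ETA_INCREASE = 0.005     # assumed fractional increase in viscosity

# q scales as eta**1.5 for fixed v0, v1, E, rho and g (Eq. 6.9)
e_corrected = E_MILLIKAN * (1.0 + ETA_INCREASE) ** 1.5
print(f"corrected e = {e_corrected:.4e} C")
```

The result lies within about 0.1 percent of the quoted $1.602 \times 10^{-19}$ C, as expected from a rounded viscosity correction.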

It is now recognized, however, that what is obtained directly from 
the x-ray measurements is actually a value of $N_A$, the number of atoms
in 1 kg atom; $e$ is then calculated as $10^3 F/N_A$, where $F$ is the value of the
faraday (Sec. 6.1). The accuracy of the value of e obtained in this 
manner is greater than that of the oil-drop observations. Millikan’s 
work nevertheless remains of fundamental importance. He demon- 
strated that electric charges in ionized gases occur in multiples of a 
single fundamental unit, and the close agreement of his value of e with 
that required by the x-ray data constitutes an important confirmation
of the principles underlying the calculation of e from the x-ray data
themselves.


6.6 Photoelectrons 


The discovery of the electron suggested the hypothesis that the photo- 
electric effect is due to the liberation of electrons. This hypothesis was 
confirmed in 1900 by Lenard. By measuring the deflection of the 
“photoelectric rays” in a known magnetic field, he found a value of e/m 
in qualitative agreement with Thomson’s value for electrons. Lenard’s 
method of determining e/m for photoelectrons involves basic principles 
which, with ever increasing refinement, have been widely applied in 
charged-particle physics. His apparatus is shown diagrammatically in 
Fig. 6.5a. A glass tube, evacuated through the side tube T, contained
an aluminum cathode C, which could be illuminated by ultraviolet light
from a spark S, the light passing through the quartz plate Q. A screen
A, with a small hole at its center and connected to earth, served as anode.
$P_1$ and $P_2$ were small metal electrodes connected to electrometers. When
C was illuminated and charged to a negative potential of several volts,
photoelectrons were liberated and accelerated toward the anode A. A
few electrons passed through the hole in A and proceeded thereafter
at uniform velocity to the electrode $P_1$, their reception there being
indicated by electrometer 1. But if, by means of a pair of Helmholtz
coils (represented by the dotted circle), a magnetic field directed toward
the reader was produced in the region between A and $P_1$, the electrons
were deflected upward in a circular path and, with a sufficient field
strength, struck the electrode $P_2$.


Fig. 6.5 (a) Lenard’s apparatus for determining e/m for photoelectrons.
(b) Variation of photoelectric current with cathode potential.

Lenard first investigated the relation between the current reaching
the anode and the potential $V$ applied to C. There was no photoelectric
current when $V$ was several volts positive. But, when $V$ was
dropped to about 2 V positive, a small current was observed. This
indicated that the photoelectrons were not simply freed from the cathode
but that some were ejected with sufficient velocity to enable them to overcome
the retarding potential of 2 V. The current increased when $V$ was reduced
to zero and as $V$ was made negative but attained a saturation value after
$V$ had reached some 15 or 20 V negative. These data are shown diagrammatically
in Fig. 6.5b. $V_0$ is the positive potential that was required
to prevent the escape of electrons.

The determination of e/m was made essentially as follows. Let a
negative potential $V$, large compared with $V_0$, be applied to the cathode
C. The photoelectron, on reaching the anode, will have a kinetic energy
given approximately by


$$eV = \tfrac{1}{2} m v^2$$


where $v$ is its velocity on reaching A (Fig. 6.5a). Assuming that after
leaving A the electron moves in a uniform magnetic field, the circular
path it follows is determined by the equation

$$evB = \frac{m v^2}{R}$$

where $B$ is the magnetic field intensity just necessary to cause the electron
to reach $P_2$, and $R$ is the radius of the corresponding circular path,
determined from the geometry of the apparatus. From the above two
equations we have


$$\frac{e}{m} = \frac{2V}{B^2 R^2} \tag{6.10}$$
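
A minimal numerical sketch of this result follows: eliminating $v$ between the energy relation $eV = \frac{1}{2}mv^2$ and the circular-orbit condition $evB = mv^2/R$ gives $e/m = 2V/(B^2R^2)$. The potential, path radius, and reference value of e/m below are assumed for illustration and are not Lenard's settings or data; the function name is hypothetical.

```python
import math


def e_over_m(potential, b_field, radius):
    """e/m = 2V / (B^2 R^2); SI inputs give C/kg."""
    return 2.0 * potential / (b_field * radius) ** 2


# Assumed illustrative settings (not Lenard's apparatus values)
V = 1000.0                # accelerating potential, volts
R = 0.05                  # radius of the circular path, m
E_OVER_M_REF = 1.759e11   # C/kg, modern value used only to pick a consistent B

# Field that would bend such electrons onto a path of radius R
B = math.sqrt(2.0 * V / E_OVER_M_REF) / R

print(f"B = {B:.3e} Wb/m^2, e/m = {e_over_m(V, B, R):.3e} C/kg")
```

Because $B$ and $R$ enter squared, a 1 percent error in either produces roughly a 2 percent error in e/m, which is one reason the geometry of the apparatus matters so much.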


6.7 What Is the Photoelectric Mechanism? 


Efforts to imagine a mechanism for photoelectric emission on the basis 
of the classical electromagnetic theory of light encountered great diffi- 
culties. It was early assumed that metals contain a number of free 
electrons in the spaces between the atoms and that electric currents are 
due to a drift of these free electrons caused by the electric field. Electro- 
magnetic waves entering a metal would agitate the free electrons, and 
it could readily be imagined that occasionally an electron near the 
surface of the metal would be given enough energy to eject it entirely. 
If photoelectrons originated in this way, however, it would be
expected that they would emerge with smaller kinetic energies when the
incident light is faint than when it is strong, whereas the experimental
fact is that their kinetic energies are independent of the intensity of the
light. On the other hand, their energies do depend on the frequency of
the light.

The latter two experimental facts taken together led at one time 
to the proposal of an alternative hypothesis, viz., that the photoelectrons 
come from inside the atoms, and that their release is of the nature of a 
resonance phenomenon. The light might function as a trigger releasing 
the electron from inside an atom in which it already possessed energy 
of considerable magnitude. Light of a given frequency would then 
release only those electrons which were tuned to that frequency, and the 
atomic mechanism might conceivably be such that the energy of the 
expelled electron would be proportional to the frequency. It was hard 
to believe, however, that the atoms contained resonating systems of all
frequencies for which photoelectric emission was observed. Furthermore,
the fact remained to be explained that the number of the photoelectrons
is proportional to the intensity of the light.

The experimental facts thus seem to demand the conclusion that the 
photoelectric energy comes directly from the energy of the incident light. 
Another difficulty is then encountered, however. It is hard to under- 
stand how so much energy can be absorbed by a single electron. Accord- 
ing to the electromagnetic theory of light, the energy of light waves is 
distributed equally over the wavefront. Let it be assumed, for purposes 
of calculation, that the electron can absorb as much of this energy as 
falls on the area occupied by one atom lying on the surface of the metal. 
Then it can be calculated that the time required for a photoelectron to 
absorb the maximum energy of emission, $\tfrac{1}{2} m v_{\max}^2$, from faint light of
sufficient intensity to cause an easily measurable photoelectric emission 
from sodium would be more than 100 days. The situation is improved 
if the electron is assumed to vibrate inside the atom in exact resonance 
with monochromatic light, since it can be shown that the electron can 
then manage to absorb as much of the incident energy as falls upon a 
considerable fraction of a square wavelength. Even so, however, the 
calculated time exceeds 1 min. Thus, if the electron obtains its energy 
by an ordinary process of absorption, there should be an appreciable lag 
between the beginning of illumination and the start of the photoelectric 
current. Precise measurements showed, however, that, if such a lag 
exists, it is less than $3 \times 10^{-9}$ s.

A satisfactory explanation of the photoelectric mechanism was first
proposed by Einstein in 1905 by his extension to the photoelectric process
of Planck’s concept that interchanges of energy between radiation and
matter take place in energy quanta of magnitude $h\nu$ (Sec. 5.7). Einstein
assumed that a whole quantum $h\nu$ of radiant energy was absorbed by a
single electron. This implies that not only is radiant energy emitted
in quanta of energy $h\nu$ but also that this radiant energy is transmitted
as a sufficiently localized energy packet to be absorbed as a unit. Such a
quantum of electromagnetic radiation is called a photon. Einstein proposed
further that if a minimum energy $\phi_0$ is required to liberate an
electron from the metal, the maximum kinetic energy $K_{\max}$ of photoelectrons
freed by photons of energy $h\nu$ should be given by


$$K_{\max} = h\nu - \phi_0 \tag{6.11}$$


This is the Einstein photoelectric equation. Only qualitative data were
available at that time showing that his equation gave results of the right 
order of magnitude, but subsequently the equation received experimental 
verification (Fig. 6.6). Its validity has been extended to the x-ray region, 
where the frequencies are several thousand times the frequencies of 
visible light. Einstein’s photoelectric equation played an enormous part 
in the development of the modern quantum theory. 
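
As a small worked example of the Einstein relation (maximum kinetic energy equals photon energy minus the minimum liberation energy), the Python sketch below evaluates it for ultraviolet light of wavelength 200 nm on a surface whose minimum liberation energy is assumed to be 3.9 eV. Both inputs are illustrative, the constants are modern values, and the function names are hypothetical.

```python
H = 6.62607015e-34           # Planck constant, J s
C = 2.99792458e8             # speed of light, m/s
E_CHARGE = 1.602176634e-19   # joules per electron volt


def photon_energy_ev(wavelength_m):
    """Photon energy h*nu = h*c/lambda, expressed in eV."""
    return H * C / wavelength_m / E_CHARGE


def k_max_ev(wavelength_m, phi0_ev):
    """Einstein photoelectric equation; zero below the threshold frequency."""
    return max(photon_energy_ev(wavelength_m) - phi0_ev, 0.0)


# Assumed illustrative inputs
print(f"photon energy = {photon_energy_ev(200e-9):.2f} eV")
print(f"K_max         = {k_max_ev(200e-9, 3.9):.2f} eV")
```

Clamping the result at zero reflects the fact that a photon whose energy is below the liberation energy ejects no electron at all, rather than one of negative kinetic energy.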

By regarding light as a rain of corpuscles or photons of energy $h\nu$
and assuming that a whole photon can be absorbed by an electron,
Einstein at once had an explanation of the principal properties of the
photoelectric effect in metals. There could be no observable time lag
in the photoelectric process; the current would be exactly proportional
to the light intensity, i.e., to the number of photons striking per second;
and the velocities of the photoelectrons would be entirely independent
of the light intensity, since the absorption of each photon would be a
process quite independent of the absorption of other photons. The
Einstein equation would follow naturally.

Fig. 6.6 Variation of maximum energy of photoelectrons with frequency of the
exciting radiation. (Axes: kinetic energy versus frequency $\nu$.)

The difficulties with such a radical theory of light, however, are
many. For one thing, if we regard light as a “shower” of photons, what
can possibly be the meaning of frequency? There is nothing periodic
about a falling raindrop, for example. As a matter of fact, in order to
find the frequency of a beam of light, we measure (1) the velocity $c$ of
the light and (2) its wavelength on the assumption that light consists
of waves, and then we compute the frequency as $\nu = c/\lambda$. Thus we rely
on the wave theory of light to give us the energy value $h\nu$ of a photon! And
there still remains the phenomenon of interference, which, since its discovery
by Young in 1802, has defied explanation on any other basis than
by assuming light to have wave properties. However, the experimental
facts of photoelectricity are just as cogent as the phenomena of interference,
and these cannot be explained on the basis of the classical wave theory
of light.

Thus in 1920 the physicist could sum up the situation about like 
this: on one side of a seemingly impenetrable barrier is to be found a 
group of phenomena—such as interference, polarization, indeed, the 
whole electromagnetic theory—according to which he should say that 
light must consist of waves. On the other side of the fence is to be found 
another group—the photoelectric effect, and other phenomena which we 
shall consider in subsequent chapters—according to which he should say 
that light must be corpuscular. The situation thus created was perhaps 
the most puzzling one that has ever arisen in the whole history of physics.

The dilemma arose from the belief of classical physicists that certain
observables were either waves or particles. The abstract idealization of
waves, supported by a well-developed mathematical theory, had been ...
and in Chap. 11 we present ...

A modern point of view
might be expressed thus: there is in nature nothing which in every aspect
can be regarded as either an ideal wave or as an ideal particle; rather,
all the quantities of physics which were classically divided into wave or
particle classes have both wave and particle properties.

The particle and wave aspects of electromagnetic radiation are not
shown simultaneously, nor do they ordinarily appear in the same experiments,
although there are experimental results which can be explained
either in terms of a wave description or of a particle description. In
1928 Bohr suggested that the wave and particle properties of radiation
are complementary and that to explain the results of an experiment we must
choose one interpretation or the other; it is not possible to apply both
descriptions at the same time. This idea is known as the Bohr complementarity
principle. It is applicable to electrons and other classical
particles as well.


6.8 Properties of Photoelectric Emission 


From Einstein’s theory there arise several conclusions which have been 
verified by subsequent experiments. 

1. The photoelectric current is directly proportional to the intensity 
of illumination on the emitting surface so long as there is no change in 
the spectral quality of the incident radiation. Experiments of Elster 
and Geitel in 1916 showed that this proportionality holds rigorously 
over an intensity range of $5 \times 10^7$ to 1. For a single frequency of incident
light the number of photoelectrons emitted is proportional to the
number of photons reaching the surface. 

2. There is essentially no time lag between the arrival of the first 
photons and the emission of the photoelectrons. Measurements of 
Lawrence and Beams in 1927 showed that this lag is less than $3 \times 10^{-9}$ s.

3. The maximum kinetic energy $K_{\max}$ of the photoelectrons ejected
from a surface by a monochromatic beam increases with the frequency
of the incident radiation as predicted by Eq. (6.11). A plot of $K_{\max}$ as
a function of $\nu$ is a straight line with slope $h$ (Fig. 6.6) and an intercept
on the frequency axis of $\nu_0 = \phi_0/h$, as was shown by Millikan in 1916.
Here $\nu_0$ is the photoelectric threshold frequency below which no photoelectrons
are emitted. The critical frequency $\nu_0$ has been found to vary
considerably with the state of the surface, as does the photoelectric
current in general. Usually $\nu_0$ lies in the ultraviolet, but for the alkali
metals and for barium and strontium it lies in the visible region; for
potassium $\nu_0$ lies in the red, and for cesium even in the infrared.

4. Most of the photoelectrons have kinetic energies less than $K_{\max}$.
Figure 6.7 shows data on the energy distribution of photoelectrons from
aluminum published by Richardson and Compton in 1912. They used
a small photoemitter located at the center of a spherical conductor, to
which they applied the retarding potential $V$ and measured the photocurrent
as a function of retarding potential for several frequencies in
the ultraviolet. For $\lambda = 2 \times 10^{-7}$ m ($\nu = 1.5 \times 10^{15}$ s$^{-1}$) no photoelectrons
reach the spherical conductor when the retarding potential
exceeds 2.3 V, and so $K_{\max}$ for this frequency is 2.3 eV.

Fig. 6.7 (a) Variation of photoelectric current
with retarding potential. (b) Energy distribution
of photoelectrons.


5. Variation of temperature usually has little or no effect upon the
photoelectric current, so long as the temperature does not exceed several 
hundred degrees Celsius and so long as no change occurs in the crystal 
structure or in the physical state of the metal. The alkalis, however, 
form an exception to this rule. 

6. If polarized light is used, differences are commonly found as the 
plane of polarization is rotated, except, of course, at normal incidence. 
The effect is complicated, and its cause is in doubt. Especially interesting
is the selective effect in the alkali metals. Over certain ranges of
wavelength, the photoelectric current from these metals is much greater 
when the electric vector in the light has a component perpendicular to 
the surface than when it is parallel to the surface; in the case of certain 
sodium-potassium alloys, the ratio of the currents in the two cases may 
be 10:1 or 20:1 or even more. 
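
Property 3 in the list above means that two stopping-potential measurements at different wavelengths suffice to estimate both $h$ and the threshold frequency. The Python sketch below uses the cesium figures quoted in Problem 5 at the end of this chapter (wavelengths 4358 and 5461 Å), reading the stopping potentials as 0.95 and 0.38 V. The function name is hypothetical, and a two-point "fit" is of course the crudest possible straight line.

```python
C = 2.99792458e8             # speed of light, m/s
E_CHARGE = 1.602176634e-19   # elementary charge, C


def line_through_two_points(lam1, vs1, lam2, vs2):
    """Return (h, nu0, phi0_eV) from two (wavelength, stopping potential) pairs.

    Uses e*Vs = h*nu - phi0, so the slope of e*Vs against nu is h and the
    intercept on the frequency axis is nu0 = phi0/h.
    """
    nu1, nu2 = C / lam1, C / lam2
    h = E_CHARGE * (vs1 - vs2) / (nu1 - nu2)
    nu0 = nu1 - E_CHARGE * vs1 / h
    phi0_ev = h * nu0 / E_CHARGE
    return h, nu0, phi0_ev


h_est, nu0_est, phi0_est = line_through_two_points(4358e-10, 0.95, 5461e-10, 0.38)
print(f"h = {h_est:.3e} J s, nu0 = {nu0_est:.3e} Hz, phi0 = {phi0_est:.2f} eV")
```

With more than two wavelengths a least-squares line would be the natural extension; the slope would still estimate $h$ and the frequency-axis intercept the threshold.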


6.9 Thermionic Emission 


The thermionic current from an emitter increases very rapidly with
increasing temperature. It depends also upon geometrical factors, which
determine space-charge effects, and upon the potential of the emitter
relative to its surroundings. From a given emitter operating at a given
temperature, however, the thermionic current cannot be made to exceed
a certain limiting value $I$, given by (Sec. 21.8)


$$I = A T^2 e^{-\phi/kT} \tag{6.12}$$


in which $\phi$ and $A$ are constants depending on the emitting substance
and on the state of its surface.

The quantity $\phi$ represents an energy; in the theoretical derivation
it represents the heat of emission of the electrons (or thermions) at the
absolute zero of temperature, i.e., the amount of energy required to
extract an electron at 0°K. As calculated from thermionic data, $\phi$
ranges from 2 to 6 eV; $\phi$ is called the work function.
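
To give a feeling for how rapidly the limiting current rises with temperature, the sketch below assumes that Eq. (6.12) has the Richardson-Dushman form $I = AT^2 e^{-\phi/kT}$ and compares two temperatures for a work function of 4.52 eV. The temperatures are arbitrary illustrative choices and the function name is hypothetical; the constant $A$ cancels in the ratio.

```python
import math

K_B_EV = 8.617333e-5   # Boltzmann constant, eV/K


def emission_ratio(t1, t2, phi_ev):
    """I(t2)/I(t1) for I = A T^2 exp(-phi/kT); the constant A cancels."""
    return (t2 / t1) ** 2 * math.exp(-phi_ev / K_B_EV * (1.0 / t2 - 1.0 / t1))


# Assumed illustrative case: phi = 4.52 eV, heated from 2000 K to 2200 K
ratio = emission_ratio(2000.0, 2200.0, 4.52)
print(f"current ratio I(2200 K)/I(2000 K) = {ratio:.1f}")
```

A 10 percent rise in absolute temperature thus multiplies the limiting current by more than a factor of 10 under these assumptions; nearly all of that comes from the exponential, not from the $T^2$ factor.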

It was early assumed that thermions and photoelectrons come from 
the same source inside the metal, the two types of emission differing 
chiefly in the mechanism by which an electron acquires sufficient energy 
to enable it to escape from the emitter. On this view, the quantity $\phi$
in the thermionic equation ought to be at least approximately the same
as $\phi_0 = h\nu_0$ in the Einstein photoelectric equation.

It has not been easy to check this equation by experiment, owing to 
the many spurious disturbances inherent in both photoelectric and 
thermionic research. Reasonably comparable measurements have, how- 
ever, been accomplished on a number of the more stable metals; a few 
values are as follows: 


Metal
Platinum    6.27
Tungsten    4.52
Silver
Gold


The agreement between the values of the work function determined 
photoelectrically and those determined thermionically warrants the 
hypothesis that the photoelectrons and the thermionic electrons have a 
common origin. 


Problems 


1. An electron is accelerated through a potential difference $V$. It then
passes between two long parallel plates with electric intensity $E$ between
them. Perpendicular to $E$ is a magnetic induction $B$ of strength such
that the electron is not deflected as it passes between the plates. Show
that $e/m = E^2/2VB^2$.
2. Electrons with a maximum kinetic energy of 2.3 eV are ejected from
a surface by ultraviolet photons with $\lambda = 200$ mμ. Calculate the energy
of the incident photons in joules and electron volts. What is the smallest
energy which can free an electron from the surface? What is the wavelength
of a photon with this energy?
Ans: $9.9 \times 10^{-19}$ J; 6.2 eV; 3.9 eV; 320 mμ
3. Find the maximum kinetic energy of photoelectrons ejected from a
potassium surface, for which $\phi_0 = 2.1$ eV, by photons of wavelengths
2000 and 5000 Å. What is the threshold frequency $\nu_0$ and the corresponding
wavelength?
Ans: 4.1 eV, 0.4 eV; $5.08 \times 10^{14}$ Hz, 5900 Å
4. The photocurrent of a cell can be cut to zero by a minimum retarding
potential of 2 V when monochromatic radiation of $\lambda = 250$ mμ is
incident. Find the work function, the stopping potential for $\lambda = 200$ mμ,
and the threshold wavelength.
5. An experiment on the photoelectric effect of cesium gives the results
that the stopping potentials for $\lambda = 4358$ and 5461 Å are respectively
0.95 and 0.38 V. From these data, find $h$, the threshold frequency,
and the work function of cesium.

Ans: $4.6 \times 10^{14}$ Hz; 1.9 eV
6. An experiment on the threshold wavelength for photoelectric emission
from tungsten yields the value 270 mμ. Find the photoelectric
work function in joules and in electron volts. Compute the maximum
energy of photoelectrons ejected by photons of $\lambda = 200$ mμ.

Ans: $7.34 \times 10^{-19}$ J; 4.59 eV; 1.61 eV
7. An electron with kinetic energy $K$ enters a uniform magnetic field
of induction $B$ which is perpendicular to the velocity of the electron. If
$B$ is 0.005 Wb/m$^2$, find the radius of the electron’s path when $K$ is
(a) 200 eV, (b) 200 keV, (c) 200 MeV.

Ans: (a) 9.54 mm; (b) 33 cm; (c) 134 m
8. Compute the separation in angstrom units of the outer two lines
of a normal Zeeman pattern for spectral lines of 3000 and 6000 Å wavelength
in a magnetic induction of 1.8 Wb/m$^2$.
9. A cathode-ray tube deflects 1500-eV electrons by use of a magnetic
field which may be taken as uniform over a distance of 2 cm and zero
elsewhere. If the fluorescent screen is 25 cm beyond the deflecting coils,
find the beam deflection on the screen for $B = 0.002$ Wb/m$^2$.

Ans: 8 cm
10. An electron is projected along the axis of a cathode-ray tube
midway between two parallel plates with a speed of $1.5 \times 10^7$ m/s.
The plates are 1.2 cm apart, 3 cm long, and have a potential difference
of 120 V between them. Find (a) the angle with the axis at which the
electron leaves the plates, (b) the distance of the electron from the axis
as it leaves the plate, and (c) the distance from the axis at which it
strikes a fluorescent screen 20 cm beyond the plates.

Ans: (a) 0.23 rad; (b) 0.35 cm; (c) 5.0 cm
11. A cathode-ray tube has electrostatic deflection plates 2 cm square, 
separated by 0.5 cm and 25 cm from the fluorescent screen. If the 
electrons have 1500 eV of kinetic energy when they pass between the 
plates, calculate the approximate beam deflection on the screen if 20 V 
is applied between deflector plates. 


Ans: 7 cm 
12. Show that the path of an electron after it has passed between the
plates of Fig. 6.4 is a straight line which, if extended back, originates at
the midpoint of the volume between the plates as though the incident
electron went undeflected to that point and then changed direction
discontinuously.
13. A photon with 13,600 eV of energy interacts with a hydrogen atom 
at rest and ejects the electron photoelectrically in the direction in which 
the photon was traveling. If 13.6 eV is required to eject the electron, 
find the speed of the photoelectron and the momentum and energy of 
the recoiling proton. 

Ans: $6.7 \times 10^7$ m/s; $6.3 \times 10^{-23}$ kg-m/s; 7.4 eV
14. In an apparatus designed by Busch in 1922 for measuring e/m,
electrons fall through a potential difference $V$ and pass through a conical
aperture of half-angle $\phi$. They enter a uniform axial magnetic field $B$
supplied by a solenoid with their velocity vectors making an angle $\phi$
with the axis and spiral down the tube. If $B$ is increased from zero,
there come values of $B$ at which the electrons refocus at the axis of the
screen S. Show that the electrons are first focused on S when
$e/m = (8\pi^2 V \cos^2 \phi)/B^2L^2$. (See Fig. 6.8.)

Fig. 6.8 Busch’s arrangement for measuring e/m.

15. Prove that a free electron cannot absorb a photon photoelectrically 
by showing that the conservation of energy and the conservation of 
momentum cannot be satisfied simultaneously in such a process. 


16. (a) In Thomson’s experimental arrangement for finding e/m a
narrow beam of electrons of velocity $v$ passes between parallel plates of
length $l$ and separation $d$. If a potential difference $V$ is applied to the
plates, show that the impact point of the beam on a fluorescent screen
a distance $D$ beyond the plates moves through a distance

$$s = \frac{e}{m} \frac{V l}{d v^2} \left( D + \frac{l}{2} \right)$$

(b) Prove that the deflection of the beam is the same as though
the electrons suffered a single abrupt deflection through an angle
$\theta = s/(D + l/2)$ a distance $D + l/2$ from the screen (Fig. 6.4b).
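
As a numerical cross-check of the electrostatic-deflection relation in Problem 16, the sketch below compares the closed form $s = (e/m)(Vl/dv^2)(D + l/2)$ with a step-by-step calculation (uniform acceleration between the plates, then straight-line flight to the screen), using the plate dimensions and speed given in Problem 10. It is not a substitute for the requested proofs; the function names are hypothetical and the value of e/m is a modern one.

```python
E_OVER_M = 1.759e11   # C/kg, modern value (assumed for this check)


def deflection_closed_form(v_plates, length, sep, speed, dist, e_over_m=E_OVER_M):
    """s = (e/m) * V*l / (d*v^2) * (D + l/2)."""
    return e_over_m * v_plates * length / (sep * speed**2) * (dist + length / 2.0)


def deflection_stepwise(v_plates, length, sep, speed, dist, e_over_m=E_OVER_M):
    """Same quantity from kinematics: parabola in the plates, then a straight line."""
    accel = e_over_m * v_plates / sep       # transverse acceleration, m/s^2
    t_plates = length / speed               # time spent between the plates, s
    y_exit = 0.5 * accel * t_plates**2      # offset on leaving the plates
    v_y = accel * t_plates                  # transverse speed on leaving
    return y_exit + v_y * dist / speed      # add the drift to the screen


# Values taken from Problem 10 (SI units)
args = dict(v_plates=120.0, length=0.03, sep=0.012, speed=1.5e7, dist=0.20)
s_closed = deflection_closed_form(**args)
s_step = deflection_stepwise(**args)
print(f"closed form: {100 * s_closed:.2f} cm, stepwise: {100 * s_step:.2f} cm")
```

Both routes agree and reproduce the answer of about 5.0 cm quoted for part (c) of Problem 10.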




chapter seven 


X-rays 


Probably no subject in all science illustrates better than x-rays the
importance to the entire world of research in pure science. Within 3 months
after Röntgen’s fortuitous discovery, x-rays were being put to practical use
in a hospital in Vienna in connection with surgical operations. The use
of this new aid to surgery soon spread rapidly. Since Röntgen’s time,
x-rays have completely revolutionized certain phases of medical practice.
However, had Röntgen deliberately set about to discover some means of
assisting surgeons in reducing fractures, it is almost certain that he would
never have been working with the evacuated tubes, induction coils, and the
like, which led to his famous discovery.

In many other fields of applied science, both biological and physical,
important uses have been found for x-rays. Transcending these uses in
applied science are the applications of x-rays to such problems as the atomic
and the molecular structure of matter and the mechanism of the interaction of
radiation with matter. X-rays provide us with a kind of supermicroscope,
by means of which we can “see” not only atoms and their arrangement in
crystals but also even the interior of the atom itself. Röntgen’s discovery
must be ranked among the most important.


7.1 The Discovery of X-rays 


In the autumn of 1895, Wilhelm Konrad Röntgen, professor of physics
at Würzburg, was studying the discharge of electricity through rarefied
gases. A large induction coil was connected to a rather highly evacuated 
tube (Fig. 7.1), the cathode C being at one end and the anode A at the 
side. The tube was covered “with a somewhat closely fitting mantle of 
thin black cardboard.” With the apparatus in a completely darkened 
room, he made the accidental observation that “a paper screen washed 
with barium-platino-cyanide lights up brilliantly and fluoresces equally 
well whether the treated side or the other be turned toward the discharge 
tube.” The fluorescence was observable 2 m away from the apparatus. 
Röntgen soon convinced himself that the agency which caused the 
fluorescence originated at that point in the discharge tube where the 
glass walls were struck by the cathode rays. 

Fig. 7.1 Type of tube with which Röntgen discovered x-rays.

Realizing the importance of his discovery, Röntgen at once proceeded
to study the properties of these new rays, the unknown nature
of which he indicated by calling them x-rays. In his first communications
he recorded, among others, the following observations:


1. All substances are more or less transparent to x-rays. For example, wood
2 to 3 cm thick is very transparent. Aluminum 15 mm thick “weakens the
effect considerably, though it does not entirely destroy the fluorescence.”
Lead glass is quite opaque, but other glass of the same thickness is much
more transparent. “If the hand is held between the discharge tube and the
screen the dark shadow of the bones is visible within the slightly dark shadow
of the hand.”

2. Many other substances besides barium-platino-cyanide fluoresce—calcium 
compounds, uranium glass, rock salt, etc. 


3. Photographic plates and films “show themselves susceptible to x-rays.”
Hence, photography provides a valuable method of studying the effects of 
x-rays. 

4. X-rays are neither reflected nor refracted (so far as Röntgen could discover). 
Hence, “x-rays cannot be concentrated by lenses.” 


5. Unlike cathode rays, x-rays are not deflected by a magnetic field. They 
travel in straight lines, as Röntgen showed by means of pinhole photographs.

6. X-rays discharge electrified bodies, whether the electrification is positive
or negative. 

7. X-rays are generated when the cathode rays of the discharge tube strike 
any solid body. A heavier element, such as platinum, however, is much 
more efficient as a generator of x-rays than a lighter element, such as aluminum. 


It is a tribute to Röntgen’s masterly thoroughness that most of the
basic properties of x-rays were described in the paper first announcing 
the discovery. Intense interest was aroused, and work on x-rays began 
at once in many laboratories both in America and in Europe. This
early work is beautifully illustrative of the qualitative phase of develop- 
ment of a typical field of physics. 


7.2 Production and Detection of X-rays 


Until 1913, tubes for the production of x-rays were similar to a form 
suggested by Röntgen. A residual gas pressure of the order of 10-? mm 
Hg provides, when voltage is applied, a few electrons and positive ions. 
These positive ions, bombarding the cathode, release electrons, which, 
hurled against the anode, give rise to x-rays. A curved cathode converges
the electrons into a focal spot of desired shape and size. In this type 
of tube, known as the gas tube, the anode current, applied voltage, and 
gas pressure are more or less interdependent, and it is essential that the 
gas pressure be maintained at the desired value. Various ingenious 
devices were introduced for accomplishing this. In 1913, however, an 
important improvement was introduced by Coolidge. He evacuated the 
tube to the highest attainable vacuum and incorporated a hot spiral 
filament of tungsten to serve as a source of electrons. The filament was 
heated by an adjustable current from a battery. Thus the electron 
current could be controlled independently of the applied voltage. 


Fig. 7.2 An ionization chamber for measuring the intensity of a beam of 
x-rays or other ionizing radiation. 


For quantitative measurements, the ionization method was early 
adopted. The discharging effect of x-rays upon charged bodies was 
traced to ionization of the molecules of the surrounding gas. The effect 
was found to increase rapidly with density and also to depend on the 
nature of the gas, the following being increasingly active in the order: 
H$_2$, CO, air, CO$_2$, ether vapor, CS$_2$. At first the rate of discharge of an
electroscope was used in measuring the intensity of an x-ray beam,
but later an ionization chamber was introduced. This is shown schematically
in Fig. 7.2. C is a metal tube several centimeters in diameter,
from about 20 to 100 cm long, and closed at both ends except for an 
opening or window W, over which may be placed a thin sheet of cello- 
phane or aluminum for admitting the x-rays. A rod rr suitably supported 
by good insulating material, such as amber or quartz, is connected to an 
electrometer or a vacuum-tube amplifier. An electric field is maintained 
between the rod rr and the cylinder C by a battery B. The chamber 
may be filled with a heavy gas to make the arrangement more sensitive; 
argon or methyl bromide is often used. When x-rays enter the window 
W, the gas within the cylinder is made conducting, and because of the 
electric field between the cylinder and the rod, the latter acquires a 
charge at a rate which is a measure of the intensity of the x-ray beam. 
Nowadays a Geiger counter or a scintillation detector with appropriate 
electronic circuits may replace the ionization chamber. 


7.3 Wavelengths of X-rays 


Although the hypothesis that x-rays are a form of electromagnetic radia- 
tion was early suggested, it proved to be difficult to establish. Evidence 
for the diffraction of x-rays was reported in 1899 by Haga and Wind, 
who concluded that the wavelengths of x-rays must be of the order of 
10⁻¹⁰ m, but their experiments were regarded by many as inconclusive. 
They used wedge-shaped slits only a few microns wide and observed a 
slight broadening of the image on a photographic plate. The first 
practical method for resolving x-ray beams according to wavelength 
developed out of a brilliant suggestion by Laue. The order of magnitude 
of x-ray wavelengths, as revealed by the diffraction experiment described 
above, is the same as the order of magnitude of the spacing of the atoms 
in crystals. Laue suggested, therefore, that a crystal, with its regular 
array of atoms, might behave toward a beam of x-rays in somewhat the 
same way as a ruled diffraction grating behaves toward a beam of 
ordinary light. Assume that plane electromagnetic waves traveling in 
a given direction fall upon a crystal. Each atom scatters some of the 
incident radiation. The wavelets scattered by different atoms combine, 
in general, in all sorts of phases and so destroy each other by inter- 
ference. Laue argued, however, that for certain wavelengths and in 
certain directions the wavelets should combine in phase and so produce 
a strong diffracted beam. It would be expected, therefore, that such 
diffracted beams might be observed upon passing a heterogeneous x-ray 
beam through a crystal. 

Such an experiment was performed by Friedrich and Knipping in 
1912. A narrow pencil of x-rays was allowed to pass through a crystal 
beyond which was a photographic plate. After an exposure of many 
hours, it was found on developing the plate that, in addition to the 
intense central image where the direct beam struck the plate, there were 
present on the plate many fainter but regularly arranged spots, indicating 
that the incident x-ray beam had been diffracted by the crystal in certain 
special directions, just as Laue had predicted (Fig. 7.3). In their 
original paper, Friedrich, Knipping, and Laue, from an analysis of a 
series of photographs of a crystal of zinc blende oriented at various angles 
with respect to the incident pencil, concluded that there were present 
in the x-ray beam wavelengths varying between 1.27 and 4.83 × 10⁻¹¹ m. 
This result supported the two postulates underlying the experiment: 
(1) that x-rays are electromagnetic waves of definite wavelengths, and 
(2) that the atoms of a crystal are arranged in regular three-dimensional 
order, as suggested by the external symmetry of crystals. 


Fig. 7.3 Laue photograph of a tungsten crystal. 

This experiment marked the beginning of a new era in x-rays. 
Two new fields of investigation were at once opened up: (1) in x-rays, 
the study of spectra and the use of homogeneous beams in experiments on 
scattering, absorption, etc.; (2) the study of the arrangements of atoms 
or molecules in crystals. In the following sections, we shall confine our 
discussion to some of the more important aspects of the former field. 


7.4 Bragg’s Law 


Immediately following the announcement by Friedrich, Knipping, and 
Laue of their successful experiment many investigators took up the study 
of the new phenomenon. A simple way of looking at the process of 
diffraction by a crystal grating was proposed by W. L. Bragg. He 
pointed out that through any crystal a set of equidistant parallel planes 
can be drawn which, among them, pass through all the atoms of the 
crystal. Indeed, a great many such families of planes can be drawn, 
the planes of each family being separated from each other by a 
characteristic distance. Such planes are called Bragg planes, and their 
separations, Bragg spacings. Traces of five families of Bragg planes 
are shown in Fig. 7.4. 

If plane monochromatic waves fall upon the atoms in a Bragg plane, 
a wavelet of scattered radiation spreads out from each atom in all direc- 
tions. There is just one direction in which, irrespective of the atomic 
distribution in the plane, the scattered wavelets meet in the same phase 
and constructively interfere with each other, viz., the direction of 
specular reflection¹ from the plane for which there is no path difference for 
radiation scattered by atoms in the plane. This follows from the ordinary 
Huygens construction as used for the reflection from a mirror. The 


1 For a regular array of atoms in one plane there are also other directions for 
which constructive interference could occur, but for these directions there is 
destructive interference of the wavelets from the set of parallel planes. 

Fig. 7.4 Reflection of monochromatic beams of x-rays by a NaCl crystal when 
heterochromatic radiation is incident. 


beam scattered in this direction may be thought of as “reflected” from 
the Bragg plane. But each Bragg plane is one of many regularly spaced, 
parallel planes; the beams reflected from these various parallel planes 
combine, in general, in different phases and so destroy each other by 
interference. Only if certain conditions of wavelength and angle of 
incidence of the beam on the planes are satisfied do the waves from 
different planes combine in the same phase and reinforce each other. 
In Fig. 7.5 the horizontal lines represent the traces of two successive 
Bragg planes spaced d apart. Consider a plane wave incident on these 
planes with θ the angle between the propagation direction of the incident 
beam and the planes; θ is known as the glancing angle, and the reflection 
condition requires that θ = θ′. For radiation scattered by the second 
plane to interfere constructively with that scattered by the first, the 
path difference AOB, which equals 2d sin θ, must be an integer number 
of wavelengths, whence 


$$n\lambda = 2d\sin\theta \tag{7.1}$$


where n is an integer called the order of the reflection. Equation (7.1) 
together with the requirement that θ = θ′ constitutes Bragg’s law for 
x-ray “reflection.” 


Fig. 7.5 Constructive interference of the waves scattered from atoms of a 
crystal occurs when nλ = 2d sin θ. 

Suppose, now, that a parallel wave train containing a continuous 
spectrum of wavelengths is incident upon a crystal, as represented by the 
parallel arrows a, b, c, d in Fig. 7.4. In the figure, traces of five families 
of Bragg planes are shown, numbered 1, 2, 3, 4, 5, with their characteristic 
spacings d₁, d₂, . . . . Many other families of planes might be imagined, 
some perpendicular and some not perpendicular to the plane of the paper. 
Suppose that in the incident beam there is a wavelength λ₂ such that 
nλ₂ = 2d₂ sin θ₂, where n is an integer, d₂ is the distance between the set 
of planes numbered 2, and θ₂ is the glancing angle between the direction 
of the incident radiation and these planes. Then there will be reflected 
from this group of planes a beam A, of wavelength λ₂, which will proceed 
in the direction of the arrow A. Similarly, we may have reflected beams 
B, C, D, . . . in different directions in the plane of the paper, and also 
many other beams reflected from other families of planes in directions 
not in the plane of the paper. Each Laue spot in the experiment of 
Friedrich and Knipping may be interpreted as produced by such a 
reflected beam. In general, the most intense spots correspond to 
reflections from Bragg planes containing the greatest number of atoms on 
each plane. 


7.5 The X-ray Spectrometer 


W. H. Bragg and his son W. L. Bragg were responsible for the early 
development of the x-ray spectrometer, shown in Fig. 7.6a. X-rays 
from target T pass through two narrow slits S₁ and S₂ and fall at glancing 
angle θ on the cleavage face of a crystal K (rock salt, calcite, mica, 
gypsum, quartz, etc.) mounted on a table D, the angular position of 
which can be read. The reflected beam, which makes an angle 2θ with 
the incident beam, enters an ionization chamber C by means of which 
the intensity can be measured. 

For photographic registration, the ionization chamber can be 
replaced by a photographic plate PP (Fig. 7.6b). With the crystal set 
at a glancing angle θ, the reflected beam will strike the plate at L (or 
at L’, if the crystal is reversed). From the position O at which the direct 
beam strikes the plate, the distances OL and OA, and hence the angle 
2θ, can be determined. 

Fig. 7.6 An x-ray spectrometer using (a) ionization detection and 
(b) photographic detection. 


The distance d between the reflecting planes of a crystal such as 
NaCl is determined as follows. In the rock-salt crystal, Na and Cl ions 
occupy alternate positions in a cubic lattice similar to that in Fig. 7.4, 
which represents one plane of atoms. Taking the atomic weight of Cl 
as 35.45 and of Na as 22.99, we find the molecular weight of NaCl to 
be 58.44. Therefore, 58.44 kg (1 kmole) of NaCl contains 2N_A atoms, 
where N_A is Avogadro’s number. Thus we find for the number of atoms n in 
1 m³ of rock salt n = 2N_Aρ/58.44, where ρ = 2164 kg/m³ is the density of 
crystalline NaCl. If d is the distance between the center of one atom and 
the next along the edge of the cube, 1/d is the number of atoms 
in a row of atoms 1 m long, and n = 1/d³. From these two equations 
d = 2.82 × 10⁻¹⁰ m = 2.82 Å. 
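
The estimate of d can be checked numerically. The following is a minimal sketch, assuming the values quoted in the text (Avogadro's number taken per kilomole, as in the later carbon calculation); the function and variable names are illustrative and not from the original.

```python
# Values quoted in the text (kilomole-based units).
N_A = 6.02e26       # Avogadro's number, molecules per kmole
M_NACL = 58.44      # molecular weight of NaCl, kg/kmole
RHO_NACL = 2164.0   # density of crystalline NaCl, kg/m^3


def cubic_spacing(n_avogadro, molar_mass, density, atoms_per_molecule=2):
    """Nearest-neighbor spacing d (m) for atoms on a simple cubic array.

    n = atoms_per_molecule * N_A * rho / M is the number of atoms per m^3,
    and n = 1/d^3, so d = n^(-1/3).
    """
    atoms_per_m3 = atoms_per_molecule * n_avogadro * density / molar_mass
    return atoms_per_m3 ** (-1.0 / 3.0)


d_nacl = cubic_spacing(N_A, M_NACL, RHO_NACL)
print(f"d = {d_nacl:.3e} m = {d_nacl * 1e10:.2f} angstrom")
```

The result agrees with the 2.82 Å quoted above to the precision of the input constants.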
By using crystal gratings, x-ray wavelengths can be compared with 
a precision of a few parts in 10⁵, which is more precisely than Avogadro's 
number and the density and molecular weight of a typical crystal are 
known. Therefore, x-ray wavelengths have often been reported in terms 
of X units (XU), defined by taking the grating space of calcite at 18°C to be 
3029.45 XU. It is now known that the X unit is about 0.202 percent greater 
than 10⁻¹³ m (compare the two columns of Table 7.1). Table 7.1 lists some crystals commonly used in x-ray 


spectroscopy together with their grating spaces and the linear expansion 
per degree Celsius. 


Table 7.1 Grating spaces of some crystals used in x-ray spectroscopy 


                        Grating space d at 18°C    Change in d per 
Crystal                 XU          10⁻¹³ m        degree Celsius, 10⁻¹³ m 

Rock salt, NaCl         2814.00     2819.68        0.11 
Calcite, CaCO₃          3029.45     3035.57        0.03 
Quartz, SiO₂            4246.02     4254.60        0.04 
Gypsum, CaSO₄·2H₂O      7584.70     7600.0         0.29 
Mica                    9942.72     9962.8         0.15 


7.6 Monochromatic Characteristic Radiation 


With a beam of x-rays from a platinum target incident on the cleavage 
face of a rock-salt crystal, W. H. Bragg rotated the crystal in steps of 
Δθ and the ionization chamber in steps of 2Δθ. He plotted the curve 
of ionization current against glancing angle θ and found that the x-ray 
intensity did not vary uniformly with angle but rose at certain angles 
to a sharp maximum. A curve similar to that shown in Fig. 7.7 was 
obtained. A group of three maxima, a₁, b₁, and c₁, was observed at 
the respective angles θ of 9.9, 11.6, and 13.6°. A second group of three 
maxima, a₂, b₂, and c₂, was observed at approximately double these 
angles. Bragg interpreted the maxima a₂, b₂, and c₂ as second-order 
reflections of the lines a₁, b₁, and c₁. He computed their wavelengths, 
taking n = 1 for the lines a₁, b₁, and c₁ and n = 2 for the second-order 
lines a₂, b₂, and c₂, and obtained 0.97, 1.13, 1.32 Å, respectively. 


Fig. 7.7 Bragg’s curve for the energy distribution in an x-ray spectrum, 
showing the characteristic lines a, b, c. 
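
Bragg's wavelengths follow directly from Eq. (7.1). The sketch below assumes the rock-salt spacing d = 2.82 Å derived in Sec. 7.5 and the first-order glancing angles quoted above; the function names are illustrative only.

```python
import math

D_NACL = 2.82  # grating space of rock salt in angstroms (value from Sec. 7.5)


def bragg_wavelength(theta_deg, d=D_NACL, order=1):
    """Wavelength (angstroms) reflected at glancing angle theta: n*lambda = 2 d sin(theta)."""
    return 2.0 * d * math.sin(math.radians(theta_deg)) / order


def bragg_angle(wavelength, d=D_NACL, order=1):
    """Glancing angle (degrees) for a given wavelength and order, or None if no reflection exists."""
    s = order * wavelength / (2.0 * d)
    if s > 1.0:
        return None  # n*lambda exceeds 2d, so this order cannot be reflected
    return math.degrees(math.asin(s))


first_order_angles = {"a": 9.9, "b": 11.6, "c": 13.6}
wavelengths = {name: bragg_wavelength(angle) for name, angle in first_order_angles.items()}

for name, lam in wavelengths.items():
    second = bragg_angle(lam, order=2)
    print(f"line {name}: lambda = {lam:.3f} A, second-order angle = {second:.1f} deg")
```

The second-order angles come out close to, but not exactly, twice the first-order ones, because it is sin θ rather than θ that doubles.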

Curves similar to Fig. 7.7 were obtained with other crystals, the 
only difference being that the maxima occurred at different glancing 
angles, indicating that each crystal had a characteristic grating space d. 
Bragg convinced himself, however, that these respective maxima for 
different crystals always represented the same monochromatic radiation, 
since, for example, the absorption in aluminum of peak b₁ was always the 
same, whatever the crystal used. In short, the peaks of the curve in 
Fig. 7.7 represent spectral lines the wavelengths of which are characteristic 
of the target emitting the rays. These monochromatic lines are 
superimposed on a continuous spectrum represented by the partially dotted 
line in the figure. Curves of the type shown in Fig. 7.7, therefore, 
represent the distribution of energy in the x-ray spectrum, continuous and 
characteristic combined, of an element. 

The development of the Bragg spectrometer made it possible to 
make precise measurements of x-ray wavelengths. In 1913 Moseley 
undertook a systematic study of x-ray spectra emitted by targets of a 
number of elements of medium and high atomic number. His results 
were soon supplemented by data from other observers. X-ray spectra 
of a number of different elements are shown in Fig. 7.8. For each 
element the characteristic spectrum has a few lines at a relatively short 
wavelength, then an extended region in which there are no x-ray lines, 
followed in turn by a region in which there are a large number of lines, 
and so forth. The shortest wavelength and most penetrating lines 
emitted by an element are grouped in a narrow wavelength interval and 
are called the K lines; those in the next longer wavelength region are 
the L lines; for heavier elements there are groups at still longer 
wavelengths, the M and N lines. 


Fig. 7.8 Wavelengths of some of the more prominent characteristic lines; 
only a small fraction of L and M lines are shown. (Axes: atomic number Z 
against wavelength in Å.) 


Fig. 7.9 Schematic arrangement for studying 
secondary radiation. Aluminum absorbing foils 
were inserted at A or at B. 


emitted as secondary radiation its own characteristic x-rays. The 
fluorescent rays emitted at right angles to the primary beam were 
measured by an ionization chamber. The absorption coefficient of 
the radiation from the secondary emitter was determined by placing 
aluminum foils between the secondary emitter and the ionization chamber. A large 
number of materials were used as the secondary radiator; Barkla found 
that the greater the atomic weight, the more penetrating this secondary 
radiation. Even when the hardness of the primary beam was varied 
over a fairly broad range, the secondary radiation from a given material 
remained unchanged. However, when the voltage across the x-ray 
tube was reduced sufficiently, there came a point at which the 
characteristic radiation emitted by a radiator of high atomic number 
became much softer and more readily absorbed. With a further 
reduction in the hardness of the primary beam, the secondary radiation 
continued to have the same absorption coefficient. Thus, Barkla 
established the fact that secondary emitters of fairly high atomic 
number have characteristic radiation in at least two wavelength 
regions. He called the more penetrating rays the K radiation and the 
softer L radiation. 


7.7 Moseley’s Law 


The K series of each element, as photographed by Moseley, appeared 
as two lines, the stronger of which was named Kα and the weaker Kβ. 
(Later measurements have shown that the Kα line is actually a doublet 
and that the line originally known as Kβ is also complex.) For any 
given element the Kα line always has a longer wavelength than the Kβ 
line. Moseley found, as Fig. 7.8 shows, that the wavelength of a given 
spectral line decreases as the atomic number of the target is increased. 
There is no indication of the periodicity so characteristic of many atomic 
properties. 
Moseley discovered that the square root of the frequency of the 
Kα line was nearly proportional to the atomic number of the emitter. 
Indeed, a plot of the square root of the frequency of almost any given 
x-ray line as a function of atomic number yields nearly a straight line. 
(Precise measurements have shown that the lines have a slight upward 
concavity.) Figure 7.10 shows the plot of the square root of the energies 
of the Kα and Kβ lines as a function of atomic number. A graph in 


which the square root of the frequency (usually multiplied by a constant) 
of an x-ray line is plotted as a function of atomic number is called a 
Moseley plot. 

The frequency of the Kα line for an element of atomic number Z 
can be represented very closely by the relation 


$$\nu = 0.248 \times 10^{16}\,(Z - 1)^2\ \text{Hz} \tag{7.2}$$


Fig. 7.10 Moseley plot: square root of frequency (× 10⁻⁸) of the K 
lines against atomic number Z or atomic weight (Curve A). 


For other lines it was found that an equation of the form 

$$\nu = A(Z - \sigma)^2 \tag{7.2a}$$


gave reasonably good fits to experimental measurements. Here A and 
σ are constants for a given line. An explanation of Moseley’s empirical 
relations was almost immediately available in terms of a theory developed 
by Bohr (Sec. 9.4). 
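
As a rough numerical illustration of Moseley's relation, the sketch below evaluates the Kα frequency and wavelength for two common target elements. It assumes the coefficient in Eq. (7.2) is 0.248 × 10¹⁶ Hz, which is close to the value (3/4)cR expected from Bohr's theory; the function names are illustrative.

```python
import math

C = 2.998e8            # speed of light, m/s
MOSELEY_A = 0.248e16   # assumed coefficient of Eq. (7.2), Hz


def k_alpha_frequency(z):
    """K-alpha frequency (Hz) from nu = A (Z - 1)^2."""
    return MOSELEY_A * (z - 1) ** 2


def k_alpha_wavelength_angstrom(z):
    """K-alpha wavelength (angstroms) corresponding to the Moseley frequency."""
    return C / k_alpha_frequency(z) * 1e10


for symbol, z in (("Cu", 29), ("Mo", 42)):
    nu = k_alpha_frequency(z)
    print(f"{symbol} (Z={z}): sqrt(nu) = {math.sqrt(nu):.3e} Hz^0.5, "
          f"lambda = {k_alpha_wavelength_angstrom(z):.3f} A")
```

The wavelengths land within a few percent of the Cu Kα (about 1.5 Å) and Mo Kα (0.707 Å) values used later in the chapter, which is the level of agreement implied by "very closely" for a one-parameter formula.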

One of the first important applications of Moseley’s work was the 
resolution of several problems which were of great interest in connection 
with the periodic table. It was found that cobalt (at wt = 58.93) 
rather than nickel (at wt = 58.71) is element number 27. Similarly 
it was confirmed that argon (at wt = 39.948) should be placed before 


potassium (at wt = 39.102), as had been suggested by chemists. Still 
another contribution was the observation that element 43 was missing 


and that ruthenium has atomic number 44. The observation of the K 
series characteristic spectrum of any element is generally regarded as 
conclusive proof of the presence of this element in the target material. 
Ever since Moseley's work, any claim for the discovery of a new element 
has available a definitive check in the form of the proper K x-ray 
spectrum, provided a large enough sample is available. 


7.8 The Continuous X-ray Spectrum 


Support for the hypothesis that electromagnetic radiation is emitted as 
photons of energy hv came from the continuous x-ray spectrum. When 
high-energy electrons fall on a target, there is always a broad continuous 
spectrum of x-rays emitted as well as some characteristic lines. The 
general features of the continuous spectra from all sorts of target ele- 
ments are essentially identical. Figure 7.11 shows the 1918 data of 
Ulrey for the continuous spectrum of wolfram (tungsten) for four 
potential differences applied to the x-ray tube. Here I_λ dλ is the power 
emitted in the wavelength region between λ and λ + dλ. 

Three features of the curves strike the eye immediately. 

1. For each potential difference there exists a short-wavelength 
limit; i.e., the shortest wavelength emitted at a given tube potential 
difference is well defined, whereas on the long-wavelength side the 
intensity falls off gradually. In 1915 Duane and Hunt showed that the 
short-wavelength limit λ_min is inversely proportional to the potential 
difference V. Clearly the maximum kinetic energy K_max of an electron 
striking the target is Ve, and this is the largest energy that the electron 
can give up in producing electromagnetic radiation. If all this energy 

The ratio of the power scattered to the primary intensity is called 
the scattering cross section (or scattering coefficient) of the free electron, 
designated by σ_e. 


$$\sigma_e = \frac{8\pi}{3}\left(\frac{e^2}{4\pi\epsilon_0 m c^2}\right)^2 \tag{7.9}$$


The scattering coefficient has the dimensions of area; of the radiation 
incident on a unit area, the electron scatters the amount which would 
fall on the area σ_e, which has the numerical value 0.666 × 10⁻²⁸ m², or 
0.666 barn (1 barn = 10⁻²⁴ cm² = 10⁻²⁸ m²). The Thomson (or classical) 
scattering coefficient is independent of wavelength. If each electron of 
an atom scatters independently, the atomic scattering cross section σ_a 
is the product of the atomic number of the atom and the electron cross 
section; σ_a = Zσ_e. 
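
The numerical value of the Thomson cross section can be reproduced from the fundamental constants. This is a minimal sketch assuming the standard SI form of Eq. (7.9), σ_e = (8π/3)(e²/4πε₀mc²)², with rounded modern constants; the names are illustrative, and the last digit differs slightly from the older value quoted in the text.

```python
import math

E_CHARGE = 1.602e-19   # electron charge, C
EPS0 = 8.854e-12       # permittivity of free space, F/m
M_E = 9.109e-31        # electron mass, kg
C = 2.998e8            # speed of light, m/s
BARN = 1e-28           # 1 barn in m^2


def thomson_cross_section():
    """Classical (Thomson) scattering cross section of a free electron, in m^2."""
    classical_radius = E_CHARGE ** 2 / (4.0 * math.pi * EPS0 * M_E * C ** 2)
    return (8.0 * math.pi / 3.0) * classical_radius ** 2


def atomic_cross_section(z):
    """Atomic cross section if all Z electrons scatter independently: sigma_a = Z * sigma_e."""
    return z * thomson_cross_section()


sigma_e = thomson_cross_section()
print(f"sigma_e = {sigma_e:.3e} m^2 = {sigma_e / BARN:.3f} barn")
print(f"sigma_a for carbon (Z = 6) = {atomic_cross_section(6) / BARN:.2f} barn")
```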

Thomson’s theory predicts not only the intensity of the scattered 
beam but also its polarization. At θ = 0 and θ = 180°, the scattered 
beam from an unpolarized incident beam is unpolarized, but at θ = 90° 
the scattered beam is plane-polarized with electric vector in the y direction. 

When a beam of x-rays of initial intensity I₀ passes through a thin 
layer of thickness Δx of any material, scattering by electrons in the layer 
changes the intensity by an amount ΔI given by 


$$-\Delta I = \sigma I_0\,\Delta x \tag{7.10}$$


where σ is called the linear scattering coefficient of the material. If each 
of the electrons of the material were to scatter independently of the 
others, the linear scattering coefficient would be 


$$\sigma = n\sigma_e = \frac{8\pi}{3}\left(\frac{e^2}{4\pi\epsilon_0 m c^2}\right)^2 n \tag{7.11}$$


where n is the number of electrons per unit volume. 

The ratio of σ to the density ρ of the scatterer is the mass scattering 
coefficient σ_m. For substances such as gases, the linear scattering 
coefficient depends, of course, on the density, but σ_m = σ/ρ is constant. 
The first quantitative estimate of the number of electrons in an atom 
was made by Barkla and his coworkers in 1909 by applying Thomson’s 
theory to data on the scattering of x-rays by carbon. They passed a 
beam of Mo Kα x-rays through a graphite absorber and after correcting 
for photoelectric absorption found that the value of σ_m is approximately 
0.02 m²/kg for carbon. If σ = nσ_e, then σ_m = nσ_e/ρ, and the number 
of electrons per kilogram is 


$$\frac{n}{\rho} = \frac{\sigma_m}{\sigma_e} = \frac{0.02\ \text{m}^2/\text{kg}}{0.666 \times 10^{-28}\ \text{m}^2/\text{electron}} = 3 \times 10^{26}\ \text{electrons/kg}$$


The number of carbon atoms per kilogram is the ratio of Avogadro’s 
number N_A to the atomic weight A, or 


$$\frac{N_A}{A} = \frac{6.02 \times 10^{26}\ \text{atoms/kmole}}{12\ \text{kg/kmole}} = 5 \times 10^{25}\ \text{atoms/kg}$$


Fig. 7.14 Ratio of the scattering coefficients σ_a of various atoms to Zσ_e, the 
value which would pertain if each electron scattered independently with cross 
section σ_e, plotted against wavelength in Å. (Reproduced by permission of 
D. Van Nostrand Company, Inc., from Fig. III-3 in A. H. Compton and 
S. K. Allison, “X-rays in Theory and Experiment,” 1935.) 


Hence the number of electrons per carbon atom must be 6. This 
determination was significant in the development of atomic theory, since it 
suggested the fact that the number of electrons per atom is less than the 
atomic weight and is perhaps about half of A. 
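
Barkla's estimate amounts to two divisions, sketched below with the numbers quoted in the text. The variable names are illustrative, and the calculation assumes, as the text does, that each electron scatters independently with the Thomson cross section.

```python
SIGMA_M_CARBON = 0.02    # measured mass scattering coefficient of carbon, m^2/kg
SIGMA_E = 0.666e-28      # Thomson cross section per electron, m^2
N_A = 6.02e26            # Avogadro's number, atoms per kmole
A_CARBON = 12.0          # atomic weight of carbon, kg/kmole

# sigma_m = n * sigma_e / rho, so n / rho = sigma_m / sigma_e.
electrons_per_kg = SIGMA_M_CARBON / SIGMA_E
atoms_per_kg = N_A / A_CARBON
electrons_per_atom = electrons_per_kg / atoms_per_kg

print(f"electrons per kg   = {electrons_per_kg:.2e}")
print(f"atoms per kg       = {atoms_per_kg:.2e}")
print(f"electrons per atom = {electrons_per_atom:.2f}")
```

The ratio is very nearly 6, that is, A/2 for carbon, which is the point made in the paragraph above.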

Later measurements of scattering coefficients for λ > 0.3 Å showed 
that for heavier elements the scattering coefficient σ exceeds nσ_e, and 
hence the atomic scattering cross section σ_a is greater than Zσ_e (Fig. 
7.14). The excessive scattering is easily explained classically. When λ 
becomes comparable with the distances between electrons in an atom, 
they no longer scatter independently; the waves scattered by neighboring 
electrons are superposed more or less in the same phase, and so 
constructive interference occurs. As λ increases from a very small value, at first 
the inner (K) electrons scatter coherently; for longer wavelengths more 
electrons scatter nearly in phase. When λ is large compared to the 
atomic diameter, the relative phase differences of the waves from all Z 
electrons of an atom are small, and so the amplitude of the scattered wave 
from the atom is approximately proportional to Z and the scattered 
intensity to Z². The theory of coherent scattering was worked out by 
Rayleigh, the cooperative scattering of the atomic electrons often being 
referred to as Rayleigh scattering. The decrease in σ at short wavelengths 
is predicted by the relativistic, quantum-mechanical treatment of 
scattering due to Klein and Nishina (Sec. 24.2). 

From the curves of Fig. 7.14 one sees what fortuitous choices were 
involved in Barkla’s successful measurement of the number of electrons 
per carbon atom. Had he selected a scatterer of higher atomic number 
or used Cu Kα radiation (1.5 Å) rather than Mo Kα (0.707 Å), his 
calculation would have been substantially in error. 


7.10 Compton Scattering 


According to the classical theory of scattering, atomic electrons are 
driven to perform simple harmonic motion by the electric intensity of 
the incident waves; the accelerated electrons reradiate at the same 
frequency. Such scattering at the frequency of the incident radiation 
is observed over the entire electromagnetic spectrum. However, it is 
weak at high frequencies. Before 1914, Gray and others had shown that 
scattered x-rays are more readily absorbed than the incident beam, which 
implies that on the average the scattered photons are of lower energy. 

In 1923 Compton published the results of careful measurements of 
the x-ray frequencies scattered by carbon atoms upon which 
monochromatic x-rays were incident. He found that the scattered beam 
contained two frequencies, one the same as that of the incident beam and the 
second somewhat lower. On the basis of the quantum theory of 
radiation, Compton derived a relation which quantitatively predicted 
scattering at the observed lower frequency. This Compton effect represents 
one of the most conclusive evidences of the particle properties of 
electromagnetic radiation and probably convinced more physicists that quantum 
mechanics should be accepted than any other experiment. 


Fig. 7.15 The scattering of Mo Kα x-rays (λ = 0.707 Å) at an angle θ gives 
rise to two peaks in the scattered radiation, one at the incident wavelength 
and the second at a wavelength greater by Δλ = 0.024(1 − cos θ) Å. 
(After A. W. Smith and J. N. Cooper, “Elements of Physics,” 7th ed. 
Copyright 1964. McGraw-Hill Book Company. Used by permission.) 

Compton used a graphite block to scatter Mo Ka radiation and 
measured the wavelength of the scattered photons with a Bragg 
spectrometer (Fig. 7.15). At each finite scattering angle, he observed two peaks 
in the scattered beam, one at the incident wavelength 0.707 Å and the 
second at a wavelength longer by an amount Δλ dependent on the 
scattering angle θ, according to the relation Δλ = 0.024(1 − cos θ) Å. 
Compton subsequently showed that Δλ is independent of the scattering 
material. 

To explain the occurrence of the shifted component, Compton 
boldly applied the quantum picture of radiant energy. He assumed 
that the scattering process could be treated as an elastic collision between 
a photon and an electron, governed by the two laws of mechanics, the 
conservation of energy and the conservation of momentum. Let an 
incident photon of energy hν collide with an electron initially at rest. 
The photon is scattered through an angle θ, while the electron recoils 
in a direction φ (Fig. 7.16). The kinetic energy K given to the electron 
is (m − m₀)c² by Eq. (2.12). If ν_θ is the frequency of the scattered 
photon, the conservation of energy requires that the sum of the kinetic 
energy of the electron and the energy of the scattered photon be equal 
to the energy of the incident photon, or 

$$h\nu = h\nu_\theta + (m - m_0)c^2 \tag{7.12}$$


Fig. 7.16 Elastic collision of a photon with an electron initially at rest. 
(Labels: incident photon of energy hν and momentum hν/c = h/λ; recoil 
electron of momentum p and kinetic energy K = (m − m₀)c².) 


Each photon carries momentum equal to its energy hν divided by 
the speed of light c, or h/λ. Since momentum is a vector quantity and 
is conserved, the x and y components (Fig. 7.16) must obey the equations 


$$\frac{h\nu}{c} = \frac{h\nu_\theta}{c}\cos\theta + p\cos\phi \tag{7.13a}$$

$$0 = \frac{h\nu_\theta}{c}\sin\theta + p\sin\phi \tag{7.13b}$$

where p is the momentum of the recoil electron. (The plus sign appears 
with sin φ because it is assumed all angles are positive when measured 
counterclockwise from the x axis; thus in Fig. 7.16 φ is a negative angle.) 

If we introduce the wavelengths of the two photons through ν/c = 1/λ 
and ν_θ/c = 1/λ_θ and recall that p = m₀v/√(1 − (v/c)²), we may rewrite 
Eqs. (7.13a, b) thus: 


$$\frac{h}{\lambda} - \frac{h\cos\theta}{\lambda_\theta} = \frac{m_0 v\cos\phi}{\sqrt{1-(v/c)^2}} \qquad \frac{h\sin\theta}{\lambda_\theta} = -\frac{m_0 v\sin\phi}{\sqrt{1-(v/c)^2}}$$

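Squaring these two equations and adding them eliminates φ and 
leaves 


$$\frac{h^2}{\lambda^2} + \frac{h^2}{\lambda_\theta^2} - \frac{2h^2}{\lambda\lambda_\theta}\cos\theta = \frac{m_0^2v^2}{1-(v/c)^2} = \frac{m_0^2c^2}{1-(v/c)^2} - m_0^2c^2 \tag{7.14}$$


Similarly, Eq. (7.12) can be divided by c and written 


$$\frac{h}{\lambda} - \frac{h}{\lambda_\theta} + m_0c = \frac{m_0c}{\sqrt{1-(v/c)^2}} \tag{7.12a}$$

and squared to give 

$$\frac{h^2}{\lambda^2} + \frac{h^2}{\lambda_\theta^2} - \frac{2h^2}{\lambda\lambda_\theta} + 2m_0ch\left(\frac{1}{\lambda} - \frac{1}{\lambda_\theta}\right) + m_0^2c^2 = \frac{m_0^2c^2}{1-(v/c)^2} \tag{7.15}$$


Subtracting Eq. (7.14) from (7.15) yields 


$$\frac{2h^2}{\lambda\lambda_\theta}(1-\cos\theta) - 2m_0ch\left(\frac{1}{\lambda} - \frac{1}{\lambda_\theta}\right) = 0$$

or 


$$\lambda_\theta - \lambda = \Delta\lambda = \frac{h}{m_0c}(1-\cos\theta) = 0.02426\,(1-\cos\theta)\ \text{Å} \tag{7.16}$$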


Thus, the wavelength shift predicted by Compton’s theory is in excellent 
agreement with the experimental results. Compton scattering reveals 
the particle characteristics of photons. 

The quantity h/m₀c is known as the Compton wavelength of the 
electron; it corresponds to the wavelength of a photon with energy equal to 
the rest energy m₀c² of the electron. It is sometimes useful to extend 
this idea to other particles; thus the Compton wavelength of a proton 
of rest mass M_p is h/M_p c. 

The Compton shift in wavelength, Eq. (7.16), depends on the 
scattering angle but is independent of the energy of the incident photon. 
On the other hand, the energy difference between incident and scattered 
photon increases rapidly as the energy of the incident photon is raised, 
as can be readily found by rewriting Eq. (7.16) in the form 


$$h\nu - h\nu_\theta = \frac{(h\nu)(h\nu_\theta)}{m_0c^2}(1-\cos\theta) \tag{7.16a}$$
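
The contrast between the wavelength shift and the energy loss can be made concrete with a short calculation. This is a minimal sketch using rounded constants; the function names are illustrative, and the second wavelength (one-tenth of Mo Kα) is chosen arbitrarily to show the trend.

```python
import math

H = 6.626e-34     # Planck's constant, J s
M0 = 9.109e-31    # electron rest mass, kg
C = 2.998e8       # speed of light, m/s
EV = 1.602e-19    # joules per electron volt

COMPTON_WAVELENGTH = H / (M0 * C)   # h / m0 c, in meters


def compton_shift(theta_deg):
    """Wavelength shift (m) from Eq. (7.16); independent of the incident wavelength."""
    return COMPTON_WAVELENGTH * (1.0 - math.cos(math.radians(theta_deg)))


def photon_energy_loss_ev(wavelength_m, theta_deg):
    """Energy lost by the photon (eV): h*nu - h*nu_theta."""
    scattered = wavelength_m + compton_shift(theta_deg)
    return H * C * (1.0 / wavelength_m - 1.0 / scattered) / EV


print(f"h/m0c = {COMPTON_WAVELENGTH * 1e10:.5f} A")
for lam_angstrom in (0.707, 0.0707):   # Mo K-alpha and a ten times harder photon
    lam = lam_angstrom * 1e-10
    print(f"lambda = {lam_angstrom} A: shift at 90 deg = {compton_shift(90.0) * 1e10:.5f} A, "
          f"energy loss = {photon_energy_loss_ev(lam, 90.0):.0f} eV")
```

The shift at a given angle is the same for both photons, while the energy given up grows steeply as the incident wavelength shortens.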


In Compton’s experimental work there appeared not only a modi- 
fied line but also a line with the same wavelength as the primary. This 
unmodified line can be explained in terms of the photon concept as the 
result of a collision of a photon with a bound electron, held sufficiently 
tightly for an entire atom (or even an entire crystal) to recoil. In such a 
collision the predicted wavelength shift is given by replacing mo by the 
mass of the recoiling atom in Eq. (7.16) and is so small as to be unobserv- 
able for typical x-ray photons. 

The substantially greater breadth of the modified line compared 
to the unmodified line can be explained satisfactorily in terms of the 
fact that most scattering electrons are not initially at rest but have a 
range of initial momenta and energies. 


7.11 Compton Recoil Electrons 

The recoil electrons predicted by Compton’s theory were promptly looked 
for and found by C. T. R. Wilson and by Bothe. The kinetic energy 
K = (m − m₀)c² of the recoil electrons is given by Eq. (7.12) 


$$K = (m - m_0)c^2 = h\nu - h\nu_\theta \tag{7.12b}$$

If ε represents hν/m₀c², it can be shown that 


$$K = h\nu\,\frac{2\varepsilon\cos^2\phi}{(1+\varepsilon)^2 - \varepsilon^2\cos^2\phi} \tag{7.17a}$$

$$K = h\nu\,\frac{\varepsilon(1-\cos\theta)}{1+\varepsilon(1-\cos\theta)} \tag{7.17b}$$


By 1927 Bless, using the magnetic spectrograph, showed that the observed 
values of K for the recoil electrons are in agreement with the theory. 
The relation between the scattering angles, ¢ for the electron and @ for 
the photon, is given by 


$$\cot\phi = -(1+\varepsilon)\tan\frac{\theta}{2} \tag{7.18}$$


Compton’s theory is based on the assumption that the recoil of the 
electron occurs simultaneously with the scattering of the photon and that 
both energy and momentum are conserved in the collision. The simultaneity 
and conservation requirements have been tested experimentally 
with great care by several observers by use of scintillation detectors and 
coincidence techniques. One detector fixes θ and measures hν′ for the 
photon, while a second detector gives φ and K for the recoil electron. 
Within experimental error, for every Compton photon scattered through an 
angle θ there is a recoil electron at angle φ given by Eq. (7.18) with kinetic 
energy given by Eq. (7.17b). The simultaneity time limit is less than 
2 × 10⁻⁸ s in many experiments and in some cases less than 10⁻¹¹ s. 


Throughout this discussion the conventional assumption has been 
made that the recoil electron was initially unbound and at rest, an 
approximation which is tenable only when the kinetic energy acquired 
by the recoil electron is much greater than the energy with which it 
is bound to its atom. The effect of the binding is to reduce Δλ slightly. 
In addition to the Compton process, in which the electron is freed 
from the atom, there are also Smekal-Raman processes, in which the 
frequency of the scattered radiation is changed by an amount which 
corresponds to transition between two bound states of the scattering 
atom. 

The Compton or modified line is very broad at comparatively 
low frequencies. This breadth can be thought of as caused by the 
motion of the electrons in the atom. In our simple deduction of the 
Compton effect, the electron was assumed to be initially at rest; if it 
is assumed to have a component of velocity, positive or negative, in 
the direction of the incident radiation, the wavelength shift is different. 
Wave mechanics furnishes a probability distribution for the velocities 
of the atomic electrons, from which the broadening can be computed, 
in agreement with experiment. As ν is then increased further, the 
Compton line becomes narrower. Eventually, in any direction of 
scattering other than that of the incident beam, the unmodified line 
becomes weaker than the Compton line, sooner at large angles of 
scattering than at small angles, and sooner for heavy atoms than for light 
ones. Finally, only the modified line remains in appreciable intensity. 


7.12 The Nature of Electromagnetic Radiation 


The question as to the true nature of electromagnetic radiation arises 
in its sharpest form in contemplating the contrasting properties of 
x-rays. How can an entity exhibit the wave behavior that is evidenced 
in the diffraction of x-rays by crystals and, on the other hand, the 
particle properties that are revealed by the Compton effect? 
According to quantum mechanics, as interpreted by Bohr and 
other theoretical physicists, the riddle can be solved only by extending 
to radiation a limitation upon the use of ordinary space-time concep- 
tions. Bohr insisted that whenever an observation of any sort is 
made, its immediate result is always expressible in terms of familiar 
ideas of space and time, since these ideas have been developed out of 
human experience and any observation necessarily includes as its 
primary stage a certain experience by a human observer. But it 
does not follow that it will always be possible to construct a picture 
of the physical reality that causes these experiences in the same way 
in which we picture everyday objects. In classical theory, an electro- 
magnetic field was assumed to have a certain character at every point 
in space and at every instant of time, as represented by certain values 
of the electric and magnetic vectors. According to the new view, 
such a conception of the electromagnetic field is valid at best as an 
approximation. 
The essential significance of what we call a radiation field lies in 
its effects upon the motion of charged particles. It is possible for the 
field to exhibit contrasting characteristics under different circum- 
stances. At one extreme, the action of the field takes the form with 
which we have become familiar in the Compton effect. Here a photon 
appears to bounce off an electron, thereby changing the momentum of 
the electron suddenly and discontinuously. The change of momentum 
δp will be definitely observable and measurable by the physicist 
only if it considerably exceeds the range of indeterminateness of the 
momentum already possessed by the particle. A clear-cut scattering 
process of the Compton type can occur only when the wavelength of the 
radiation is shorter than the diameter of the region in which the particle 
may be supposed to be effectively located. For an electron in an atom, 
this condition can be met only for wavelengths considerably shorter 
than the atomic diameter, such as the wavelengths of hard x-rays 
or γ rays. 

In the Compton process there is no feature that can be regarded 
as a manifestation of an electric intensity in the wave. In order to 
obtain, at the opposite extreme, an action of radiation that can be 
interpreted in terms of the familiar electric and magnetic vectors, two 
conditions must be met. The experimental conditions must be such 
that it is possible to follow the test particle in effectively continuous 
motion along a path, so that its acceleration can be determined. It is 
also necessary that a segment of path which is sufficiently long to permit 
an adequate determination of the acceleration shall yet be short 
enough to ensure that along it the field vectors do not vary appreciably 
in value. Thus the familiar action ascribable to electric and magnetic 
fields in the radiation is obtained only when the particle acted on is 
definitely located within a region much smaller than a wavelength. This 
latter condition is satisfied, for example, by the electrons in a wire 
held parallel to the electric vector in long electromagnetic waves. In 
such a wire alternating electric current is observed to be produced, 
varying in phase with the electric vector. Ions in the upper atmosphere 
acted on by such waves furnish another example. 

In such cases the magnetic field in the waves will also act upon 
the current or on the moving ions. It thus comes about that the 
average result is a force in the direction of propagation of the waves. 
This force constitutes an example of light pressure, and it can be 
regarded as analogous to the Compton recoil that occurs under other 
circumstances; but here there is a continuous rather than a discontinuous 
action. 




Problems 
1. Find the grating space for KBr and KCl, which have densities of 
2750 and 1984 kg/m³ respectively. 
Ans: 3.30 Å; 3.14 Å 


2. Find the wavelength for which first-order Bragg reflection occurs at 
a 20° glancing angle from calcite. At which angle does second-order 
reflection occur? What is the glancing angle for Mo Kα radiation 
(0.707 Å) in first order? 

Ans: 2.076 Å; 43.2°; 6.7° 
3. An x-ray tube with a copper target is operated at a potential difference 
of 25 kV. The glancing angle with a NaCl crystal for the Cu Kα 
line is found to be 15.8°. Find the wavelength of this line and the 
glancing angle for photons at the short-wavelength limit. 

Ans: 1.54 Å; 5.1° 
4. An x-ray tube with a tungsten target is operated with a dc power 
supply giving 15 mA at a potential difference of 60 kV. What is the 
short-wavelength limit of the x-rays produced? Estimate the approximate 
power radiated in x-rays. 

Ans: 0.207 Å; about 6 W 
5. Find the short-wavelength limit for x-radiation from (a) a television 
tube operated at 10 kV, (b) an x-ray tube operated at 100 kV, (c) a 
20-MeV betatron, and (d) a 1-GeV linear electron accelerator. 

Ans: 1.24 Å; 0.124 Å; 0.62 XU; 0.0124 XU 
6. Find the short-wavelength limit for the radiation from (a) an x-ray 
tube operated at 30 kV, (b) a 50-MeV electron linac (linear accelerator), 
and (c) a 300-MeV synchrotron. 
7. One of the best proofs for the discovery of a new element lies in 
obtaining the characteristic x-ray spectrum predicted for that element. 
Plot a Moseley graph for the Ka line and predict to three significant 
figures the wavelength of this line for elements 85, 87, and 94. 

Ans: 0.150, 0.147, 0.135 Å 
8. Assuming that all scattering is of the incoherent Thomson variety, 
find the atomic, mass, and linear scattering coefficients for copper. 
Judging by Fig. 7.14, at what wavelength would you expect these values to be 
approximately correct? At λ = 0.707 Å (Mo Kα) what would you estimate 
σₐ to be for Cu? 
9. Unpolarized x-rays undergo Thomson scattering at a graphite block. 
Find the polarization of the radiation scattered in the xz plane at θ = 30°. 
(The polarization is defined as 

$$\frac{I_\perp - I_\parallel}{I_\perp + I_\parallel}$$

where I⊥ and I∥ refer to the intensities scattered with the electric vector 
respectively normal and parallel to the xz plane.) 


10. X-rays of λ = 0.085 Å are scattered by carbon. At what angle 
will Compton scattered photons have a wavelength of 0.090 Å? 
Ans: 37.5° 
11. A photon is Compton scattered by an electron through an angle of 
90°. Find the energy of the scattered photon for incident-photon energies 
of 10 keV, 0.511 MeV, and 10 MeV. 

Ans: 0.0098, 0.256, and 0.486 MeV 
12. Show that the kinetic energy of a Compton scattered electron is 
given by Eq. (7.17a) in terms of φ or by Eq. (7.17b) in terms of θ. 
13. Find the smallest energy a photon may have and still transfer one-half 
of its energy to an electron initially at rest. 

Ans: 256 keV 
14. Show that the recoil angle for a Compton electron is given by 

$$\cos\phi = \frac{(h\nu)^2 - (h\nu')^2 + p^2c^2}{2h\nu\,pc}$$

where p is the momentum of the electron. 
15. A cosmic-ray photon of energy hν is scattered through 90° by an 
electron initially at rest. The scattered photon has a wavelength twice 
that of the incident photon. Find the frequency of the incident photon 
and the recoil angle of the electron. 

Ans: 1.2 × 10²⁰ Hz; 26.6° 
16. Derive Eq. (7.18). 
17. A photon of high energy hν makes a head-on collision with a free 
electron at rest. 

(a) Apply the conservation laws to this collision to show that 
$2h\nu = K + \sqrt{K^2 + 2m_0c^2K}$, where K is the kinetic energy of the recoil 
electron. 

(b) If hν = 10 MeV, find K. 
Ans: 9.75 MeV 


18. For a photon which has undergone Compton scattering, show that 
the energy is 

$$h\nu' = \frac{h\nu}{1 + \epsilon(1 - \cos\theta)}$$

where ε = hν/m₀c². 

19. A 0.511-MeV photon strikes an electron head on and is scattered 
straight backward. Find the energy of the scattered photon, the energy 
of the recoil electron, and the ratio v/c for the latter. 

20. If in a Compton scattering the recoil electron moves off at angle φ, 
show that the photon recoils at an angle θ given by 

$$\cos\theta = 1 - \frac{2}{(1 + \epsilon)^2\tan^2\phi + 1}$$

21. Verify that a freely moving electron cannot spontaneously emit or 
absorb a photon. 

22. When a high-energy photon undergoes Compton scattering at an 
angle of 30°, the recoil electron is projected with φ = 30°. Find the 
energy of the incident photon. 

Ans: 2.78 MeV 
23. (a) The average intensity of a beam of plane-polarized electromagnetic 
radiation is ½cε₀E₀², where E₀ is the amplitude of the electric 
vector. Calculate the classical amplitude of vibration of a free electron 
under the influence of a beam of x-rays with wavelength 0.7 Å and an 
intensity of 20 W/m². 
(b) What power is reradiated as scattered x-radiation by the 
electron? 
(c) Find the intensity required to give the electron an amplitude 
of 0.05 Å. 

24. A spectrometer to determine the energy of incident γ rays operates 
as follows. Collimated γ rays strike a thin target, and Compton recoil 
electrons proceeding in the direction of the original γ's are analyzed 
magnetically. (Note that the Compton scattering angle is then 180°.) 
If r is the radius of the electron’s path and B is the magnetic induction, 
derive a relativistically correct equation for the γ-ray energy in terms 
of r, B, and fundamental constants. 

25. In the inverse Compton effect a photon gains energy in an elastic 
collision with a high-energy electron. If in the laboratory reference 
frame, a 10-eV photon collided head on with a 500-MeV electron, find 
the energy of the Compton scattered photon in the laboratory system. 
26. One means of producing monoergic photons with high energy (and 
nearly 100 percent polarization) is through the inverse Compton effect 
(see Prob. 25). If an intense laser beam of 6942-Å photons is made to 
collide head on with a beam of 20-GeV electrons, find (a) the energy of 
the incident photons in a reference frame moving with the electrons, 
and (b) the energy of the back-scattered photons in the laboratory 
reference frame. Some of the high-energy photons observed in cosmic 
rays may have gained energy through the inverse Compton process. 


chapter eight 


Radioactivity and the Nuclear Atom 


The discovery of radioactivity by Henri Becquerel in 1896 marks the 
starting point of nuclear physics. As a result of his work and the investi- 
gations by the Curies, Rutherford and his collaborators, and many others, 
it became evident that this phenomenon involves the operation of forces of a 
different order of magnitude from any familiar to chemists or physicists of 
that time and that one was dealing with the actual transmutation of the 
elements, for which men had searched in vain since the time of the Egyptians. 
As a direct outgrowth of these studies came the identification of a host of 
materials possessing the remarkable property of spontaneously transforming 
from one into another, and with the unraveling of these successive transforma- 
tions came the first clue to the intimate relation between the elements. Exploi- 
tation of the enormously energetic radiations which these substances emitted 
led to the discovery of the nucleus itself, to the nuclear model of the atom, 
and finally to the artificially induced transmutation of the stable elements. 

plate, he found that the most easily deflected β particles were the most 
easily absorbed. He soon found that the β particles could also be 
deflected electrostatically and set out to determine their specific charge 
e/m by a comparison of the magnetic and electric deflections. A beam 
of particles traveling perpendicularly to an electrostatic field E suffers a 
displacement y perpendicular to the motion after traveling a length l in 
the field, where, by Eq. (6.6), 

$$y = \frac{1}{2}\,\frac{Ee}{m}\left(\frac{l}{v}\right)^2$$


Inserting the value of Bρ for the rays, Becquerel could obtain v, 
and hence also e/m. He found a velocity of more than half the speed 
of light and obtained a value for e/m of about 10¹¹ C/kg (the charge 
was negative). This observation suggested that the β rays were to be 
identified with the electrons observed as the cathode rays by Thomson 
(only 3 years earlier) but showed that their velocities greatly exceed 
those of the cathode rays. Later work by Kaufmann in 1902, who used 
simultaneous electric and magnetic deflections, gave a more accurate 
value of e/m and showed the variation of mass with velocity that was 
later to be explained by the special theory of relativity. The enormous 
velocities observed, up to 0.96c, emphasized again that radioactivity was 
no ordinary chemical phenomenon. 
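The way a magnetic and an electric deflection combine to yield v and e/m can be sketched as follows. This is a nonrelativistic illustration only, assuming the magnetic measurement supplies Bρ = mv/e and the electric one supplies y = ½(Ee/m)(l/v)²; eliminating e/m gives v = El²/(2yBρ). The function name and the numerical inputs are hypothetical, chosen for a round-trip check rather than taken from Becquerel's data, and for rays faster than about half the speed of light the relativistic variation of mass would have to be included.

```python
def speed_and_specific_charge(e_field, length, deflection, b_rho):
    """Return (v, e/m) from an electric deflection and a magnetic rigidity.

    Nonrelativistic sketch:
      magnetic deflection:  B * rho = m * v / e
      electric deflection:  y = (1/2) * (E * e / m) * (l / v)**2
    Eliminating e/m gives v = E * l**2 / (2 * y * B * rho).
    """
    v = e_field * length ** 2 / (2.0 * deflection * b_rho)
    e_over_m = v / b_rho
    return v, e_over_m


if __name__ == "__main__":
    # Hypothetical round trip: start from an assumed particle, build the two
    # "measurements", and recover the particle's speed and specific charge.
    true_v = 1.0e7            # m/s, slow enough for the nonrelativistic formulas
    true_e_over_m = 1.76e11   # C/kg
    e_field = 1.0e4           # V/m
    length = 0.05             # m
    b_rho = true_v / true_e_over_m
    y = 0.5 * e_field * true_e_over_m * (length / true_v) ** 2
    print(speed_and_specific_charge(e_field, length, y, b_rho))
```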

The magnetic deflection of the α rays was demonstrated by Rutherford 
in 1903. How difficult they were to deflect may be judged from 
Rutherford’s own figures, according to which, in a field of 1 Wb/m², the 
radius of curvature was some 39 cm, while that of the cathode rays from 
an ordinary tube would be only 0.01 cm! The sense of the deflection 
indicated a positive charge. Applying the methods of magnetic and 
electrostatic deflection, Rutherford obtained values of v = 2.5 × 10⁷ m/s 
and e/m = 6 × 10⁷ C/kg for the α particles of RaC′ (Po²¹⁴), a derivative 
of radium. Unlike the β particles, the α particles turned out to occur 
in groups with discrete velocities. An improved evaluation showed that 
e/m for α particles was half that for hydrogen ions, which suggested 
identification with doubly ionized helium. This suggestion was particularly 
attractive since it was well known that helium is always present 
in radioactive minerals. 
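The contrast between the two radii of curvature follows from the nonrelativistic relation r = v/[(e/m)B]. The sketch below applies it to the figures quoted for the α particles; the function names are illustrative, and the value 1.76 × 10¹¹ C/kg used for the electron is an assumed modern value, not one taken from the text.

```python
def radius_of_curvature(speed, charge_to_mass, b_field):
    """Radius (m) of the circular path of a charged particle: r = v / ((e/m) * B)."""
    return speed / (charge_to_mass * b_field)


def speed_for_radius(radius, charge_to_mass, b_field):
    """Speed (m/s) that gives a chosen radius in the same field: v = r * (e/m) * B."""
    return radius * charge_to_mass * b_field


if __name__ == "__main__":
    # Alpha particles: v = 2.5e7 m/s, e/m = 6e7 C/kg, B = 1 Wb/m^2 (figures from the text).
    print(radius_of_curvature(2.5e7, 6.0e7, 1.0))
    # Cathode rays: speed implied by a 0.01-cm radius, with e/m = 1.76e11 C/kg assumed.
    print(speed_for_radius(1.0e-4, 1.76e11, 1.0))
```

With these rounded inputs the α-particle radius comes out near 0.4 m, the same order as the 39 cm quoted, and a 0.01-cm radius corresponds to cathode rays moving at roughly 2 × 10⁷ m/s.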

In a series of experiments reported in 1908, Rutherford and Geiger 
measured directly the number of α particles emitted per gram of radium 
and the total charge carried by them, thus making possible a determination 
of the charge of the α particle. The apparatus for the first measurement 
is illustrated in Fig. 8.2. An evacuated vessel A contains a 
thin deposit of radium on a holder D. When the stopcock F is opened, 
the α particles pass through an aperture S, which is covered by a thin 
mica foil, into the counter C. The counter consists of a brass tube, 
provided with ebonite plugs through which a fine center wire passes. 
The tube is connected to the negative side of a high-voltage battery, the 
positive end of which is grounded, and the center wire is connected to 
an electrometer E provided with a high-resistance leak to ground. With 
the counter exhausted to a few centimeters pressure, the passage of a 
single α particle produces a momentary discharge which causes the 
electrometer to deflect. With this instrument, Rutherford and Geiger found 
that 3.4 × 10¹⁰ α particles were expelled per gram of radium per second 
(actually, from the amount of RaC′ (Po²¹⁴) in equilibrium with 1 g of 
radium). In the second experiment, they determined by direct measurement 
the amount of charge collected on a plate suspended above the 
preparation in an evacuated vessel. β rays and secondary electrons 
were eliminated by putting the apparatus in a strong transverse magnetic 
field. The result was 1.05 × 10⁻⁵ C/kg·s. Dividing this number by 
the total number of α particles gave for the charge per α particle 
3.1 × 10⁻¹⁹ C, which corresponds to two elementary charges. There could be 
little doubt that the α particles were actually helium ions. Even more 
conclusive evidence was obtained in 1909, when Rutherford and Royds 
detected spectroscopically the helium gas formed when α particles pass 
into a closed glass vessel. 

Fig. 8.2 Apparatus of Rutherford and Geiger for counting the number of α particles 
emitted by radium. Particles from the source D pass through aperture S and are 
registered in the counter C. (Adapted from E. R. Rutherford, J. Chadwick, and C. D. 
Ellis, “Radiations from Radioactive Substances,” Cambridge University Press, 1930.) 
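The division that yields the charge per α particle is easily reproduced. In the sketch below the count rate is converted from per gram to per kilogram so that it matches the units of the measured charge rate; the function name and the rounded elementary charge 1.602 × 10⁻¹⁹ C are assumptions of this illustration.

```python
ELEMENTARY_CHARGE = 1.602e-19  # C (rounded modern value, assumed here)


def charge_per_alpha(charge_rate_per_kg, count_rate_per_gram):
    """Charge carried by one alpha particle, in coulombs.

    charge_rate_per_kg  -- charge collected per kilogram of radium per second (C/kg.s)
    count_rate_per_gram -- alpha particles emitted per gram of radium per second
    """
    count_rate_per_kg = count_rate_per_gram * 1000.0
    return charge_rate_per_kg / count_rate_per_kg


if __name__ == "__main__":
    q = charge_per_alpha(1.05e-5, 3.4e10)  # figures quoted for the two experiments
    print(q, q / ELEMENTARY_CHARGE)
```

The quotient is about 3.1 × 10⁻¹⁹ C, or very nearly two elementary charges.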

The velocity obtained by Rutherford for the α particles from Po²¹⁴ 
implies a very high energy indeed. The mass of the helium atom, 
obtained from the atomic weight and Avogadro's number, is 

$$M = \frac{4.00}{6.02} \times 10^{-26} = 6.64 \times 10^{-27}\ \text{kg}$$

The kinetic energy is then ½Mv² = 1.23 × 10⁻¹² J = 7.7 MeV. Thus 
the emission of each α particle involves an energy change of several 
million times the energy release in a chemical reaction! It was pointed 
out by Rutherford and Soddy that the presence of only a few times 10-4 
parts by weight of radium in the earth would compensate for the heat 
lost from the earth by radiation. They also suggested that the heat of 
the sun might be derived from radioactive substances. Actually we now 
have a more sophisticated theory of the source of solar and stellar energy 
(Sec. 25.22). 
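The arithmetic behind the α-particle energy can be retraced in a few lines. The sketch below recomputes the helium mass from the atomic weight, converts the quoted kinetic energy of 1.23 × 10⁻¹² J to MeV, and finds the nonrelativistic speed that this energy implies; the constant and function names, and the conversion factor 1.602 × 10⁻¹³ J/MeV, are assumptions of this example.

```python
import math

AVOGADRO_PER_KMOL = 6.02e26   # atoms per kilomole (rounded)
JOULES_PER_MEV = 1.602e-13    # conversion factor (rounded)


def helium_mass_kg(atomic_weight=4.00):
    """Mass of one helium atom from its atomic weight and Avogadro's number."""
    return atomic_weight / AVOGADRO_PER_KMOL


def speed_from_kinetic_energy(k_joule, mass_kg):
    """Nonrelativistic speed v = sqrt(2K/M) corresponding to kinetic energy K."""
    return math.sqrt(2.0 * k_joule / mass_kg)


if __name__ == "__main__":
    mass = helium_mass_kg()
    k_joule = 1.23e-12  # kinetic energy quoted for the alpha particle
    print(mass, k_joule / JOULES_PER_MEV, speed_from_kinetic_energy(k_joule, mass))
```

The quoted energy corresponds to a speed of roughly 1.9 × 10⁷ m/s, which is of the order of 2 × 10⁷ m/s.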

The third component of the radioactive radiations, the γ rays, 
proved less amenable to study than the α and β rays. The γ rays are 
much more penetrating, and they are not deflected by electric or mag- 
netic fields. They do, however, produce ionization in air and fluorescence 
in many kinds of crystals; it is by these processes that they are ordinarily 
detected. It was conjectured early that they might be a form of electro- 
magnetic radiation, like x-rays, but the proof of this hypothesis had to 
await development of a much more complete understanding of the 
interaction of electromagnetic radiation with matter. 


8.3 Early Views on Atomic Structure 


Speculations about the structure of the atom date from the early years 
of the nineteenth century. In 1815 Prout proposed the hypothesis that 
the atoms of all elements are composed of atoms of hydrogen; but this 
view failed of acceptance when it was established that the atomic weights 
of some elements differ markedly from multiples of the atomic weight 
of hydrogen. 

The discovery of the electron in 1897 led to renewed interest in the 
internal structure of atoms by indicating that they must contain both 
negative electrons and positive charges. Two questions then arose: (1) 
How many electrons are there in an atom? (2) How are the electrons 
and the positive charges arranged? Between 1909 and 1911 Barkla 
obtained evidence from the scattering of x-rays, which indicated that the 
number of electrons in an atom of the lighter elements was equal to about 
half the atomic weight. As to the arrangement of the electrons, it 
seemed on the basis of classical ideas that two conditions must be met. 
(1) The ensemble of positive charges and electrons in an atom must 
be stable; the electrons, for example, must be held by forces of some sort 
in fixed positions of equilibrium about which they can vibrate, when 
disturbed, with the definite frequencies required to explain the characteristic 
line spectra of the elements. (2) Except when so disturbed, 
the electrons must be at rest, since otherwise they would emit radiation 
in accord with electromagnetic theory. The much larger mass of the 
atom as a whole was assumed to belong to the positive charges, which for 
this reason would vibrate very little. 

A possibility considered by J. J. Thomson was that the positive elec- 
tricity might be distributed continuously throughout a certain small 
region, perhaps with uniform density throughout a sphere. The electrons 
might then be embedded in the positive electricity, occupying normally 
certain positions of equilibrium and executing harmonic vibrations about 
these positions when slightly disturbed. Frequencies in the visible spectrum 
might thereby be emitted if the sphere of positive electricity were 
of the order of 10⁻¹⁰ m in radius. But Thomson was unable to show that 
these frequencies could be such as to form a series converging to an upper 
limit, and eventually his theory came into conflict with the experiments 
of Rutherford and his collaborators on the scattering of α particles, which 
are now to be described. 


8.4 The Scattering of Alpha Particles by Atoms 


The α rays from radioactive materials had been shown to be helium atoms 
which have lost two electrons. The α particles, whose initial velocity 
is of the order of 2 × 10⁷ m/s, could be studied by means of the flashes 
of light, or scintillations, they produce on striking a zinc sulfide screen, 
the impact of a single particle producing a visible flash which is readily 
observed under a low-power microscope. 

If a stream of α particles, limited by suitable diaphragms to a 
narrow pencil, be allowed to strike a zinc sulfide screen placed at right 
angles to the path of the particles, scintillations occur over a well-defined 
circular area equal to the cross section of the pencil. If, now, a thin 
film of matter, such as gold or silver foil, is interposed in the path of 
the rays, it is found that they pass quite readily through the foil but 
that the area over which the scintillations occur becomes larger and loses 
its definite boundary, indicating that some of the particles have been 
deflected from their original direction. 

Qualitatively, it is easy to explain the origin of the forces which 
cause the deflection of the α particle. The particle itself has a twofold 
positive charge. The atoms of the scattering material contain charges, 
both positive and negative. In its passage through the scattering 
material, the particle experiences electrostatic forces the magnitude and 
direction of which depend on how near the particle happens to approach 
to the centers of the atoms past which or through which it moves. 

If we assume the Thomson atomic model, the path of an α particle 
in passing through an atom might be as indicated in Fig. 8.3a, the net 
result of the passage being to deflect the particle through a small angle θ. 
That θ should be small is made plausible by the arguments below. 

Fig. 8.3 Comparison of deflection of an α particle by (a) the Thomson 
atom and (b) the Rutherford nuclear atom. 

The atom is electrically neutral, so that an α particle passing along 
a diameter of the atom should experience no net force from the positive 
charge and negligible deflections from those electrons which it misses 
widely. But if there were a direct hit on some electron, the recoil 
velocity of the electron could not exceed 2v, where v is the initial speed of 
the particle, since even in an elastic collision the velocity of separation 
cannot exceed the velocity of approach. Hence the maximum momentum 
which can be transferred to an electron, and therefore lost by the 
α particle, is 2mv, where m is the electron mass. Surely then the component 
of the momentum normal to its initial momentum must be less 
than 2mv, and the maximum scattering angle θ_max must be less than 
2mv/Mv, where M is the mass of the α particle, or much less than 10-* rad. 
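The size of this bound is quickly evaluated. The sketch below computes the ratio 2mv/Mv = 2m/M, in which the speed cancels; the electron mass 9.109 × 10⁻³¹ kg is an assumed modern value, the α-particle mass is taken as the helium-atom mass 6.64 × 10⁻²⁷ kg, and the function name is illustrative.

```python
import math

ELECTRON_MASS_KG = 9.109e-31  # assumed modern value
ALPHA_MASS_KG = 6.64e-27      # helium-atom mass, used here for the alpha particle


def max_deflection_by_electron(electron_mass, alpha_mass):
    """Upper bound (rad) on the deflection of an alpha particle by one electron.

    The electron can carry off at most a momentum 2*m*v, so the bound is
    (2*m*v) / (M*v) = 2*m / M, independent of the speed v.
    """
    return 2.0 * electron_mass / alpha_mass


if __name__ == "__main__":
    theta_max = max_deflection_by_electron(ELECTRON_MASS_KG, ALPHA_MASS_KG)
    print(theta_max, math.degrees(theta_max))
```

The bound is a few times 10⁻⁴ rad, or a few hundredths of a degree, so that a single encounter with an electron cannot account for a large deflection.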

Finally the effect of the positive core cannot produce a large 
deflection. For any passage through the atom there are volume elements 
of positive charge symmetrically arranged on each side of the path, 
and the contributions from these cancel. There remain some positive 
charge elements for which there are no canceling elements, but even if 
one ignores the electrons embedded therein, the maximum deflection 
expected is of the order of 10⁻⁴ rad. The total deflection of a particle 
in passing through a number of atoms in a thin layer of scattering material 
varies according to the laws of probability. Such a process is called 
multiple or compound scattering. According to Rutherford, the number 
of α particles N_φ which, as a result of multiple scattering, should be 
scattered by such an atom through an angle φ or greater is given by 
$N_\phi = N_0 e^{-(\phi/\phi_m)^2}$, where N₀ is the number of particles for φ = 0 and 
φ_m is the average deflection after passing through the scattering material. 

Geiger had shown experimentally that the most probable angle of 
deflection of a pencil of α particles in passing through gold foil 3755 mm 
thick is of the order of 1°. It is evident, therefore, from the last equation, 
that the probability for scattering through large angles becomes 
vanishingly small; for 30°, it would be of the order of 10-!*. Geiger 
showed that the number of particles scattered through large angles was much 
greater than the theory of multiple scattering predicted. Indeed, Geiger and 
Marsden showed that 1 in 8000 α particles was turned through an angle 
of more than 90° by a thin film of platinum. This so-called reflection, 
however, was shown to be not a surface phenomenon but a volume effect, 
since the number of particles turned through more than 90° increased, 
up to a certain point, with increasing thickness of the scattering foil. 


8.5 The Nuclear Atom 


a. Rutherford’s Hypothesis In a classic article published in 1911, 
Rutherford proposed a new type of atomic model capable of giving to 
an α particle a large deflection as a result of a single encounter. He 
assumed that in an atom the positive charge and most of the atomic mass 
are concentrated in a very small central region, later called the nucleus, 
about which the electrons are grouped in some sort of configuration. 
Since normally atoms are electrically neutral, the positive charge on the 
nucleus must be Ze, where e is the numerical electronic charge and Z 
is an integer, characteristic of the kind of atom. 

The deflection of an α particle by such a nucleus might be as 
illustrated in Fig. 8.3b. Rutherford calculated the distribution of α 
particles to be expected as the result of single-scattering processes by 
atoms of this type. According to the Rutherford scattering formula, 


derived in Appendix 8A, the number N_d of scattered α particles reaching 
a small detector of area A is given by 

$$N_d = \frac{N_i n t A}{4R^2}\,\frac{4Z^2e^4}{(4\pi\epsilon_0)^2 m^2 v_0^4}\,\frac{1}{\sin^4(\phi/2)} \qquad (8.1)$$

where N_i = number of α particles incident on a foil of thickness t with 
n nuclei of atomic number Z per unit volume 
R = distance of detector from foil 
m = mass of incident α particles 
v₀ = speed of incident α particles 
φ = scattering angle 
e = charge of a proton 
ε₀ = permittivity of free space 

This formula is subject to correction, however, for the motion induced 
in the scattering nucleus, which may be appreciable for light atoms. 

According to Rutherford’s formula, the number of particles striking 
a small detector with its sensitive face perpendicular to the direction 
of motion of the scattered particles should be: 


1. Proportional to the square of the atomic number Z of the scattering nuclei 
2. Proportional to the thickness t of the foil 
3. Inversely proportional to the square of the kinetic energy of the particles, 
i.e., to v₀⁴ 
4. Inversely proportional to the fourth power of the sine of half the scattering 
angle φ 
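These four proportionalities can be verified numerically. The sketch below assumes the standard SI form of the single-scattering result for a doubly charged projectile, N_d/N_i = (ntA/R²)[Ze²/(4πε₀mv₀²)]²/sin⁴(φ/2); the function name, the rounded constants, and all the foil and detector values are hypothetical and serve only to exhibit the scaling.

```python
import math

EPS0 = 8.854e-12      # permittivity of free space, F/m (rounded)
E_CHARGE = 1.602e-19  # charge of a proton, C (rounded)


def scattered_fraction(z, n, t, area, r, m, v0, phi_deg):
    """Fraction N_d / N_i of incident alpha particles reaching the detector.

    Assumed form (charge 2e on the projectile already included):
      N_d / N_i = (n t A / R^2) * [Z e^2 / (4 pi eps0 m v0^2)]^2 / sin^4(phi / 2)
    """
    s = math.sin(math.radians(phi_deg) / 2.0)
    length = z * E_CHARGE ** 2 / (4.0 * math.pi * EPS0 * m * v0 ** 2)
    return n * t * area / r ** 2 * length ** 2 / s ** 4


if __name__ == "__main__":
    # Hypothetical gold-foil setup, chosen only to show the scaling laws.
    base = dict(z=79, n=5.9e28, t=4.0e-7, area=1.0e-6, r=0.1,
                m=6.64e-27, v0=2.0e7, phi_deg=60.0)
    f0 = scattered_fraction(**base)
    print(scattered_fraction(**{**base, "z": 158}) / f0)          # Z doubled
    print(scattered_fraction(**{**base, "t": 8.0e-7}) / f0)       # thickness doubled
    print(scattered_fraction(**{**base, "v0": 4.0e7}) / f0)       # speed doubled
    print(f0 / scattered_fraction(**{**base, "phi_deg": 180.0}))  # 60 deg against 180 deg
```

Doubling Z multiplies the count by 4, doubling t by 2, doubling v₀ divides it by 16, and the count at 60° exceeds that at 180° by the factor 1/sin⁴ 30° = 16.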

b. Experimental Confirmation These predictions were completely verified 
in 1913 by the experiments of Geiger and Marsden. Their data are 
shown graphically in Fig. 8.4, in which the logarithm of the number of 
scintillations on the screen per minute is plotted as abscissa against the 
logarithm of 1/sin⁴ (φ/2) as ordinate. If these two quantities are proportional 
to each other, the points for each substance should lie on a straight 
line inclined at 45° with the axes. The two lines in the figure are drawn 
at exactly 45°, and the observed points are seen to agree well with the 
predictions. This is the more remarkable since the numbers of scintillations 
varied in the experiment over a very wide range of values, the 
left-hand points on the plots representing 22 particles per minute for 
silver and 33 for gold, whereas the right-hand points represent 105,400 
and 132,000, respectively. 

Fig. 8.4 Data of Geiger and Marsden on the scattering of α 
particles by thin foils. (Ordinate: log 1/sin⁴ (φ/2); abscissa: log of 
scintillations per unit area at angle φ from incident direction; separate 
points for silver and gold.) 

Rutherford’s prediction that the scattering should, for small thick- 
nesses, be proportional to thickness was confirmed by the observations. 
Geiger and Marsden also showed that “the amount of scattering by a 
given foil is approximately proportional to the inverse fourth power of 
the velocity (inverse square of the energy) of the incident α particles,” 
over a range of velocities such that the number of scattered particles 
varied as 1:10. 


c. Atomic Number Geiger and Marsden further concluded, from a 
study of the variation of scattering with atomic weight and of the fraction 
of the total number of incident particles scattered through a given angle, 
(1) that the scattering is approximately proportional to the atomic weight 
of the scatterer over the range of elements from carbon to gold, and (2) 
that “the number of elementary charges composing the center of the atoms is 
equal to half the atomic weight.” This second conclusion was in agreement 
with Barkla’s experiments on the scattering of x-rays. Thus, carbon, 
nitrogen, and oxygen should have, respectively, 6, 7, and 8 electrons, 
around a nucleus containing, in each case, an equal amount of positive 
charge. These elements are, respectively, the sixth, seventh, and eighth 
elements in the periodic table. The hypothesis was natural, therefore, 
that the number of electrons in the atom, or the number of units of positive 
charge on its nucleus, is numerically equal to the ordinal number which 
the atom occupies in the series of the elements, counting hydrogen as the 
first. Thus originated the concept of the atomic number of an element,
which we may think of variously as (1) the ordinal number of the ele- 
ment in the series of the elements starting with Z = 1 for hydrogen, or 
(2) the positive charge carried by the nucleus of the atom, in terms of 
the numerical electronic charge e as a unit, or (3) the number of electrons 
surrounding the nucleus in the neutral atom. 

These experiments of Geiger and Marsden so completely confirmed 
the conclusions which Rutherford had reached by postulating the nuclear 
type of atom that in spite of certain weighty objections his atomic model 
was at once adopted. 


d. Some Difficulties The objections to the nuclear type of atom were 
based on questions of stability. No stable arrangement of positive and 
negative point charges at rest can be invented; it can be proved, in fact, 
that such an arrangement is impossible (Earnshaw’s theorem). An 
electron might be imagined to revolve around a nucleus as the earth
revolves around the sun. But according to the laws of classical
electromagnetic theory, it would radiate energy. The system will, therefore,
“run down”; the electron will approach the nucleus along a spiral path
and will give out radiation of constantly increasing frequency. This does
not agree with the observed emission of spectral lines of fixed frequency. 

It was at this point that Bohr introduced his epoch-making theory 
of the structure of the atom and of the origin of spectra, described in 
Chap. 9. His theory constituted an extension of Planck’s theory of 
quanta to Rutherford’s nuclear atom, in an attempt both to remove the 
difficulties of the nuclear model and to explain the origin of atomic 
spectra. 


8.6 Radioactive Transformations 


The hypothesis that there exists a genetic relation between some of the 
radioactive elements was advanced by Rutherford and Soddy in 1903,
on the basis of their work on the decay and production of chemically
separable activities. A typical example is afforded by a study of the
γ rays of radium. Radium solutions continually evolve a gas, called
radon. If the radon gas is completely pumped away from a radium
source which has been standing in a sealed vessel, it is found that after
a short time the gas exhibits all the γ activity (measured by absorbing
out the α and β components), while the solution has none. If the vessels
containing the radon and the radium are left standing separately for
a few weeks, it is found that the latter now is γ active again and the
former is not. More detailed measurements of the γ-ray activity of the
two samples as a function of time lead to the curves shown in Fig. 8.5,
where it may be seen that aside from certain transient effects at the
very beginning, the activity of the radon falls exponentially, reaching
half-value in about 3.8 days. The γ-ray activity of the radium solution
rises exponentially, reaching half-value in 3.8 days; the sum of the two
curves is constant in time. That two activity curves relating to
chemically different substances should bear such close similarity suggests the
hypothesis that one member (Rn) is produced by spontaneous
transmutation of the other. This is indeed the case. [Actually the γ rays in
this case come not directly from the radon, but from RaC (Bi^{214}), which
appears after three rapid subsequent disintegrations. The least rapid
of these transmutations has a half-life of 26.8 min, so that there is a
transient effect of this magnitude in the growth and decay curves of
Fig. 8.5.]

Fig. 8.5 Growth and decay of the γ-ray activities (gamma activity
plotted against time in days) associated with (1)
a radium solution from which all radon gas has been pumped and (2)
the radon gas. (The initial transient due to the growth of Po^{218} and
Pb^{214} is exaggerated in the figure; the γ rays come from Bi^{214}.)


a. Theory of Spontaneous Transformations The essential feature of 
the transformation theory lies in the assumption that each radioactive 
nucleus has a definite probability of disintegrating in a given time, a 
probability which depends on the kind of nucleus but which is constant 
for all nuclei of a given kind. If we have N such nuclei, the number 
disintegrating in a time dt is dN = −λN dt, where λ is the disintegration
constant and the minus sign indicates that N is decreasing with time. 
The number of atoms decaying per unit time, −dN/dt, is called the
activity or disintegration rate of the sample. If the sample is isolated so
that no new atoms are being added,

    -\frac{dN}{dt} = \lambda N        (8.2a)

and

    N(t) = N(0) e^{-\lambda t}        (8.2b)

where N(0) is the number present at t = 0. The time t_{1/2} at which half of
the original atoms have disintegrated, called the half-life, is given by

    \frac{N(t_{1/2})}{N(0)} = \frac{1}{2} = e^{-\lambda t_{1/2}}, \qquad t_{1/2} = \frac{\ln 2}{\lambda} = \frac{0.693}{\lambda}

The mean life, or life expectancy, of the average atom is given by

    \tau = \frac{\int_0^\infty t e^{-\lambda t}\,dt}{\int_0^\infty e^{-\lambda t}\,dt} = \frac{1}{\lambda}        (8.3)

The behavior of an isolated radon sample with a given initial number of
atoms is thus accounted for by a simple decay, with disintegration
constant λ(Rn) = 0.693/t_{1/2} = 0.21 × 10^{-5} s^{-1}.
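The relations among disintegration constant, half-life, and mean life can be checked numerically. The following is a minimal sketch, not part of the original text; the function names are illustrative, and the radon half-life of 3.8 days is the rounded value quoted above.

```python
import math

SECONDS_PER_DAY = 86400.0


def decay_constant(half_life):
    """Disintegration constant, ln 2 / t_half (inverse of the input time unit)."""
    return math.log(2) / half_life


def fraction_remaining(lam, t):
    """N(t)/N(0) = exp(-lambda t), Eq. (8.2b)."""
    return math.exp(-lam * t)


def mean_life(lam):
    """Mean life 1/lambda, Eq. (8.3)."""
    return 1.0 / lam


# Radon, with the rounded half-life of 3.8 days used in the text.
t_half_rn = 3.8 * SECONDS_PER_DAY
lam_rn = decay_constant(t_half_rn)

print(f"lambda(Rn) = {lam_rn:.2e} per second")
print(f"fraction left after one half-life = {fraction_remaining(lam_rn, t_half_rn):.3f}")
print(f"mean life = {mean_life(lam_rn) / SECONDS_PER_DAY:.2f} days")
```

The mean life exceeds the half-life by the factor 1/ln 2, about 1.44, which is why the two should not be used interchangeably.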

If now we consider the production of radon in the vessel containing
radium, we have a somewhat different situation. The number
of radium atoms disintegrating in time dt is dN_1 = −λ_1 N_1 dt, where λ_1
is the disintegration constant of radium. Each of these disintegrations
produces a radon atom; in the same time, however, a number λ_2 N_2 dt of
radon atoms disintegrate (the subscript 2 refers to radon), so that the
total change in the amount of radon is

    dN_2 = +\lambda_1 N_1\,dt - \lambda_2 N_2\,dt = [\lambda_1 N_1(0) e^{-\lambda_1 t} - \lambda_2 N_2]\,dt

If at time t = 0 there is no radon present, the integral of this equation is

    N_2(t) = N_1(0) \frac{\lambda_1}{\lambda_2 - \lambda_1} (e^{-\lambda_1 t} - e^{-\lambda_2 t})        (8.4)

and the ratio of the activities is

    \frac{\lambda_2 N_2(t)}{\lambda_1 N_1(t)} = \frac{\lambda_2}{\lambda_2 - \lambda_1} [1 - e^{-(\lambda_2 - \lambda_1)t}]        (8.5)

In the present case the parent, radium, has a relatively long half-life
(1622 years), and so we may ignore λ_1 compared to λ_2. We then obtain

    N_2(t) = N_1(t) \frac{\lambda_1}{\lambda_2} (1 - e^{-\lambda_2 t})        (8.5a)

From this relation we see that the growth of radon from radium will
exhibit a time constant characteristic of the decay of radon, as observed
in Fig. 8.5.
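The growth of radon in a radium source can be evaluated directly from Eqs. (8.4) and (8.5). The sketch below is illustrative only: the function names are not from the text, the half-lives are the rounded values quoted in this section (1622 years and 3.8 days), and a year is taken as 365.25 days.

```python
import math

DAYS_PER_YEAR = 365.25  # assumed conversion


def daughter_atoms(n1_0, lam1, lam2, t):
    """Eq. (8.4): daughter atoms at time t, none present at t = 0 (lam1 != lam2)."""
    return n1_0 * lam1 / (lam2 - lam1) * (math.exp(-lam1 * t) - math.exp(-lam2 * t))


def activity_ratio(lam1, lam2, t):
    """Eq. (8.5): daughter activity divided by parent activity at time t."""
    return lam2 / (lam2 - lam1) * (1.0 - math.exp(-(lam2 - lam1) * t))


lam_ra = math.log(2) / (1622 * DAYS_PER_YEAR)  # radium, per day
lam_rn = math.log(2) / 3.8                     # radon, per day

for days in (3.8, 7.6, 30.0):
    print(f"{days:5.1f} days: radon/radium activity = {activity_ratio(lam_ra, lam_rn, days):.4f}")
```

Because λ_1 is so much smaller than λ_2, the ratio is very nearly 1 − e^{−λ_2 t}: it reaches one-half after one radon half-life and is within half a percent of unity after a month.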

When λ_2 > λ_1 (as in this case), a state of transient equilibrium is
reached when the exponential of Eq. (8.5) becomes negligible. After a
time long compared to 1/(λ_2 − λ_1), the number of daughter nuclei in
equilibrium with the parent approaches the constant value N_2 = N_1 λ_1/λ_2;
thus

    N_2 \lambda_2 = N_1 \lambda_1        (8.6)


This relation, which is general for any radioactive chain provided only 
that the system has been undisturbed for a time long compared with the 
half-life of any of the products, expresses the fact that at equilibrium 
the rate of decay of any radioactive product is equal to its rate of pro- 
duction from the previous member of the chain. A system which has 
reached this condition is said to be in secular equilibrium.

The picture of the radioactive chains which emerged from Ruther- 
ford and Soddy’s considerations is the following. A radioactive atom 
disintegrates by the emission of an α particle, thereby losing four units
of mass and changing its chemical properties. The product then further
disintegrates either by α or β emission, each disintegration leading to a
different element, until finally a stable substance is formed, ending the
chain. A little later, when the importance of the nuclear charge in
determining the chemical behavior of an element was realized, it was
established that the α transformation removes four units of mass and
two units of charge, producing an element two steps down in the periodic
table, while the β disintegration removes one negative charge and
essentially no mass, producing an element one step higher in the table. This
picture has found confirmation in studies of the chemical properties of the 
various products. Although the amounts of material available for such 
studies are ordinarily too minute for conventional analytical techniques, 
the fact that the substances are radioactive makes it possible to carry 
out the usual chemical manipulations and to trace the material under 
investigation through the various stages by means of the radiation 
emitted. Thus, Rutherford and his coworkers were able in 1903 to
determine the chemical and physical properties of radon gas, including
its condensation temperature, with samples amounting to small fractions
of a microgram.
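The bookkeeping of charge and mass described above (an α transformation lowers Z by 2 and A by 4; a β disintegration raises Z by 1 and leaves A unchanged) is easy to mechanize. The helper names below are hypothetical and serve only as a sketch of that rule.

```python
def apply_decays(z, a, n_alpha=0, n_beta=0):
    """Return (Z, A) after n_alpha alpha decays and n_beta beta decays."""
    return z - 2 * n_alpha + n_beta, a - 4 * n_alpha


def count_decays(z_start, a_start, z_end, a_end):
    """Return (n_alpha, n_beta) needed to connect two members of one chain."""
    mass_lost = a_start - a_end
    if mass_lost < 0 or mass_lost % 4 != 0:
        raise ValueError("mass numbers must differ by a non-negative multiple of 4")
    n_alpha = mass_lost // 4
    n_beta = z_end - (z_start - 2 * n_alpha)
    if n_beta < 0:
        raise ValueError("end charge is too low to be reached by alpha and beta decay")
    return n_alpha, n_beta


# Uranium series: U-238 (Z = 92) to Pb-206 (Z = 82).
print(count_decays(92, 238, 82, 206))               # (8, 6)
print(apply_decays(92, 238, n_alpha=8, n_beta=6))   # (82, 206)
```

Since only α decay changes the mass number, and always by four units, the mass numbers within one chain all leave the same remainder on division by 4.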


b. Radioactive Isotopes Further investigations of the radioactive 
transformations led to the identification of a great number of such 
successive transformations and to the discovery of many new radioactive 
substances. A particularly important result of this work was the observa- 
tion for the first time of isotopes, atoms having identical chemical prop- 
erties but differing in atomic weights. The discovery of isotopes came 
incidentally to a search for the origin of radium. The fact that radium 
has a half-life which is short on a geological time scale suggested that 
it must be a product of some longer-lived material, probably uranium, 
since radium is always found in uranium minerals. In the course of a
search for this connection, Boltwood discovered in 1906 a long-lived
substance which appeared to be the intermediate between uranium and 
radium. The interesting feature of his report was that this substance, 
which he named ionium, appeared to be chemically similar to thorium. 
The matter was followed up by Marckwald and Keetman in 1909, and 
later by von Welsbach, with the astonishing result that ionium and 
thorium are chemically identical, although their radioactive properties 
are quite different. Within a year several other such cases were found, 
and it was established that the difference in radioactive properties was 
associated with a difference in atomic weight. The importance of these 
findings was emphasized by Soddy in 1910, when he pointed out that the 
existence among radioactive elements—and, as he presumed to be the 
case, among stable elements as well—of atoms having various masses 
but identical chemical properties provided a natural explanation of the 
nonintegral atomic weights of certain elements and reopened the ques- 
tion of whether all atoms might not be built out of some common 
constituents. 

The hypothesis that the elements are constructed from hydrogen 
atoms had been advanced by the English chemist Prout in 1815, some 
10 years after Dalton’s formulation of the atomic hypothesis, from the 
observation that many atomic weights appeared to be integral multiples 
of that of hydrogen. Prout’s suggestion fell into disrepute, however, 
when with more precise measurements it developed that there were 
deviations from the whole-number relation in many cases. Finally, 
when the atomic weight of chlorine (35.5) was shown to deviate by a full
half-integer, further discussion of the point was generally abandoned. 
With the discovery of isotopes, such deviations could be accounted for 
by assuming an appropriate mixture, and one could again consider the
possibility of some elemental constituents of atoms. The masses of 
isotopes are in fact very nearly (though not quite) integral multiples of 
a single number; there is now good evidence that the hydrogen nucleus 
(not the atom) is one of the building blocks of nuclei. 

In modern terminology, each nucleus is characterized by a definite 
charge number Z, a multiple of the protonic charge, and a definite mass 
number A. The mass number is the integer nearest the actual isotopic 
mass, expressed in a scale where the mass of carbon 12 equals 12 exactly. 
The nuclear charge determines the number of electrons in the neutral 
atom and hence the gross atomic properties, but such specifically nuclear 
properties as radioactivity depend just as much on A as on Z. A nuclear
species with a given Z and A is called a nuclide; nuclides with the same
Z and different A are isotopes, and those with the same A but different
Z are called isobars. It is general practice to represent each nuclide
by a symbol such as _{88}Ra^{226}, in which the letters refer to the chemical
name, the subscript gives the atomic number, and the superscript the
mass number. The subscript is redundant and is often omitted.


c. Radioactive Chains Among the heavy elements there are four 
independent radioactive chains (Table 8.1). The first to be investigated 
starts with U^{238} and ends with Pb^{206}. The mass number changes only
by four units in each α decay; consequently all members of the uranium


Table 8.1 The four radioactive series 


Thorium             Neptunium           Uranium             Actinium
(4n) series         (4n + 1) series     (4n + 2) series     (4n + 3) series

_{92}U^{236}        _{93}Np^{237}       _{92}U^{238}        _{94}Pu^{239}
α 2.39 × 10^7 y     α 2.25 × 10^6 y     α 4.51 × 10^9 y     α 24,300 y
_{90}Th^{232}       _{91}Pa^{233}       _{90}Th^{234}       _{92}U^{235}
α 1.39 × 10^10 y    β 27.4 d            β 24.5 d            α 7.07 × 10^8 y
_{88}Ra^{228}       _{92}U^{233}        _{91}Pa^{234}       _{90}Th^{231}
β 6.7 y             α 1.63 × 10^5 y     β 1.14 m            β 24.6 h
_{89}Ac^{228}       _{90}Th^{229}       _{92}U^{234}        _{91}Pa^{231}
β 6.13 h            α 7000 y            α 2.33 × 10^5 y     α 3.2 × 10^4 y
_{90}Th^{228}       _{88}Ra^{225}       _{90}Th^{230}       _{89}Ac^{227}, 21 y
α 1.90 y            β 14.8 d            α 8.3 × 10^4 y      α 1.2%, β 98.8%
_{88}Ra^{224}       _{89}Ac^{225}       _{88}Ra^{226}       _{87}Fr^{223}, _{90}Th^{227}
α 3.64 d            α 10.0 d            α 1622 y            21 m, α 18.9 d
_{86}Rn^{220}       _{87}Fr^{221}       _{86}Rn^{222}       _{88}Ra^{223}
= 99.99 % | 60.013 % a 0.020 s a 99.97 % B 0.03 % a 3.92s
Pb: Atie Bin 47m Pb Avs Pot 1830 us 
810.6h | «3004s | a2% | 898% B 26.8 m al3s | «100—-% |85 xX 10-*% 
Bin: 60.5 m Tie Poni Biz 19.7 m Pb Aus 
a337% | 866.3% | 622m | a4.2 pus a 0.04% 699.96 % B 36.1 m a10 ps 
Ta Pon: Pb Tit Pom Bian 2.16 m 
B3.1m a 0.3 ps B33h 6132m a 150 us a 99.68% 60.32% 
s2Pb28 s3Biter sxPb210 b Titer Pom
Bino 50d s2Pb207

series have mass number A equal to 4n + 2, where n is an integer. The
last to be studied is the neptunium (4n + 1) series, all members of which
have a half-life so short that only small amounts are found in nature;
consequently they are artificially produced. As found in minerals which
have not been subjected to chemical separation, a given chain is in
secular equilibrium, and the number of atoms of any radioactive product
present is simply proportional to its half-life. For example, the amount
of Ra^{226} per kilogram of U^{238} in any ore is

    \frac{1622 \times 226}{4.51 \times 10^{9} \times 238} = 3.4 \times 10^{-7} kg

In the chemical reduction of the ore, radium is separated both from the
uranium and from all other members of the series, and the secular
equilibrium is destroyed. In the course of a few days, the next product, Rn, is
built up again, and with it, all the short-lived products, Po^{218}, Pb^{214},
Bi^{214}, Po^{214}. The half-life of Pb^{210}, however, is 22 years, so that only
quite old radium samples will contain equilibrium amounts of Pb^{210},
Bi^{210}, and Po^{210}.
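The radium-to-uranium figure quoted above follows from Eq. (8.6): in secular equilibrium the numbers of atoms are in the ratio of the half-lives, and the masses carry an additional factor of the mass numbers. A short sketch of that arithmetic (the function name is illustrative, and mass numbers stand in for atomic weights as in the text):

```python
def equilibrium_mass_ratio(t_half_product, a_product, t_half_parent, a_parent):
    """Mass of a product per unit mass of parent in secular equilibrium.

    Atom numbers scale with half-life (N2/N1 = t2/t1); masses scale with
    atom number times mass number. Both half-lives must share one unit.
    """
    return (t_half_product * a_product) / (t_half_parent * a_parent)


# Ra-226 (1622 y) in equilibrium with U-238 (4.51e9 y), per kilogram of uranium.
ra_per_kg_u = equilibrium_mass_ratio(1622, 226, 4.51e9, 238)
print(f"{ra_per_kg_u:.2e} kg of radium per kg of uranium")
```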


d. Half-life of Radium The half-life of such a long-lived substance 
as radium is rather difficult to determine directly; instead, the disinte- 
gration constant is measured by counting the number of disintegrations 
per second from a known number of Ra atoms. In the early experiments 
of Rutherford and Geiger, the number of α particles emitted per second
from the Po^{214} in equilibrium with the radon drawn off a known quantity
of radium was counted (α particles from the other members of the chain
have lower energies and can be cut out by an absorbing foil). In a
later measurement the number of α particles emitted per second by a
thin weighed deposit of freshly separated radium was determined directly
and the value corrected for the growth of the decay products during
the experiment. The number of disintegrations obtained was
(3.608 ± 0.028) × 10^{10} α particles per second per gram, giving

    \lambda = \frac{3.608 \times 10^{10} \times 226.096}{6.0247 \times 10^{23}} = 1.354 \times 10^{-11} s^{-1}

and t_{1/2} = 0.693/λ = 1622 ± 13 years. The standard unit of
radioactivity is the curie, formerly defined as the amount, i.e., the activity,
of radon in equilibrium with 1 g of radium. With the general
application of the term in recent years to activities of other substances, the
definition of the curie has been changed to mean an activity of exactly
3.7 × 10^{10} disintegrations per second. Another unit, the rutherford,
which was proposed to mean exactly 10^6 disintegrations per second, has
failed of general adoption.
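The half-life of radium can be recomputed from the counting data quoted above. In this sketch the constants are the ones given in the text; the length of the year (365.25 days) is an assumption introduced here for the unit conversion.

```python
import math

ALPHAS_PER_SECOND_PER_GRAM = 3.608e10   # measured specific activity of radium
ATOMIC_WEIGHT_RA = 226.096              # grams per mole, as used in the text
AVOGADRO = 6.0247e23                    # value used in the text
SECONDS_PER_YEAR = 365.25 * 86400.0     # assumed conversion

atoms_per_gram = AVOGADRO / ATOMIC_WEIGHT_RA
lam = ALPHAS_PER_SECOND_PER_GRAM / atoms_per_gram      # activity = lambda * N
half_life_years = math.log(2) / lam / SECONDS_PER_YEAR

print(f"lambda = {lam:.3e} per second")
print(f"half-life = {half_life_years:.0f} years")
print(f"1 g of radium = {ALPHAS_PER_SECOND_PER_GRAM / 3.7e10:.3f} curie (modern definition)")
```

The last line shows why the redefinition mattered little in practice: a gram of radium is within a few percent of one curie under the fixed value of 3.7 × 10^{10} disintegrations per second.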


e. Age of Minerals The half-life of U^{238}, determined by the
procedure just described, is 4.51 × 10^9 years. As it is reasonably certain
that U^{238} is not itself derived from a longer-lived material, it would
appear that this substance (and presumably all other elements as well)
was formed not very many times this number of years ago. It is
possible to make fairly accurate determinations of the age of
uranium-containing minerals by several independent methods. To illustrate
the principle involved, we consider the production of helium from U^{238}.
For each atom of U^{238} which has decayed since the mineral was formed,
eight helium atoms have been produced, presumably remaining occluded
in the mineral. If the number of atoms of U^{238} originally present was
N_0 and the number present at time t is N, the number of helium atoms
produced is

    N(He) = 8(N_0 - N) = 8N(e^{\lambda t} - 1)

where λ is the disintegration constant of U^{238}. A measurement of the
ratio of helium to U^{238} content then permits calculation of t. In practice
it is of course necessary to take into account the production of He by
other radioactive materials present, such as U^{235} and Th^{232}. Also the
loss of helium is a serious problem in rocks containing high concentrations
of uranium. Other methods, involving measurement of the isotopic
composition of the lead in uranium-containing minerals, are less subject
to this difficulty. The age of the earth, i.e., the time which has elapsed
since the oldest known minerals were formed, as estimated from such
determinations, is 4.5 × 10^9 years.
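Solving the helium relation N(He) = 8N(e^{λt} − 1) for the age gives t = (1/λ) ln[1 + N(He)/8N]. The sketch below applies it under the idealizations stated in the text (all helium retained, no other helium sources); the function name and the sample ratio are illustrative, not measured data.

```python
import math

U238_HALF_LIFE_YEARS = 4.51e9   # value quoted in the text
HELIUM_PER_U238_DECAY = 8       # alpha particles from U-238 down to Pb-206


def helium_age_years(helium_atoms_per_u238_atom):
    """Age from the present-day ratio N(He)/N(U-238), assuming no helium loss."""
    lam = math.log(2) / U238_HALF_LIFE_YEARS
    return math.log(1.0 + helium_atoms_per_u238_atom / HELIUM_PER_U238_DECAY) / lam


# Hypothetical mineral containing 1.3 helium atoms per remaining U-238 atom.
print(f"{helium_age_years(1.3):.2e} years")
```

Any helium that has escaped lowers the measured ratio, so an age obtained this way is a lower limit.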


Appendix 8A - The Rutherford Scattering Law 


While the earliest application of Rutherford scattering was for α particles,
the same theory has found wide use in treating the scattering of charged
particles, such as protons and electrons, by the electrostatic fields of
other charged centers. Although we treat below the scattering of α
particles by nuclei, the equations developed are applicable to coulomb
scattering for other particles. Let the charge on the incident particle
be ze and its mass m, while the nucleus bears charge Ze and has a mass
so much greater than the incident particle that, for a first
approximation, it may be assumed to remain at rest. Let the α particle approach
at speed v_0 along a line passing at a distance b from the nucleus; its energy
is thus m v_0^2/2, and its angular momentum about the nucleus at O is m v_0 b.
Let r and θ be polar coordinates for the α particle with origin at O, θ
being measured from a line drawn from O in a direction toward the
distantly approaching particle (Fig. 8.3b), and let \dot{r} and \dot{\theta} be the time
derivatives of r and θ respectively.

Then the particle has perpendicular components of velocity \dot{r} and
r\dot{\theta}, and its angular momentum about O at any time is m r^2 \dot{\theta}. It also has
potential energy due to repulsion by the nucleus of magnitude zZe^2/4πε_0 r
and initially zero. Thus, adding the kinetic energy, we have from the
laws of conservation of energy and of angular momentum

    \frac{1}{2} m (\dot{r}^2 + r^2 \dot{\theta}^2) + \frac{zZe^2}{4\pi\epsilon_0 r} = \frac{1}{2} m v_0^2        (8A.1)

    m r^2 \dot{\theta} = m v_0 b        (8A.2)


By eliminating d\theta/dt it follows that

    \dot{r} = \frac{dr}{dt} = -v_0 \left(1 - \frac{2q}{r} - \frac{b^2}{r^2}\right)^{1/2}        (8A.3a)

where

    q = \frac{zZe^2}{4\pi\epsilon_0 m v_0^2}        (8A.3b)

in which the negative sign is chosen because, during the approach,
dr/dt < 0. Dividing this into Eq. (8A.2) and noting that \dot{\theta}/\dot{r} = d\theta/dr,
we have

    \frac{d\theta}{dr} = -\frac{b}{r^2} \left(1 - \frac{2q}{r} - \frac{b^2}{r^2}\right)^{-1/2}        (8A.4)

The integral of this equation that vanishes at r = ∞, as is easily verified
by substitution in Eq. (8A.4), is

    \theta = \sin^{-1}\frac{q + b^2/r}{\sqrt{b^2 + q^2}} - \cos^{-1}\frac{b}{\sqrt{b^2 + q^2}}

At the point of closest approach to the nucleus, dr/dt = 0. Hence, by
(8A.3a), the radical must vanish; then θ = θ_0 = π/2 − cos^{-1} (b/\sqrt{b^2 + q^2}).
The second half of the path, as the particle recedes, is symmetrical with
the first half. Hence, the total increase in θ is 2θ_0; and the final
deflection, or the total change in the direction of motion of the particle, is

    \phi = \pi - 2\theta_0 = 2 \cos^{-1}\frac{b}{\sqrt{b^2 + q^2}}        (8A.5a)

therefore

    b = q \cot\frac{\phi}{2}        (8A.5b)
From Eq. (8A.5b) we see that the scattering angle depends on the
impact parameter b, with large-angle scattering associated with small b.
If the impact parameter is increased to b + db, the scattering angle φ
changes by dφ where, by Eq. (8A.5b),

    db = -\frac{q}{2 \sin^2 (\phi/2)}\,d\phi        (8A.6)
To find the statistical distribution of the α particles scattered by
a foil of thickness t which contains n nuclei per unit volume, let N_0
α particles per unit area approach the foil along parallel paths normal
to the foil. Then the number of incident particles per unit area which
are incident along lines passing at a distance between b and b + db of some
nucleus is N_0(2πb db)nt, provided the foil is thin enough. These
particles are scattered through angles between φ and φ + dφ. Hence the
number of α particles per unit area scattered into the angular cone
between φ and φ + dφ is

    dN = N(\phi)\,d\phi = N_0 n t \pi \left(\frac{zZe^2}{4\pi\epsilon_0 m v_0^2}\right)^2 \frac{\cos (\phi/2)}{\sin^3 (\phi/2)}\,d\phi        (8A.7)


In an experiment one typically measures the number of particles
scattered at angle φ which strike a detector of area A a distance R from
the small area element of a scattering foil. It is assumed that the
useful foil is small enough for all points to be essentially equidistant
from the detector. Let the number N_i represent the number of α particles
incident on the element of foil; clearly, N_i is N_0 times the effective area
of the foil. Of the particles scattered between φ and φ + dφ the detector
intercepts the fraction A/[(2πR sin φ)(R dφ)] (Fig. 8A.1). From Eq.
(8A.7) and the fact that sin φ = 2 sin (φ/2) cos (φ/2), we find the
number N_d of α particles reaching the detector to be

    N_d = \frac{N_i n t A}{4R^2} \frac{z^2 Z^2 e^4}{16\pi^2 \epsilon_0^2 m^2 v_0^4} \frac{1}{\sin^4 (\phi/2)}        (8A.8)

Fig. 8A.1
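Equation (8A.8) can be put to work on the data of Problem 10 below. The sketch writes the charge factor as q^2, with q = zZe^2/(4πε_0 · 2K), and makes several assumptions that are not in the text: e^2/4πε_0 ≈ 1.44 MeV·fm, a gold density of 19.3 g/cm^3, an atomic weight of 197, and a foil thickness of 0.6 μm (the unit is illegible in the problem statement; this choice is assumed because it reproduces the quoted answers).

```python
import math

COULOMB_MEV_M = 1.44e-15   # e^2 / (4 pi eps0) in MeV m (rounded, assumed)
AVOGADRO = 6.022e23        # per mole


def detector_rate(n_incident, n_nuclei, thickness, area, distance,
                  z, Z, kinetic_energy_mev, phi_deg):
    """Eq. (8A.8) with m v0^2 = 2K; lengths in meters, result in units of n_incident."""
    q = z * Z * COULOMB_MEV_M / (2.0 * kinetic_energy_mev)
    s = math.sin(math.radians(phi_deg) / 2.0)
    return (n_incident * n_nuclei * thickness * area * q ** 2
            / (4.0 * distance ** 2 * s ** 4))


# Assumed gold properties: density 19.3e3 kg/m^3, 0.197 kg/mol.
n_gold = 19.3e3 / 0.197 * AVOGADRO   # nuclei per cubic meter

rates = {
    phi: detector_rate(4e5, n_gold, 0.6e-6, 1.2e-4, 0.6, 2, 79, 4.79, phi)
    for phi in (30, 60, 90, 135)
}
for phi, rate in rates.items():
    print(f"phi = {phi:3d} deg: {rate:.2e} counts per second")
```

The steep 1/sin^4 (φ/2) dependence means the counting rate falls by more than two orders of magnitude between 30° and 135°.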


Problems 


1. (a) How many α and β particles are emitted in the radioactive
chain starting with _{94}Pu^{242} and ending with _{82}Pb^{206}?
Ans: 9, 6
(b) If one starts with _{93}Np^{237}, what nuclide remains after six α and
three β decays?
Ans: _{84}Po^{213}

2. The half-life of _{83}Bi^{210} is 5.0 days. What is the disintegration
constant? What fraction of an isolated sample remains after 2.0 days?
10 days? 30 days? What is the mean life of a Bi^{210} atom?
Ans: 0.139 day^{-1}; 0.76; 0.25; 0.015; 7.2 days

3. Calculate the distance of closest approach of a 5-MeV α particle
in a head-on collision with a gold nucleus. If the Au nucleus has a
radius of about 7 × 10^{-15} m, find the minimum energy which an α
particle must have to “touch” the Au nucleus if the α particle has a
radius of about 2 × 10^{-15} m. How might one tell experimentally that
α particles have come close enough to the Au nucleus to bring nuclear
forces into play?
Ans: 4.6 × 10^{-14} m; 25 MeV


4. If an α particle of kinetic energy K makes a head-on elastic collision
with a lead nucleus initially at rest, calculate the recoil energy of the Pb
nucleus as a fraction of K. What is the distance of closest approach,
assuming that K is not large enough to permit the α particle to touch the
Pb nucleus? (Remember that at the distance of closest approach the
center of mass of the system has kinetic energy.) How does this distance
compare to that one calculates if the Pb nucleus remains at rest?

5. Show that if a given nucleus, for example, Po^{218} in Table 8.1, can
decay in either of two processes with disintegration constants λ_1 and λ_2,
N(t) = N_0 e^{-(λ_1 + λ_2)t} and the half-life is given by (ln 2)/(λ_1 + λ_2).

6. If one starts with a sample of pure Bi^{211} (see Table 8.1) at t = 0,
find the time at which the β activity of the daughter Tl^{207} will attain
its maximum value. (Ignore the fact that 0.32 percent of Bi^{211} does
not decay to Tl^{207}.)


7. If a sample contains a mixture of two radioactive isotopes, for
example, F^{20} and F^{17}, of the same element, show that the ratio r of the
activity of the shorter-lived isotope to that of the longer-lived is
r = r_0 e^{-(λ_1 − λ_2)t}, where λ_1 is the disintegration constant of the
shorter-lived isotope.

8. Radon for medical purposes is pumped off from 5 g of radium every
48 hours. Calculate how many curies of Rn are obtained from each
pumping.
Ans: 1.5 curies

9. Show that the fraction of the incident α particles scattered through
an angle greater than φ by a thin foil is given by

    \pi n t \left(\frac{zZe^2}{4\pi\epsilon_0 m v_0^2}\right)^2 \cot^2 \frac{\phi}{2}

10. α particles from a Ra^{226} source with energy 4.79 MeV are scattered
from gold film 0.6 μm thick. If a collimated beam of 4 × 10^5 α particles
per second is incident, the detector screen has an area of 1.2 cm^2, and
the detector is 0.6 m from the foil, calculate the expected counting
rate for scattering angles of 30, 60, 90, and 135° in a vacuum.
Ans: 0.15, 0.011, 0.0027, 9.2 × 10^{-4} s^{-1}

11. (a) In the Thomson atom the positive charge was distributed
uniformly over a sphere of radius R, the atomic radius. Show that the
electric intensity inside such a sphere is Zer/4πε_0 R^3 when r < R.

(b) Show that an electron at the center of this sphere is in stable
equilibrium and, if displaced, describes simple harmonic motion of
frequency

    \nu = \frac{1}{2\pi} \sqrt{\frac{Ze^2}{4\pi\epsilon_0 m R^3}}

(c) Calculate this frequency for hydrogen and compare it to the
highest frequency of the hydrogen spectrum.
Ans: 6.6 × 10^{15} Hz, about double the frequency of the Lyman series
limit

12. Prove that the maximum possible deflection of an α particle by a
Thomson atom is on the order of 10^{-4} rad by considering (a) the
interaction with one of the electrons in the atom (Hint: Show that the
maximum speed the electron could get in an elastic collision is twice the
speed of the α particle and calculate the deflection of the α if the electron
could have this speed normal to the path of the α) and (b) the
interaction with the positive charge spread uniformly over a sphere of 1 Å
radius. Hint: If R is the radius, the maximum force occurs when the
α particle is a distance R from the center of the atom; calculate the
transverse momentum given the α particle if this maximum force acted
during the entire time the particle needed to go a distance 2R.


13. An example of transient equilibrium occurs in the decay series
_{88}Ra^{228} \xrightarrow{\beta} _{89}Ac^{228} \xrightarrow{\beta} _{90}Th^{228}, in which the
half-lives are respectively 6.7 years, 6 hours, and 1.9 years. Show that,
when viewed on a time scale measured in years, the half-life of the
Ac^{228} is so short that its activity is always equal to that of the
Ra^{228}. If one starts with a sample of pure Ra^{228}, show that the
activity of the Th becomes equal to that of the Ra in 4.8 years. Verify
that the activity of the Th is maximum at 4.8 years. Show that the
activity ratio of the Th to the Ra is 1.39 when transient equilibrium is
achieved.

14. If there exists a series of radioactive decays arising from an initially
pure parent isotope, extend the discussion of Sec. 8.6 to show that

    N_3(t) = N_1(0) \lambda_1 \lambda_2 \left[ \frac{e^{-\lambda_1 t}}{(\lambda_2 - \lambda_1)(\lambda_3 - \lambda_1)} + \frac{e^{-\lambda_2 t}}{(\lambda_1 - \lambda_2)(\lambda_3 - \lambda_2)} + \frac{e^{-\lambda_3 t}}{(\lambda_1 - \lambda_3)(\lambda_2 - \lambda_3)} \right]


chapter nine 


Spectral Lines and the Bohr Model 


The Rutherford nuclear atom, proposed in 1911, was a great step forward.
With the existence of electrons well established and with convincing
evidence from the Zeeman effect that electrons were intimately involved in
the emission of spectral lines from excited atoms, physicists inevitably
sought a model of the atom in terms of which one might understand atomic
spectra and the electron distribution. In 1913 Bohr suggested a specific
model of the hydrogen atom which, in spite of certain failings, proved to
be a major step toward an understanding of atomic structure. But before
Bohr’s theory is discussed, it is desirable to know some pertinent facts
regarding atomic spectra.


9.1 The Balmer Series 


The foundations of spectral analysis were laid by Kirchhoff and Bunsen 
about 1859. In the next quarter century a wealth of unexplained data
on the spectral lines emitted by excited elements was accumulated.
As soon as dependable wavelength measurements became available, 
numerous investigators, reasoning from the analogy of overtones in 
acoustics, sought for harmonic relations in the lines found in the spectrum 
of a given element. This search proved fruitless, but certain relations 
of a different type were discovered. Liveing and Dewar, about 1880, 
emphasized the physical similarities occurring in the spectra of such 
elements as the alkali metals. They called attention to the successive
[...] suggesting some kind of series relation, which, [...] unable to
discover. In 1883, Hartley discovered an [...] relationship between the
components of doublets or triplets in the spectrum [...]. If frequencies,
instead of wavelengths, are [...] difference in frequency between the
components [...] or triplet, in a particular spectrum is the same [...]
lines in that spectrum. [...] marked by crosses. This same [...]
overlapping series of triplets, which are designated by circles. Sorting
out these related lines required a great deal of diligent and patient
study.

Fig. 9.1 Some lines in the ultraviolet spectrum of zinc (horizontal
axis: wavelength, Å). The lines marked × belong to one series of
triplets; lines marked ○ belong to another series.

The beginning of our knowledge of spectral-series formulas dates
from the discovery by Balmer, in 1885, that the wavelengths of the nine
then known lines in the spectrum of hydrogen could be expressed very
closely by the simple formula

    \lambda = 3645.6 \frac{n^2}{n^2 - 4} Å        (9.1)

where n is a variable integer which takes on the successive values 3, 4,
5, . . . for, respectively, the first (beginning at the red), second,
third, . . . line in the spectrum. Figure 9.2b and c shows the Balmer
lines of hydrogen.

Fig. 9.2 (a) The spectrum of atomic hydrogen consists of the Lyman series in
the ultraviolet, the Balmer series in the visible, and several series in the infrared
region. (b) The Balmer series in greater detail. (c) A portion of the spectrum
of the star ζ Tauri showing more than 20 lines of the Balmer series. (From A. W.
Smith and J. N. Cooper, “Elements of Physics,” 7th ed. Copyright 1964.
McGraw-Hill Book Company. Used by permission.)
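A short calculation shows how the Balmer lines crowd toward a limit as n increases. The sketch assumes the usual form of Balmer's formula, λ = b n^2/(n^2 − 4), with the constant b = 3645.6 Å; both the function name and the value of the constant are assumptions stated here rather than taken from a legible line of the text.

```python
BALMER_CONSTANT_ANGSTROM = 3645.6  # assumed value of Balmer's constant


def balmer_wavelength(n):
    """Wavelength in angstroms of the Balmer line with integer n >= 3."""
    if n < 3:
        raise ValueError("Balmer lines require n >= 3")
    return BALMER_CONSTANT_ANGSTROM * n ** 2 / (n ** 2 - 4)


for n in range(3, 12):
    print(f"n = {n:2d}: {balmer_wavelength(n):7.1f} A")
print(f"series limit (n -> infinity): {BALMER_CONSTANT_ANGSTROM:.1f} A")
```

The first line falls near 6562 Å, in the red, and successive lines converge on the constant itself, which is the series limit in the near ultraviolet.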

In 1906 Lyman studied the ultraviolet spectrum of hydrogen and 
found a series which now bears his name, and 2 years later Paschen 
found a hydrogen series in the near infrared. These series are indicated 
in Fig. 9.2a. Formulas similar to that for the Balmer series predict the
positions of the lines of these series and will be developed from the
Bohr theory in Sec. 9.4. 

The impetus which Balmer’s discovery gave to work in spectral 
series is another illustration of the highly convincing nature of relations
which are expressible in quantitative form. Soon after the publication 
of his work, intensive investigations in spectral series were initiated by 
Kayser and Runge and by Rydberg. It soon became evident that 
frequency v is more fundamental than wavelength. We do not, however, 
measure frequencies directly; laboratory measurements yield wave- 
lengths. Therefore it is customary in spectroscopy to use, instead of 
the frequency itself, the wavenumber, or number of waves per unit length 
in vacuum, which we shall denote by ý. Thus = 1/Aya. (The wave- 
length in air is less than that in vacuum for visible light by somewhat 
less than 1 part in 3000.) For more than a century it has been customary 
to express wavenumbers in waves per cenlimeler, and we shall follow 
this practice. 


9.2 Spectral Series and Their Interrelations 


In announcing his discovery, Balmer had raised the question whether
his formula might be a special case of a more general formula applicable
to other series of lines in other elements. Rydberg set out to find such a
formula. Using the comparatively large mass of wavelength data then
available for alkalis, Rydberg isolated other series of doublets of
constant frequency difference, according to Hartley’s law of constant
wavenumber separation. In all cases, these series showed a tendency to
converge to some limit in the ultraviolet. He could distinguish two
types of such series: a type in which the lines were comparatively sharp,
which he called, therefore, sharp series; and a type which, because the
lines were comparatively broad, he called diffuse series. Both types
occurred in the arc spectrum of the same element. In many arc spectra,
he found also a third type of series in which the doublet spacing decreased
as the frequency or ordinal number of the line increased, as if tending
to vanish at the convergence limit of the series; these he called principal
series, because they commonly contained the brightest and most
persistent lines in the spectrum. By “line” in this connection is meant
a complex of actual lines forming a doublet, which is seen as a single
line in a spectroscope of moderate resolving power. The chief lines of
the principal, sharp, and diffuse series of sodium are plotted in Fig. 9.3,
marked p, s, d, respectively.


Fig. 9.3 Lines in the spectrum of neutral sodium, plotted against
wavelength in Å (9000 to 2000 Å): p = principal, s = sharp,
d = diffuse series. Dotted lines indicate series limits. The first line of
the sharp series at 11,393 Å is not shown.


Eventually Rydberg found that many observed series could be
represented closely by an equation of the form

$$\tilde{\nu}_m = \tilde{\nu}_\infty - \frac{R}{(m + \mu)^2} \tag{9.2}$$

$\mu$ and $\tilde{\nu}_\infty$ being constants which vary from one series to another.
Obviously, by properly choosing the ordinal number m of the lines, $\mu$ can
always be made less than 1 in such a formula. The constant $\tilde{\nu}_\infty$ represents
the high-frequency limit to which the lines in the series ultimately
converge. The Balmer formula is a special case of this more general
Rydberg formula; for Eq. (9.1) can be written

$$\tilde{\nu} = \frac{1}{\lambda} = \frac{1}{b} - \frac{4}{b\,m^2} = \tilde{\nu}_\infty - \frac{R}{m^2} \tag{9.2a}$$

with $\tilde{\nu}_\infty = 1/b$, $R = 4/b$, m = 3, 4, 5, . . . , b being the constant
of Eq. (9.1). This is of the type of Eq. (9.2) with $\mu = 0$.
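The algebra that turns Balmer's formula into the Rydberg form can be checked numerically. The sketch below assumes b = 3645.6 Å for the Balmer constant and confirms that 1/λ from Eq. (9.1) agrees with the form of Eq. (9.2a), taking the series limit as 1/b and R as 4/b; all names are illustrative.

```python
# Assumption: Balmer constant b = 3645.6 angstroms, converted to centimeters.
B_CM = 3645.6e-8
NU_LIMIT = 1.0 / B_CM    # series limit in cm^-1
R_FROM_B = 4.0 / B_CM    # Rydberg constant implied by b, in cm^-1


def wavenumber_from_balmer(m: int) -> float:
    """Wavenumber (cm^-1) obtained by inverting the Balmer wavelength."""
    wavelength_cm = B_CM * m**2 / (m**2 - 4)
    return 1.0 / wavelength_cm


def wavenumber_from_rydberg(m: int) -> float:
    """Wavenumber (cm^-1) from the Rydberg form: limit minus R / m**2."""
    return NU_LIMIT - R_FROM_B / m**2


if __name__ == "__main__":
    print(f"R implied by b: {R_FROM_B:.1f} cm^-1")
    for m in range(3, 7):
        print(m, wavenumber_from_balmer(m), wavenumber_from_rydberg(m))
```

The implied R is about 109,721 cm⁻¹. It differs slightly from the spectroscopic value for hydrogen, as expected when b is taken from wavelengths measured in air.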

The constant R in Eq. (9.2), now called the Rydberg constant, was 
found to have the same value for a large group of series for each substance, 
and very nearly the same for all substances. Its slight variation from one 
atom to another is now known to be due to differences in the atomic 
weight, the effect of which can be calculated theoretically (Sec. 9.4). 
It is thus possible to calculate what R would be for an atom of infinite
mass; this value is denoted by $R_\infty$, the value for a particular kind of
atom being denoted by another appropriate suffix. The values as found
from spectroscopic observation are:


For hydrogen atoms, H¹
    $R_H$ = 109,677.58 cm⁻¹ = 1.0967758 × 10⁷ m⁻¹
For deuterium atoms, H²
    $R_D$ = 109,707.42 cm⁻¹
For helium, He⁴
    $R_{He}$ = 109,722.27 cm⁻¹
For atoms of infinite mass
    $R_\infty$ = 109,737.31 cm⁻¹
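The dependence of R on nuclear mass can be illustrated with the reduced-mass correction developed in Sec. 9.4, which gives R for a nucleus of mass M as $R_\infty/(1 + m/M)$. The sketch below applies it to the three nuclei listed. The nuclear masses, in units of the electron mass, are assumed present-day values and are not taken from the text.

```python
R_INF = 109_737.31  # cm^-1, value for infinite nuclear mass quoted in the text

# Assumed nuclear masses in units of the electron mass (present-day values).
NUCLEAR_MASS = {
    "H1": 1836.153,   # proton
    "H2": 3670.483,   # deuteron
    "He4": 7294.300,  # helium-4 nucleus
}


def rydberg_for_mass(mass_ratio: float) -> float:
    """Rydberg constant (cm^-1) for a nucleus mass_ratio times the electron mass."""
    return R_INF / (1.0 + 1.0 / mass_ratio)


if __name__ == "__main__":
    for name, mass in NUCLEAR_MASS.items():
        print(f"{name}: {rydberg_for_mass(mass):.2f} cm^-1")
```

Under these assumptions the computed values agree with the tabulated spectroscopic ones to within a few hundredths of a cm⁻¹.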




9.3 Spectral Terms 


Rydberg noticed also several remarkable relationships between different
spectral series belonging to the same element. He noted that the sharp
and the diffuse series have a common convergence limit. Next, he observed
that this common limit was equal to a term in the formula for the
principal series, viz., the term $R/(m + P)^2$ with m set equal to 1, and that
a similar equality held between the limit of the principal series itself
and the variable term, for m = 1, in the formula for the sharp series.
Because of these remarkable relations, the formulas for the three
series in question could be rewritten in the following form:

$$\tilde{\nu}_p = \frac{R}{(1 + S)^2} - \frac{R}{(m + P)^2} \qquad m = 1, 2, 3, \ldots \tag{9.3a}$$

$$\tilde{\nu}_s = \frac{R}{(1 + P)^2} - \frac{R}{(m + S)^2} \qquad m = 2, 3, \ldots \tag{9.3b}$$

$$\tilde{\nu}_d = \frac{R}{(1 + P)^2} - \frac{R}{(m + D)^2} \qquad m = 2, 3, \ldots \tag{9.3c}$$

Here we have indicated that, at least for the alkali metals, m starts from 1
in the principal series but from 2 in the others.

Two additional relations are apparent from the formulas. If we
set m = 1 in the formula for $\tilde{\nu}_s$, we obtain the same number as the
value of $\tilde{\nu}_p$ for m = 1, but with reversed sign! Furthermore, the
difference between the limit of the principal series and the common limit of the
other two series is just the wavenumber of the first line of the principal
series (the Rydberg-Schuster law, enunciated in 1896 by Rydberg and
independently in the same year by Schuster).
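These relations are algebraic identities of the three series formulas, which a short sketch makes concrete. It writes the principal series as a fixed term with constant S minus a running term with constant P, and the sharp and diffuse series as a fixed term with constant P minus running terms with constants S and D. The values of S, P, and D below are hypothetical, chosen only for illustration and not fitted to any element.

```python
R = 109_737.31  # cm^-1, infinite-mass Rydberg constant quoted in the text

# Hypothetical series constants, for illustration only.
S, P, D = 0.35, 0.85, 0.99


def term(m: int, mu: float) -> float:
    """Spectral term R / (m + mu)**2 in cm^-1."""
    return R / (m + mu) ** 2


def nu_principal(m: int) -> float:
    return term(1, S) - term(m, P)


def nu_sharp(m: int) -> float:
    return term(1, P) - term(m, S)


def nu_diffuse(m: int) -> float:
    return term(1, P) - term(m, D)


LIMIT_PRINCIPAL = term(1, S)       # limit of the principal series
LIMIT_SHARP_DIFFUSE = term(1, P)   # common limit of the sharp and diffuse series

if __name__ == "__main__":
    print("first principal line:", nu_principal(1))
    print("sharp formula at m = 1:", nu_sharp(1))
    print("difference of limits:", LIMIT_PRINCIPAL - LIMIT_SHARP_DIFFUSE)
```

Whatever constants are chosen, the sharp formula at m = 1 reproduces the first principal line with reversed sign, and the difference of the two limits equals the wavenumber of that line.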

These formulas suggested to Rydberg the possibility that the first
term on the right might also vary, in the same manner as the second,
thus giving rise to additional series of lines; e.g., we might expect to
find a series represented by the formula

$$\tilde{\nu} = \frac{R}{(2 + S)^2} - \frac{R}{(m + P)^2} \qquad m = 3, 4, \ldots$$

Lines or series of this sort were discovered later by Ritz. Such lines
are called intercombination lines or series, and the possibility of their
occurrence is known as the Ritz combination principle. Many examples
of them are now known.


The most significant features about atomic spectra thus seem to be
the following:

1. The wavenumber of each line is conveniently represented as the
difference between two numbers. These numbers have come to be called terms.

2. The terms group themselves naturally into ordered sequences, the terms
of each sequence converging toward zero.

3. The terms can be combined in various ways to give the wavenumbers of
spectral lines.

4. A series of lines, all having similar character, results from the
combination of all terms of one sequence in succession with a fixed term of another
sequence. Series formed in this manner have wavenumbers which, when
arranged in order of increasing magnitude, converge to an upper limit.
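The four features above can be illustrated with the simplest term sequence. The sketch assumes hydrogen terms of the form $R_H/n^2$, the form implied by Eq. (9.2a), with $R_H$ taken from the table in Sec. 9.2. Function names are illustrative.

```python
R_H = 109_677.58  # cm^-1, hydrogen value quoted in the text


def term(n: int) -> float:
    """Assumed hydrogen term R_H / n**2 in cm^-1; terms converge toward zero."""
    return R_H / n**2


def line(n_fixed: int, n_running: int) -> float:
    """Wavenumber of a line as the difference between two terms."""
    return term(n_fixed) - term(n_running)


if __name__ == "__main__":
    # A series: one fixed term combined with successive terms of the sequence.
    print([round(line(2, n), 1) for n in range(3, 8)])
    # Combination: two observed wavenumbers add up to a third one.
    print(line(1, 2) + line(2, 3), line(1, 3))
```

Because every line is a difference of terms, the sum of the wavenumbers of two lines that share a term is again the wavenumber of a possible line, which is the content of the combination principle.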


The simple picture presented here requires considerable extension.
In writing Rydberg formulas for spectral “lines,” we have ignored the
fine structure of the lines, by means of which series were first picked
out; singlet series, in which each line is single, are known in many
elements, but more commonly the lines form doublets, triplets, or groups
of even more components. In such cases, a separate Rydberg formula
must be written for each component line.


9.4 The Bohr Theory 


Niels Bohr, working in Rutherford’s laboratory at Manchester in 1913,
succeeded in combining Rutherford’s nuclear atom with Planck’s quanta
and Einstein’s photons in a way that predicted the observed spectral
lines of hydrogen. It had long been recognized that coulomb
electrostatic attraction, identical with gravitational attraction in spatial
properties, would lead to stable circular or elliptical orbits for an electron
bound to a positive nucleus if this were the only force involved; but
classical electromagnetic theory predicts that an accelerated charge
radiates energy and, as a consequence, would quickly spiral into the
nucleus. However, Planck’s original quantum theory (Sec. 5.6) involved
two essential features:


1. An oscillator can exist only in one of a number of discrete quantum states, 
and to each of these states there corresponds a definite allowed value of 
its energy. 

2. No radiation is emitted while the oscillator remains in one of its quantum
states, but it is capable of jumping from one quantum state into another
one of lower energy, the energy lost in doing so being emitted in the form
of a pulse or quantum of radiation.


Successful applications of the first of these assumptions had already
been made in other fields, notably by Einstein and especially by Debye
in the specific heat of solids, which will be discussed in Chap. 20.
Nicholson had also attempted to apply the theory to spectra, and with some
success, but he was unable to make it yield a series of lines converging to
a limit. Bohr discovered how to apply similar ideas to a hydrogen atom
of the Rutherford type and succeeded in arriving at a theoretical formula
for its spectrum that agrees with observation.

Bohr assumed that an electron in the field of a nucleus was not
capable of moving along every one of the paths that were possible
according to classical theory but was restricted to one of a certain set of allowed
paths. While it was so moving, he assumed that it did not radiate,
contrary to the conclusion from classical theory, so that its energy
remained constant; but he assumed that the electron could jump from
one allowed path to another one of lower energy and that when it did
so, radiation was emitted containing energy equal to the difference in
the energies corresponding to the two paths.

As to the frequency of the emitted radiation, he considered several
alternative hypotheses but finally adopted the same assumption that
Planck had made for his oscillators. That is, if $E_i$ and $E_f$ are the
energies of the atom when moving in its initial and final paths,
respectively, the frequency $\nu$ of the radiation emitted is determined by the
condition that

$$h\nu = E_i - E_f \tag{9.4}$$

where h is Planck’s constant. This assumption had the additional
advantage of agreeing with that made by Einstein in arriving at his
highly successful photoelectric equation (Sec. 6.7). For the latter reason,
Eq. (9.4) came later to be known as the Einstein frequency condition.

Concerning the formulation of the condition that determines the
allowed paths, Bohr was also in doubt. While the electron remains in
one of its stationary states, as he called them, he supposed it to revolve
in a circular or elliptical orbit about the nucleus in accordance with the
classical laws of mechanics. But what fixes the size and shape of this
orbit? In the end, he postulated that the orbit is a circle, with the
nucleus at its center, and of such size that the angular momentum of the
electron about the nucleus is an integral multiple of the natural unit $h/2\pi$,
which we write $\hbar$. (This is not true, as we shall see subsequently.)

The allowed values of the energies of the stationary states and of
the spectral lines expected on the basis of Bohr’s theory are easily deduced
for a system consisting of a single electron and a nucleus of charge Ze
from his three basic assumptions:


1. The electron revolves in a circular orbit with the centripetal force
supplied by the coulomb interaction between the electron and the nucleus,

$$\frac{1}{4\pi\epsilon_0}\frac{Ze^2}{r^2} = m\frac{v^2}{r} = m\omega^2 r \tag{9.5}$$

where r = radius of allowed orbit
      v = electron velocity
      $\omega$ = corresponding angular velocity

2. The angular momentum A of the electron takes on only values
which are integer multiples of $\hbar$,

$$A = mvr = m\omega r^2 = n\hbar \tag{9.6}$$

3. When an electron makes a transition from one allowed stationary
state to another, the Einstein frequency condition $h\nu = E_i - E_f$ is satisfied.

Eliminating v (or $\omega$) from Eqs. (9.5) and (9.6), we find the allowed
radii for the orbits to be (Fig. 9.4)

$$r_n = 4\pi\epsilon_0\,\frac{\hbar^2 n^2}{m e^2 Z} \tag{9.7}$$

Thus the radii of successive allowed orbits are proportional to $n^2$, or to
1, 4, 9, 16, . . . . For hydrogen Z = 1, and the radius of the smallest
allowed circle, known as the Bohr radius, is given by

$$a_B = r_n(Z = 1, n = 1) = 4\pi\epsilon_0\,\frac{\hbar^2}{m e^2} = 0.5292 \times 10^{-10}\ \text{m} \tag{9.8}$$

The diameter of the orbit is thus close to 1 Å, which agrees well with
estimates from kinetic theory. This is a first indication that the new
theory may be able to explain the apparent sizes of molecules.
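The numerical value of the Bohr radius follows directly from the expression for the allowed radii, $r_n = 4\pi\epsilon_0\hbar^2 n^2/(m e^2 Z)$. The sketch below evaluates it in SI units. The constants are assumed present-day values and are not taken from the text.

```python
import math

# Assumed SI values of the constants (present-day values).
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
HBAR = 1.054571817e-34       # Planck's constant over 2*pi, J s
M_E = 9.1093837015e-31       # electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C


def bohr_orbit_radius(n: int, Z: int = 1) -> float:
    """Radius in meters of the nth allowed circular orbit about a charge Ze."""
    return 4 * math.pi * EPS0 * HBAR**2 * n**2 / (M_E * E_CHARGE**2 * Z)


if __name__ == "__main__":
    for n in range(1, 5):
        print(f"n = {n}: r = {bohr_orbit_radius(n):.4e} m")
```

For n = 1 and Z = 1 the result is about 0.529 × 10⁻¹⁰ m, and the radii grow as n².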


Fig. 9.4 The smallest Bohr orbits for three atoms, drawn to a
common scale: hydrogen (Z = 1), ionized helium (Z = 2), and doubly
ionized lithium (Z = 3).


By Eq. (9.6), we have $v = n\hbar/mr$, and so the velocity of the
electron in the nth orbit is

$$v_n = \frac{1}{4\pi\epsilon_0}\frac{Ze^2}{\hbar n} = \frac{Ze^2}{2\epsilon_0 h n} \tag{9.9}$$

The ratio $\alpha$ of the speed of the electron in the first Bohr orbit of hydrogen
to the speed of light c is known as the fine-structure constant; its value is
given by

$$\alpha = \frac{e^2}{4\pi\epsilon_0\hbar c} = \frac{1}{137.0377} \tag{9.10}$$
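The orbital speed and the fine-structure constant can be evaluated in the same way, using $v_n = Ze^2/(4\pi\epsilon_0\hbar n)$ and dividing the n = 1 hydrogen speed by c. The sketch assumes present-day SI values of the constants, so the last digits of the result need not match the figure quoted in Eq. (9.10).

```python
import math

# Assumed SI values of the constants (present-day values).
EPS0 = 8.8541878128e-12      # F/m
HBAR = 1.054571817e-34       # J s
E_CHARGE = 1.602176634e-19   # C
C_LIGHT = 2.99792458e8       # m/s


def orbital_speed(n: int, Z: int = 1) -> float:
    """Speed in m/s of the electron in the nth Bohr orbit."""
    return Z * E_CHARGE**2 / (4 * math.pi * EPS0 * HBAR * n)


# Ratio of the speed in the first Bohr orbit of hydrogen to the speed of light.
ALPHA = orbital_speed(1) / C_LIGHT

if __name__ == "__main__":
    print(f"v_1 = {orbital_speed(1):.4e} m/s")
    print(f"1/alpha = {1 / ALPHA:.3f}")
```

With these constants 1/α comes out close to 137.036, so the electron in the first orbit moves at somewhat less than 1 percent of the speed of light.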


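The energy of the electron is partly kinetic and partly potential.
If we call the energy zero when the electron is at rest at infinity, its
potential energy in the presence of the nucleus is

$$P = -\frac{1}{4\pi\epsilon_0}\frac{Ze^2}{r}$$

Its kinetic energy is

$$K = \tfrac{1}{2}mv^2 = \tfrac{1}{2}mr^2\omega^2 = \frac{1}{4\pi\epsilon_0}\frac{Ze^2}{2r}$$

by (9.5). The total energy is E = K + P, and so for the nth state

$$E_n = -\frac{1}{4\pi\epsilon_0}\frac{Ze^2}{2r_n} = -\left(\frac{1}{4\pi\epsilon_0}\right)^2\frac{m e^4 Z^2}{2\hbar^2 n^2} = -\frac{m e^4 Z^2}{8\epsilon_0^2 h^2 n^2} \tag{9.11}$$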


Our calculation requires a certain correction. What we have
developed is a fixed-nucleus theory. In reality, if the electron revolves
at a fixed distance from a nucleus of mass M, its path in space is a circle
of radius a′ (Fig. 9.5) about the center of mass of the combined system.
At the same time the nucleus revolves in a smaller circle of radius a″,
where r = a′ + a″. The effect of this correction is to replace the mass
m by the reduced mass

$$m_r = \frac{mM}{m + M} = \frac{m}{1 + m/M} \tag{9.12}$$

Fig. 9.5 [Figure: nucleus of mass M and electron of mass m on opposite
sides of their common center of mass.]

Fig. 9.6 The lower energy levels of hydrogen and a few
of the lines arising from transitions between the levels.
Potential differences between levels are shown at the
right.


If we insert appropriate values for the constants of Eq. (9.11), we
have

$$E_n = -\frac{2.176 \times 10^{-18}\,Z^2}{n^2}\ \text{J} = -\frac{13.58\,Z^2}{n^2}\ \text{eV} \tag{9.11a}$$

Thus an electron bound to a proton can have any one of a discrete set
of allowed energies (Fig. 9.6). In addition, Bohr assumed that an unbound
electron near the proton could move with any value whatever of positive
energy. In such a case the orbit would be a hyperbola. Consequently,
we have, in all, a discrete set of allowed negative energies, or energy levels,
converging to the value zero and, from zero up, a continuum of allowed
positive energies. Since the zero value of the energy corresponds to the
electron at rest at infinity, the numerical value of the (negative) energy of
the normal state also represents the least energy required to ionize the atom
by complete removal of the electron. For hydrogen this ionization energy
is 13.58 eV.
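The energy levels of Eq. (9.11) can be evaluated numerically. The sketch below uses the fixed-nucleus form with assumed present-day SI constants. With these constants the ground-state energy comes out near 13.6 eV in magnitude, slightly larger than the 13.58 eV quoted above.

```python
# Assumed SI values of the constants (present-day values).
EPS0 = 8.8541878128e-12      # F/m
H_PLANCK = 6.62607015e-34    # J s
M_E = 9.1093837015e-31       # kg
E_CHARGE = 1.602176634e-19   # C


def energy_joule(n: int, Z: int = 1) -> float:
    """Energy of the nth level from Eq. (9.11), fixed nucleus, in joules."""
    return -M_E * E_CHARGE**4 * Z**2 / (8 * EPS0**2 * H_PLANCK**2 * n**2)


def energy_ev(n: int, Z: int = 1) -> float:
    """Same energy expressed in electron volts."""
    return energy_joule(n, Z) / E_CHARGE


if __name__ == "__main__":
    for n in range(1, 6):
        print(f"n = {n}: E = {energy_ev(n):8.4f} eV")
```

The levels crowd together toward zero energy as n increases, and the magnitude of the n = 1 value is the ionization energy in this model.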

A final remark should be added to forestall misunderstanding. 
The orbital angular momenta assigned to the various quantum states 
in the Bohr theory are not the correct ones, as we shall see in Chap. 14. 
The Bohr theory does, however, lead to the observed energy levels 
(except for a minor correction). 


9.5 The Spectrum of Atomic Hydrogen 


When an electron makes a radiative transition from an initial state
$n_i$ to a lower final state $n_f$, the resulting photon has, by Eq. (9.4), energy

$$h\nu = \left(\frac{1}{4\pi\epsilon_0}\right)^2\frac{m_r e^4}{2\hbar^2}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)Z^2 \tag{9.13}$$

and wavenumber $\tilde{\nu}$ (number of waves per unit length)

$$\tilde{\nu} = \frac{1}{\lambda} = \frac{m_r e^4}{8\epsilon_0^2 h^3 c}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)Z^2 = R\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right)Z^2 \tag{9.14}$$

where $R = m_r e^4/8\epsilon_0^2 h^3 c$ is the Rydberg constant. Bohr’s calculation of
R in terms of fundamental constants was a dramatic triumph of his
theory.

For hydrogen with Z = 1 the energies and wavenumbers of the
Lyman series photons are given by Eqs. (9.13) and (9.14), respectively,
by setting $n_f = 1$ and letting $n_i$ take values 2, 3, 4, . . . . Since n = 1
corresponds to the lowest allowed energy state, or ground state, of
hydrogen, we see that the lines of the Lyman series arise from electron
transitions from higher, or excited, states to the ground state (Fig. 9.6).

The Balmer series arises from electron transitions to the state with
$n_f = 2$ from initial states characterized by $n_i$ = 3, 4, 5, . . . .
Similarly, the Paschen series results from transitions from states with
$n_i$ = 4, 5, 6, . . . to a final state $n_f = 3$. The Bohr theory successfully
predicted the wavelengths of all atomic-hydrogen lines which had been
discovered and of lines yet to be observed. The Brackett series, found
in 1922, lies in the infrared and is associated with transitions with $n_f = 4$.
In 1924 the Pfund series with $n_f = 5$ was discovered in the far infrared.
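The series described above can be tabulated from Eq. (9.14) with Z = 1. The sketch below uses the spectroscopic value of the hydrogen Rydberg constant from Sec. 9.2 and returns vacuum wavelengths, so the Balmer lines come out slightly longer than wavelengths measured in air. The dictionary and function names are illustrative.

```python
R_H = 109_677.58  # cm^-1, hydrogen value quoted in Sec. 9.2

# Final-state quantum number of each named series.
SERIES = {"Lyman": 1, "Balmer": 2, "Paschen": 3, "Brackett": 4, "Pfund": 5}


def wavelength_angstrom(n_f: int, n_i: int) -> float:
    """Vacuum wavelength in angstroms for the hydrogen transition n_i -> n_f."""
    if n_i <= n_f:
        raise ValueError("emission requires n_i > n_f")
    wavenumber = R_H * (1 / n_f**2 - 1 / n_i**2)  # cm^-1
    return 1e8 / wavenumber                        # 1 cm = 1e8 angstroms


if __name__ == "__main__":
    for name, n_f in SERIES.items():
        first = wavelength_angstrom(n_f, n_f + 1)
        limit = 1e8 * n_f**2 / R_H   # series limit, n_i -> infinity
        print(f"{name:9s} first line {first:9.1f} A, limit {limit:9.1f} A")
```

The first Lyman line falls near 1216 Å in the ultraviolet, the first Balmer line near 6565 Å in the red, and the remaining series lie in the infrared.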


9.6 Ionized Helium 


A helium atom which has lost a single electron resembles a hydrogen
atom, except that Z = 2 and the nucleus is nearly four times as heavy.
The spectrum emitted by such atoms is known as the spark spectrum
of helium (or the helium II spectrum), because it is emitted much more
strongly when the helium is excited by a spark than when it is excited
by an arc. The arc spectrum of helium, emitted by the neutral atom,
will be considered later (Sec. 17.4).

Putting Z = 2 in Eq. (9.14), we see that ionized helium should emit
the same spectrum as hydrogen except that all frequencies are four times as
great, or all wavelengths a quarter as great.


This conclusion from the theory agrees with observation except for
a slight numerical discrepancy due to the fact that the reduced mass
$m_r$ is slightly greater for He than for H. So precisely can the
wavelengths of spectral lines be measured that the experimental value of the
Rydberg constant for hydrogen and for helium can be used to make
one of the better determinations of the ratio of the mass of the proton to
the mass of the electron.

In a similar way, doubly ionized lithium is found to emit the hydrogen
spectrum with all frequencies multiplied (almost exactly) by 9; trebly
ionized beryllium emits them increased in the ratio 16; and so on. The
first line of the Lyman series for quadruply ionized boron (Z = 5) has
been found by Edlén at 48.585 Å, with a frequency 25.04 times that
of the first Lyman line of hydrogen. The similarity of spectra of atoms
and ions which have identical numbers of electrons extends throughout
the periodic table. The term isoelectronic sequence is used to refer to a
sequence of atoms and ions having the same number of electrons; thus
H, He⁺, Li²⁺, Be³⁺, B⁴⁺, and so forth constitute the one-electron
isoelectronic sequence.
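The remark in this section that the two Rydberg constants fix the proton-electron mass ratio can be made concrete. Since R for a nucleus of mass M equals $R_\infty/(1 + m/M)$, the ratio of the helium and hydrogen values gives m/M for the proton once the ratio of the two nuclear masses is known. The sketch below uses the spectroscopic values from Sec. 9.2. The helium-to-proton mass ratio of 3.9726 is an assumed value, not taken from the text.

```python
R_H = 109_677.58     # cm^-1, hydrogen (Sec. 9.2)
R_HE = 109_722.27    # cm^-1, helium (Sec. 9.2)
HE_TO_PROTON = 3.9726  # assumed ratio of helium-nucleus mass to proton mass

# Ratio of the two Rydberg constants; R_inf cancels.
ratio = R_HE / R_H

# Ionized helium lines are higher in frequency than hydrogen lines by this factor.
frequency_factor = 4 * ratio

# With x = m/M_p:  ratio = (1 + x) / (1 + x / k), k = HE_TO_PROTON.
# Solving for x gives x = (ratio - 1) / (1 - ratio / k).
electron_to_proton = (ratio - 1) / (1 - ratio / HE_TO_PROTON)
proton_to_electron = 1 / electron_to_proton

if __name__ == "__main__":
    print(f"frequency factor He+/H: {frequency_factor:.5f}")
    print(f"proton/electron mass ratio: {proton_to_electron:.1f}")
```

The frequency factor is slightly greater than 4, and the inferred mass ratio is close to 1836. The result is sensitive to the last digits of the two Rydberg constants, which is why precise wavelength measurements are needed.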


9.7 Moseley’s Law 


In 1913, the same year in which Bohr published his theory and Bragg
reported the development of the x-ray spectrometer, Moseley undertook
the systematic study of characteristic x-rays discussed in Sec. 7.7. In
searching for a relation between the frequency of the Kα line and some
property of the atom in which the line originated, Moseley first observed
that the frequency did not vary uniformly with the atomic weight
(curve A, Fig. 7.10). In Bohr’s theory, however, which had just been
proposed, the charge on the nucleus played a fundamental role.
According to Eq. (9.13) the frequency of a spectral line for a one-electron atom
is given by

$$\nu = \frac{m_r e^4 Z^2}{8\epsilon_0^2 h^3}\left(\frac{1}{n_f^2} - \frac{1}{n_i^2}\right) \tag{9.13a}$$

If we set $n_f = 1$ and $n_i = 2$ in Eq. (9.13a), we obtain

$$\nu = 0.246 \times 10^{16}\,Z^2\ \text{Hz}$$

Except for the slight correction to Z, this equation is almost identical
to Eq. (7.2). This agreement suggests that the Kα line arises from an
electron transition from an initial state $n_i = 2$ to a final state $n_f = 1$.

It seems remarkable that a formula derived for a one-electron
atom should be applicable to an atom such as copper with its 29
electrons. However, as we shall see in Chap. 15, we may think of an atom
as having at most two electrons with n = 1. To excite K x-ray lines
one of these electrons must be somehow ejected. If an electron from a
state with n = 2 makes a radiative transition to fill this vacancy, a Kα
photon is emitted. In terms of the Bohr picture the other electrons are
in orbits of greater radius and therefore affect the radiated energy little.

A Moseley plot of $\nu^{1/2}$ as a function of Z for the Kβ line had a
somewhat larger slope than that for the Kα line, but roughly the same
intercept on the atomic-number abscissa. Moseley found that the slope
agreed well with that predicted by Eq. (9.13a) with $n_f = 1$ and $n_i = 3$.
Thus within a year of its publication the Bohr theory achieved another
great success.
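The coefficient of Z² for the Kα line and the relative slopes of the Moseley plots can be checked with a short calculation. The sketch writes the one-electron frequency as c R Z² (1/n_f² − 1/n_i²) and uses the infinite-mass Rydberg constant, which is adequate for heavy nuclei. The screening correction of 1 applied to Z in the copper example is an assumption made here for illustration, not a value taken from the text.

```python
C_CM_PER_S = 2.99792458e10  # speed of light in cm/s
R_INF = 109_737.31          # cm^-1, infinite-mass Rydberg constant (Sec. 9.2)


def line_frequency(z_eff: float, n_f: int, n_i: int) -> float:
    """Frequency in Hz of the transition n_i -> n_f for effective charge z_eff."""
    return C_CM_PER_S * R_INF * z_eff**2 * (1 / n_f**2 - 1 / n_i**2)


K_ALPHA_COEFF = line_frequency(1, 1, 2)   # Hz per Z**2 for n = 2 -> 1
K_BETA_COEFF = line_frequency(1, 1, 3)    # Hz per Z**2 for n = 3 -> 1

# Ratio of slopes of sqrt(frequency) versus Z for the two lines.
SLOPE_RATIO = (K_BETA_COEFF / K_ALPHA_COEFF) ** 0.5


def k_alpha_wavelength_angstrom(Z: int, screening: float = 1.0) -> float:
    """Estimated K-alpha wavelength; the screening constant is an assumption."""
    nu = line_frequency(Z - screening, 1, 2)
    return C_CM_PER_S / nu * 1e8


if __name__ == "__main__":
    print(f"K-alpha coefficient: {K_ALPHA_COEFF:.4e} Hz")
    print(f"slope ratio K-beta / K-alpha: {SLOPE_RATIO:.4f}")
    print(f"copper K-alpha estimate: {k_alpha_wavelength_angstrom(29):.3f} A")
```

The coefficient reproduces the value of about 0.246 × 10¹⁶ Hz, the Kβ slope is larger by roughly 9 percent, and the copper estimate lands near 1.55 Å.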


9.8 The Bohr Magneton 


An electron moving in a Bohr orbit should possess a magnetic moment.
A loop bearing a current I has a magnetic moment $\mu$ equal to the product
of I and the area of the loop. For the nth Bohr orbit the area is $\pi r_n^2$,
while the current (charge passing per unit time) is −e multiplied by
the number of revolutions made by the electron per unit time. Thus

$$\mu = I(\pi r_n^2) = -\frac{ev}{2\pi r_n}\,\pi r_n^2 = -\frac{e v r_n}{2} \tag{9.15}$$

where v is the velocity of the electron. If we multiply both numerator
and denominator by m and recall that $m v r_n$ is the orbital angular
momentum, we have

$$\boldsymbol{\mu}_l = -\frac{e}{2m}\,\mathbf{A}_l \tag{9.16}$$

where $\boldsymbol{\mu}_l$ is the vector magnetic moment and $\mathbf{A}_l$ the vector angular
momentum and the negative sign shows these two vectors to be in
opposite directions for a negative revolving charge (Fig. 9.7). The
subscript l is used here and later to designate “orbital” to distinguish
it from the corresponding quantities we shall associate with electron
spin (Sec. 9.9). The ratio $\mu_l/A_l$ (= −e/2m) is called the gyromagnetic
ratio for orbital motion.


Fig. 9.7 The orbital and spin angular momenta of an electron
with the associated magnetic moments.


For a Bohr hydrogen atom in its ground state n = 1, and by
Eq. (9.6) $(A_l)_{n=1} = \hbar$. The magnetic moment predicted by Eq. (9.16)
for this orbit is called the Bohr magneton, given by

$$\mu_B = \frac{e\hbar}{2m} \tag{9.17}$$

It has the numerical value 9.27 × 10⁻²⁴ A-m².

Although quantum mechanics shows that the orbital motion of a
hydrogen electron in its ground state contributes no magnetic moment,
it turns out that electronic magnetic moments are conveniently expressed
in units of Bohr magnetons, and we shall use them repeatedly in later
chapters.
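The value of the Bohr magneton, and its equality with the current-loop moment of the first Bohr orbit, can be verified numerically. The sketch evaluates $e\hbar/2m$ directly and also forms the loop moment $e v r/2$ from the orbital radius and speed of the nth orbit. The SI constants are assumed present-day values.

```python
import math

# Assumed SI values of the constants (present-day values).
EPS0 = 8.8541878128e-12      # F/m
HBAR = 1.054571817e-34       # J s
M_E = 9.1093837015e-31       # kg
E_CHARGE = 1.602176634e-19   # C

# Bohr magneton, e * hbar / (2 m), in A m^2.
MU_B = E_CHARGE * HBAR / (2 * M_E)


def orbital_moment(n: int, Z: int = 1) -> float:
    """Magnitude of the current-loop moment e*v*r/2 for the nth Bohr orbit."""
    r = 4 * math.pi * EPS0 * HBAR**2 * n**2 / (M_E * E_CHARGE**2 * Z)
    v = Z * E_CHARGE**2 / (4 * math.pi * EPS0 * HBAR * n)
    return E_CHARGE * v * r / 2


if __name__ == "__main__":
    print(f"Bohr magneton: {MU_B:.4e} A m^2")
    for n in range(1, 4):
        print(f"n = {n}: moment = {orbital_moment(n) / MU_B:.3f} Bohr magnetons")
```

In the Bohr picture the moment of the nth orbit is n Bohr magnetons, independent of Z, because the angular momentum is n times the quantum unit.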

The following feature of classical electromagnetics is of interest.
Suppose that the magnetic moment $\mu$ of a body is proportional to its
vector angular momentum A about its center of mass. Draw OP (Fig.
9.8) to represent A, making an angle $\theta$ with an axis OB drawn in the
direction of a magnetic field B. Then, during time dt, the torque on
the magnet will add to A a vector increment dA of magnitude $\mu B \sin\theta\,dt$
in a direction perpendicular to the plane POB. This increment will
change the direction of A but not its magnitude, so that A precesses
about OB in such a way that the end of the vector OP revolves in a
circle of radius $A \sin\theta$; since during dt it revolves through an angle
$dA/(A \sin\theta) = \mu B \sin\theta\,dt/(A \sin\theta)$ or $\mu B\,dt/A$, it revolves at an angular
rate

$$\omega = \frac{\mu B}{A}\ \text{rad/s} \tag{9.18}$$

Fig. 9.8 Precession of a magnetic gyro in a magnetic field.


For an electron in orbital motion, its orbital angular momentum
$A_l$, of magnitude $2m\mu_l/e$ by Eq. (9.16), will thus precess at the numerical
rate

$$\omega = \frac{eB}{2m} \tag{9.19}$$

The angular frequency $\omega$ is known as the Larmor precessional frequency.
According to a theorem due to Larmor, the introduction of a magnetic
field modifies the classical motion of a set of electrons in the field of a
nucleus chiefly as if a uniform rotation at angular frequency $\omega$ were
superposed upon their original motion. The angular momentum of the
electrons about an axis parallel to B is thereby slightly altered, as a
result of inductive action during the production of the field, but this
change is so small compared to the values of $A_l$ due to orbital motion
that it may be ignored in the present connection.
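To give a sense of scale, the Larmor frequency can be evaluated for a laboratory field. The sketch computes eB/2m and also checks that the general precession rate μB/A reduces to the same value when the moment is one Bohr magneton and the angular momentum is one quantum unit. The SI constants are assumed present-day values, and the field of 1 T is an arbitrary example.

```python
import math

# Assumed SI values of the constants (present-day values).
HBAR = 1.054571817e-34       # J s
M_E = 9.1093837015e-31       # kg
E_CHARGE = 1.602176634e-19   # C
MU_B = E_CHARGE * HBAR / (2 * M_E)  # Bohr magneton, A m^2


def precession_rate(mu: float, angular_momentum: float, b_field: float) -> float:
    """Angular precession rate mu*B/A in rad/s for moment mu and momentum A."""
    return mu * b_field / angular_momentum


def larmor_angular_frequency(b_field: float) -> float:
    """Larmor angular frequency e*B/(2m) in rad/s for a field in tesla."""
    return E_CHARGE * b_field / (2 * M_E)


if __name__ == "__main__":
    omega = larmor_angular_frequency(1.0)
    print(f"omega = {omega:.4e} rad/s, frequency = {omega / (2 * math.pi):.4e} Hz")
```

For a field of 1 T the precession rate is about 8.8 × 10¹⁰ rad/s, or roughly 14 GHz, far slower than the orbital motion itself.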


9.9 Electron Spin 


In 1925 Goudsmit and Uhlenbeck pointed out that certain features in
atomic spectra, such as the doublet fine structure of the alkali elements,
could be explained if it was assumed that the electron “spins” about an
axis through its center of mass and that it has both angular momentum
and a magnetic moment associated with this rotation. In the Bohr model
with spin the electron is closely analogous to the earth, which rotates
about its axis each sidereal day and revolves about the sun each year.
The spin angular momentum $A_s$ of the electron has a constant magnitude

$$A_s = \sqrt{\tfrac{1}{2}\left(\tfrac{1}{2} + 1\right)}\,\hbar = \sqrt{s(s + 1)}\,\hbar \tag{9.20}$$

where s has the value ½.

Any measurement of the component of the spin angular momentum
parallel to some prescribed z axis yields the result that

$$(A_s)_z = m_s\hbar = \pm\tfrac{1}{2}\hbar \tag{9.20a}$$

where $m_s$ is called the spin quantum number, for which the allowed values
are ½ and −½. Thus, by virtue of its spin, the electron possesses two
internal quantum states, characterized by the two values of $m_s$.

Associated with the spin angular momentum $A_s$ there is a magnetic
moment

$$\boldsymbol{\mu}_s = -g_s\frac{e}{2m}\,\mathbf{A}_s = -g_s\frac{e}{2m}\,\frac{\sqrt{3}}{2}\,\hbar\,\mathbf{1}_s \tag{9.21}$$

where $g_s$ is a number¹ known as the spin g factor and $\mathbf{1}_s$ is a unit vector
in the direction of $\mathbf{A}_s$. If a magnetic field defines the z direction, the
quantization rule, Eq. (9.20a), requires that the z component of $\boldsymbol{\mu}_s$ have
one of the two values

$$(\mu_s)_z = \mp g_s\,\frac{e}{2m}\,\frac{\hbar}{2} \tag{9.21a}$$

Comparison with Eq. (9.17) shows that this is essentially 1 Bohr magneton.
The interaction of the spin magnetic moment of an electron with the
magnetic field arising from its orbital motion plays an important role in
the fine structure of atomic energy levels.
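The spin relations of this section are easy to evaluate in natural units. The sketch below expresses angular momenta in units of the quantum unit h/2π and magnetic moments in Bohr magnetons, so that the z component of the spin moment is simply the g factor times the spin quantum number, with reversed sign. The g factor of 2.00232 is an assumed present-day value.

```python
import math

S = 0.5          # spin of the electron
G_S = 2.00232    # assumed spin g factor

# Magnitude of the spin angular momentum, in units of h/(2*pi).
SPIN_MAGNITUDE = math.sqrt(S * (S + 1))

# Allowed z components, in the same units.
M_S_VALUES = (+0.5, -0.5)

# Angle between the spin vector and the z axis for m_s = +1/2, in degrees.
ANGLE_DEG = math.degrees(math.acos(0.5 / SPIN_MAGNITUDE))

# z components of the spin magnetic moment, in Bohr magnetons.
MU_Z = [-G_S * m_s for m_s in M_S_VALUES]

if __name__ == "__main__":
    print(f"|A_s| = {SPIN_MAGNITUDE:.4f} (units of h/2pi)")
    print(f"angle to z axis = {ANGLE_DEG:.2f} degrees")
    print("z components of moment (Bohr magnetons):", MU_Z)
```

The spin vector is never aligned with the z axis, since its magnitude exceeds its largest z component, and the z component of the moment is very nearly one Bohr magneton in magnitude.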


¹ Actually, $g_s$ = 2.00232; see Sec. 18.3.


9.10 The Bohr Correspondence Principle


Bohr’s theory represents a remarkable combination of principles
taken over from classical theory with postulates radically at variance
with that theory. He solved the old problem of stability merely by
postulating that the cause of instability, the emission of radiation and
the accompanying radiation reaction, did not exist so long as the
electron remained in one of its allowed stationary orbits. The
electron could thus remain in its stationary state of lowest energy
indefinitely, without spiraling down into the nucleus, as classical theory
would require. But the problem of stability was solved only at the
expense of throwing away the only picture we had of the mechanism
by which the atom could emit spectral lines. For Bohr’s postulates
provide no picture of the sequence of events during transitions between
stationary states. A hybrid theory of this sort was widely felt to be
unsatisfactory, but it was astonishingly successful. Later we describe the
wave-mechanical theory that has replaced the Bohr theory. This theory
leads very nearly to the same set of energy values for the hydrogen
atom, but it suggests a quite different picture of the behavior of the
electron while in a quantum state. In this theory, it is only about
half true that the electron is in motion in the atom, even when it is in
one of its higher quantum states; at least, it cannot be said to follow
a definite orbit. Because of the abstractness of wave mechanics the
original simple Bohr picture retains something more than mere
historical interest, and it offers a bridge between classical physics and
modern quantum mechanics.

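Classically an electron in a circular orbit radiates at the frequency
of its motion. In Bohr’s theory the emitted spectral frequencies are
distinct from the frequency of orbital revolution of the electron. This
latter frequency can be found by solving Eqs. (9.5) and (9.6) for $\omega$,
from which

$$\nu_{orb} = \frac{\omega}{2\pi} = \frac{m e^4 Z^2}{8\epsilon_0^2 h^3}\,\frac{2}{n^3} \tag{9.22}$$

The emitted frequency for a transition between states $n_i$ and $n_f$ is
given by Eq. (9.13a), which, with $m_r$ replaced by m for a fixed nucleus,
may be written

$$\nu = \frac{m e^4 Z^2}{8\epsilon_0^2 h^3}\,\frac{(n_i - n_f)(n_i + n_f)}{n_i^2\,n_f^2}$$

It can be shown that, since $n_i > n_f$,

$$\frac{2}{n_i^3} < \frac{n_i + n_f}{n_i^2\,n_f^2} < \frac{2}{n_f^3} \tag{9.23}$$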

Hence, when $n_i - n_f = 1$, the frequency $\nu$ of the emitted radiation
is intermediate between the frequencies of orbital revolution in the
initial and final states; for large n the emitted and orbital
frequencies are almost the same. Furthermore, putting in succession
$n_i - n_f$ = 2, 3, . . . , we have an approximation to various harmonic
overtones of a fundamental frequency. Thus, quantum jumps in
which $\Delta n > 1$ correspond to the overtones in the case of classical
vibrations. The fact that in the limit characterized by large quantum
numbers classical physics and quantum physics lead to the same predictions
was pointed out by Bohr in 1923 and is known as the Bohr
correspondence principle. The requirement that the prediction of any
quantum-mechanics calculation reduce to that of classical physics in the
appropriate limit was a great heuristic aid in the early development of
quantum mechanics, before the theory was fully established. It continues
to be a useful test for some theories.


9.11 Extension of Bohr’s Theory


In his original paper, Bohr remarked that the orbit of the electron
in a hydrogen atom might be an ellipse instead of a circle. A detailed
theory of elliptical orbits was developed by Sommerfeld several years
later. The geometrical ideas involved still possess a certain interest.
According to the laws governing motion under an inverse-square
force, an elliptical orbit has one of its foci at the center of mass and the
energy of the system depends only on the major axis of the ellipse.
The orbit lies in a fixed plane, so that the motion can be described by
means of polar coordinates r, $\theta$, with the origin at the nucleus. Then,
as $\theta$ increases through $2\pi$, r increases from its minimum to a maximum,
after which it decreases again to the minimum (Fig. 9.9).

To select from the infinity of classically allowed ellipses
Sommerfeld generalized the quantum condition which had been used by
Planck and by Bohr. We saw in Sec. 5.6 that Planck assumed that
the ellipse arising from a plot of p as a function of x for a harmonic
oscillator (Fig. 5.5) had an area equal to an integer times h. This area
may be represented as $\oint p\,dx$, where $\oint$ means integration over a
complete cycle. Therefore, Planck’s condition for a quantum state of
an oscillator can be expressed as

$$\oint p\,dx = nh \tag{9.24}$$
This rule is easily modified to apply to a Bohr orbit. If we choose
as coordinate of the electron its angular displacement $\theta$ (Fig. 9.9),
the generalized momentum $p_\theta$ corresponding to $\theta$ is the angular
momentum mvr, and we see that

$$\oint p_\theta\,d\theta = \int_0^{2\pi} mvr\,d\theta = nh \tag{9.25}$$

gives just the Bohr condition, Eq. (9.6). Wilson (1915) and
Sommerfeld (1916) independently concluded that for any generalized
coordinate q and its conjugate momentum $p_q$, the quantum condition

$$\oint p_q\,dq = nh \tag{9.26}$$

was applicable. This condition, known as the Wilson-Sommerfeld
quantization rule, was widely used in the expansion of the older
quantum theory. (It is now known that it is not a generally valid
condition.) For his elliptic orbits Sommerfeld therefore took

$$\oint p_r\,dr = n'h \tag{9.27a}$$

$$\oint p_\theta\,d\theta = kh \tag{9.27b}$$

where the integers n′ and k are known respectively as the radial and
azimuthal quantum numbers.

Fig. 9.9 An electron revolving in an elliptical orbit about a proton at
one focus.

Sommerfeld assumed that n′ could be 0, 1, 2, 3, . . . while the
azimuthal quantum number k took on values 1, 2, 3, . . . , with 0
excluded since that would correspond to harmonic oscillation of the
electron directly through the nucleus. He showed that the orbits
allowed by Eqs. (9.27a, b) had semimajor axes (n′ + k)²a_B/Z and
semiminor axes (n′ + k)ka_B/Z, where a_B is the Bohr radius. The
allowed energies in the nonrelativistic case depended only on the total
quantum number n (= n′ + k) and are the same as those of the Bohr
theory. Thus up to this point nothing is gained except a greater
variety of orbital shapes. As an example, for n = 3, we can have the
Bohr circle with k = 3 and n′ = 0 or either one of two ellipses with
major axis equal to the diameter of the circle, with k = 2, n′ = 1
or k = 1, n′ = 2, respectively. These orbits are shown to scale in
Fig. 9.10 labeled also with wave-mechanical and spectroscopic symbols
(Sec. 15.1).
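
The n = 3 example can be tabulated with a short sketch. It assumes the axis formulas quoted above, semimajor axis (n′ + k)²a_B/Z and semiminor axis (n′ + k)ka_B/Z, and expresses lengths in units of the Bohr radius; the function name is ours.

```python
def sommerfeld_orbits(n, Z=1):
    """Return (k, n_radial, semimajor, semiminor), lengths in Bohr radii."""
    orbits = []
    for k in range(n, 0, -1):        # azimuthal number k = n, ..., 1 (0 excluded)
        n_radial = n - k             # radial quantum number n'
        semimajor = n**2 / Z         # (n' + k)^2 a_B / Z
        semiminor = n * k / Z        # (n' + k) k a_B / Z
        orbits.append((k, n_radial, semimajor, semiminor))
    return orbits

for k, n_r, a, b in sommerfeld_orbits(3):
    print(f"k = {k}, n' = {n_r}: a = {a:g} a_B, b = {b:g} a_B")
```

All three orbits share the same semimajor axis, 9a_B, which is why they have the same nonrelativistic energy; only the semiminor axis changes with k.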

Fig. 9.10 The Bohr-Sommerfeld orbits for n = 3.

Sommerfeld then showed, however, that, if allowance is made
for the known variation of electronic mass with speed, the energy of the
elliptical motion is slightly different from that of the circular motion,
and in this way he arrived at a splitting, or fine structure, of the levels
of one-electron atoms, which appeared to be in quantitative agreement
with observation. However, the idea of sharp orbits is not in accord
with modern quantum mechanics, which we believe to be the correct
approach and which is capable of predicting the hydrogen fine structure
in much greater detail than Sommerfeld's historic elliptic orbits could.
In the meantime, Bohr and others were endeavoring to extend the 
theory to atoms containing more than one electron. Little quantitative
success was achieved in this direction. In particular, no plausible
arrangement of orbits could be devised for the two electrons of helium 
which would give the correct value for the first ionization potential 
(24.6 V). In a broader way, Bohr endeavored to understand the 
known variation of chemical and physical properties from atom to 
atom through the periodic table, but without great success. The truth 
is that two principles essential for the understanding of complex atoms 
were at that time unknown—the Pauli exclusion principle and the 
phenomenon of electron spin. Accordingly, we shall follow the old 
quantum theory no further. 


Problems 


1. Calculate the wavelengths of the first two lines and of the series 
limit for the Balmer series of ionized helium. 

Ans: 164, 122, 91 mμ
2. Find the energies of the photons and the wavelengths of the first
three lines of the Paschen series for hydrogen.

Ans: 0.66 eV, 1.88 μ; 0.97 eV, 1.28 μ; 1.13 eV, 1.09 μ
3. How many revolutions does an electron in the first Bohr orbit of
hydrogen make each second? To what current does this correspond?
Find the magnetic induction at the proton due to the motion of the
electron.

Ans: 6.6 × 10¹⁵ rps; 1.05 mA; 12.5 Wb/m²
4. Hydrogen atoms are in an excited state with n = 5. In terms of
the Bohr model how many spectral lines can be radiated as these atoms
return to the ground state? Find the energies of the lines with the
longest and shortest wavelengths.
5. What is the binding energy of the single electron of a B⁴⁺ ion?
Find the wavelength of the photon emitted by such an ion when the elec- 
tron goes from the first excited state to the ground state. 
6. Find the energy in electron volts required to strip a calcium atom 
(Z = 20) of its last electron, assuming the other 19 have been removed. 
How does this compare with the energy required to excite the K x-ray 
lines of Ca, which is about 3.7 keV? Why the difference? 


Ans: 5.4 keV; other electrons change potential 


7. Find the frequency of a photon emitted when a hydrogen atom 
makes a transition from a state with n = 30 to one with n = 29. Find 
the fractional difference between this frequency and the orbital frequency 
of the electron in its initial state. 


8. Hydrogen atoms are initially in the n = 4 energy level. How many 
different photon energies will be emitted if the atoms go to the ground 
state, with all possible transitions represented? Assuming that from any 
given excited state all possible downward transitions are equally probable 
(not true!), what is the total number of photons emitted if there are 
600 atoms initially in the n = 4 state? 

9. Calculate the energy and wavelength of a photon which is emitted 
when a hydrogen atom initially at rest makes a transition between states 
characterized by n = 6 and n = 1. Find the speed with which the H 
atom recoils. 

10. What transition in singly ionized helium leads to a line with almost 
the same wavelength as that of the first line of the hydrogen Lyman 
series? Using the appropriate Rydberg constants, find the wavelength 
difference between these two lines. Which of the lines has the shorter 
wavelength? 

11. Show that, on the basis of the approximations which led to Eq.
(9.13a), the slope of the Moseley plot for Kβ is expected to be √(32/27)
times that for Kα.


12. Show that if the electron and nucleus of a Bohr atom revolve about 
the center of mass, the angular momentum of the system and the allowed
radii and energies are all given by expressions identical to those of
Sec. 9.4 except for the replacement of m by the reduced mass m_r.

13. A hydrogen atom in a state with n = 6 is moving with a speed of 
5 × 10⁷ m/s when it enters a uniform magnetic field with induction B.
What is the minimum value of B which can ionize the atom if this can 
be done by exerting a disruptive force greater than or equal to the 
coulomb attraction? 

14. A hydrogen atom initially at rest and in its ground state absorbs 
a 100-eV photon. If the ejected photoelectron moves in the same direc- 
tion as that in which the incident photon was moving, find (a) the kinetic 
energy and speed of the photoelectron, and (b) the momentum and energy 
of the recoiling proton. 

15. Consider an electron rotating about a nucleus of charge Ze in a
circular path of radius 1 Å. Classically, such an electron should radiate
energy at the rate

$$P = \frac{1}{4\pi\epsilon_0}\,\frac{2e^2a^2}{3c^3}$$

where a is the acceleration of the charge. If this were true, find the
initial frequency and wavelength emitted. Show that this electron would
spiral into a nucleus of radius 10-!* m in a time of about 10-!°Z-' s. 
Ans: λ = 1.18 × 10⁻⁷ Z^(−1/2) m
16. The Wilson-Sommerfeld quantization rule (not always valid!) states
that ∮p dq = nh, where n is an integer. If an elastic bouncing ball
follows this law, show that the allowed energies are quantized with the
eigenvalues E = (9mg²h²n²/32)^(1/3).
17. A hydrogen atom at rest is struck by a second H atom with kinetic
energy K. Show that the minimum K which is required to raise one
of the atoms to its first excited state is 1.5 times the ionization energy.
18. Calculate the smallest kinetic energy which an electron may have
and still excite a hydrogen atom initially at rest. What minimum kinetic
energy is necessary to ionize the H atom?
19. From the values of the Rydberg constant for helium and for hydrogen
(and from the ratio of the atomic weights of H and He), find the ratio
of the mass of the proton to that of the electron.


chapter ten


Particles and Interactions 


The hydrogen atom is composed of a proton serving as nucleus and a 
single electron. It was once hoped that perhaps all matter could be built 
up from protons and electrons alone. However, this is not the case. In 
this chapter we introduce some of the other particles of modern physics and
discuss qualitatively the interactions between them.


10.1 Alpha and Gamma Spectra of Radioelements 


The study of the energies of the photons emitted by excited atoms is a 
primary source of information regarding the allowed energy states of 
the electrons. Similarly, measurement of the energies of the α, β, and
γ rays emitted by radioactive elements yields important clues for
determining the energy-level scheme of nuclei.
a. Alpha-particle Spectra The energies of α particles can be determined
from their deflection in a magnetic field, once their specific charge e/m 
is known. If the intensity of the source suffices, the measurement can 
be made in a straightforward manner, defining a narrow beam with a 
system of slits and observing the deflection in a uniform field by means 
of a photographic plate. For a precision determination, a very thin 
source must be used, in order to minimize the energy loss of the particles 
in emerging from the source. Magnetic spectrographs employing special 
geometry or specially shaped fields which focus rays emerging from the 
source over a range of angles are often used when only weak sources are 
available. A simple example of this type, the semicircular spectro- 
graph, is illustrated in Fig. 10.1. With a uniform field perpendicular to 
the plane of the figure, trajectories starting at angles differing even by 
several degrees come together again after 180° deflection and a reasonably 
sharp line image of the point source is formed. If the magnetic field 
is made to vary as the inverse square root of the radius, rays emerging 
in a cone are brought to a point focus after a 255° deflection. 
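
The relation between deflection and energy can be made concrete with a minimal sketch. It assumes the nonrelativistic relations p = qBr and K = p²/2m for a doubly charged α particle; the constant values and the sample value of Br are our own illustrative choices, not data from the text.

```python
E_CHARGE = 1.602e-19     # elementary charge, C
U_KG = 1.6604e-27        # kg per unified atomic mass unit
ALPHA_MASS_U = 4.0015    # assumed alpha-particle mass in u (He-4 atom less two electrons)

def alpha_energy_mev(b_rho):
    """Nonrelativistic kinetic energy (MeV) of an alpha particle from B*r in tesla-metres."""
    momentum = 2 * E_CHARGE * b_rho                      # p = qBr with q = 2e
    energy_joule = momentum**2 / (2 * ALPHA_MASS_U * U_KG)
    return energy_joule / E_CHARGE / 1e6

# Hypothetical reading: B*r = 0.355 T m gives an energy of roughly 6 MeV
print(f"{alpha_energy_mev(0.355):.2f} MeV")
```

Because K varies as (Br)², a given fractional error in the measured radius produces twice that fractional error in the energy, which is one reason thin sources and sharp focusing matter.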

Fig. 10.1 Principle of the magnetic spectrograph with semicircular
focusing (source, defining slits, and image are marked). There is a
uniform magnetic field perpendicular to the plane of the figure.

Fig. 10.2 Magnetic spectrum of α particles from a source of radon and
its daughters (lines labeled Rn, RaA, RaC′, and central ray). The
line at the right is from the undeflected beam, obtained with no
magnetic field. [S. Rosenblum, J. Phys. Radium, 1:438 (1930).]

It was discovered rather early that α particles from radioactive
substances occur in groups, each quite homogeneous in energy. In Fig. 10.2
is shown, for example, the magnetic spectrum of Rn²²², RaA (Po²¹⁸),
and RaC′ (Po²¹⁴) obtained with a radon source. For a time it was
thought that all α particles from a given radioactive species had the same
energy, and the observed complexity was attributed to the presence of
several members of a radioactive chain in the source. Later work
showed, however, that this was not the case and that many nuclides emit
more than one group. In the simpler case, where only one group is
emitted (radon is an example), the α particle is believed to result from
a transition from the normal, or ground, state of the parent nucleus
to that of the daughter, with a release of energy equal to the kinetic
energy of the α particle plus the energy of recoil. In a more complicated
case, like that of Bi²¹² (see Fig. 10.3), the appearance of groups having
less than the maximum energy is attributed to transitions to excited
quantum states of the residual nucleus (Tl²⁰⁸), which then release their
excitation energy in the form of γ radiation. The decay of these levels
to the ground state is thus quite analogous to the decay of excited atomic
states, and the γ spectrum corresponds to the line spectrum emitted by
an excited atom. The correctness of this conclusion can be established
by comparing the γ-ray energies and α energies: a short-range α particle
should be accompanied by one or more γ rays, such that the total energy
equals that of the longest-range α particles. Such a comparison is
exhibited in Table 10.1.
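
The bookkeeping behind Table 10.1 can be sketched as follows. The recoil correction assumes nonrelativistic momentum conservation, so that the recoil energy is the α energy times the ratio of α mass to daughter mass; mass numbers (4 and 208) stand in for the exact masses, which is our simplification.

```python
def disintegration_energy(alpha_mev, daughter_a, alpha_a=4):
    """Alpha kinetic energy plus recoil energy, using mass numbers for the mass ratio."""
    return alpha_mev * (1 + alpha_a / daughter_a)

# (alpha energy, gamma energy) in MeV, taken from Table 10.1; the daughter is Tl-208
rows = [(6.0837, 0.0), (6.0445, 0.03995), (5.7621, 0.3267),
        (5.6202, 0.4709), (5.6012, 0.4908)]

for alpha, gamma in rows:
    total = disintegration_energy(alpha, 208) + gamma
    print(f"alpha = {alpha:.4f}  total = {total:.4f}")
```

Every row should come out within about 0.001 MeV of 6.200 MeV, which is the agreement the table is meant to display.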


Table 10.1 Alpha particles and gamma rays of ThC†

α-particle      Disintegration    γ-ray
energy, MeV     energy, MeV       energy, MeV     Sum, MeV
6.0837          6.2007                            6.2007
6.0445          6.1607            0.03995         6.2007
5.7621          5.8729            0.3267          6.1996
5.6202          5.7283            0.4709          6.1992
5.6012          5.7089            0.4908          6.1997

†In the second column a correction has been made for the kinetic
energy of the recoil nucleus.

Fig. 10.3 Radiations observed in the two branches of the Bi²¹²-Pb²⁰⁸ chain. The
vertical scale is proportional to the energy released. Strong transitions are indicated
by heavy arrows; light horizontal lines represent excited nuclear levels.

In ThC′ (Po²¹²), formed by β decay of ThC, there occur, in addition
to the main group of 8.776-MeV α particles which are believed to represent
transitions between the ground states of ThC′ and ThD, a very
few α particles of considerably higher energy. These long-range α
particles come from excited states of the parent nucleus, ThC′. This
behavior results from the fact that γ radiation is a relatively slow process
on a nuclear time scale, and, given sufficient energy, the α particles can
escape before the parent nucleus can become deexcited. The lifetime
of ThC′ is only a fraction of a microsecond.


b. Gamma Rays Nuclear γ-ray spectra consist typically of sharp lines;
in the naturally radioactive elements they are invariably associated either
with α or β emission. When a nucleus emits an α or β particle, it is
likely that the residual nucleus is left in an excited state. Ordinarily,
when it goes to a more stable configuration, the energy released is radiated
as a γ ray. The sharpness of both α- and γ-ray lines is consistent with
the idea that excited states of nuclei ordinarily have sharp, well-defined
energies.

If the excited state from which a given nucleus emits a γ ray is
long-lived, all the γ rays arising from a transition to the ground state
have almost the same energy. For example, the 93-keV photon from
Zn⁶⁷ has a resonance width of 4.8 × 10⁻¹¹ eV. Thus, when emitting or
absorbing photons, nuclei are among the most "sharply tuned" oscillators
in nature. A free Zn⁶⁷ nucleus emitting a γ ray recoils (momentum is
conserved!) with an energy which, while small compared with photon
energy, is large compared with the width of the γ-ray line. Consequently,
the photon does not have enough energy to raise another Zn⁶⁷ atom from
the ground state to the excited state. In 1958 Mössbauer discovered
that if both emitting and absorbing atoms were tightly bound in crystal
lattices, the full excitation energy could be emitted and absorbed.
(More exactly, the crystals as a whole take up the recoil momenta, and
the recoil energies are negligible.) The discovery of the Mössbauer
effect made it possible for physicists to exploit the extraordinary sharpness
of γ-ray lines. The 14-keV line from excited Fe⁵⁷, a widely used
Mössbauer source, is so sharp that the doppler shift is observable for
speeds of the order of 1 cm/min.
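
The orders of magnitude in this paragraph can be estimated with a brief sketch. It assumes the free-nucleus recoil energy E²/2Mc² (from momentum conservation with photon momentum E/c) and the first-order doppler shift Ev/c; mass numbers stand in for the nuclear masses, and the constants are our own inputs.

```python
U_MEV = 931.48      # rest energy of 1 u in MeV
C = 2.998e8         # speed of light, m/s

def recoil_energy_ev(photon_kev, mass_number):
    """Recoil energy (eV) of a free nucleus emitting a photon: E^2 / (2 M c^2)."""
    photon_ev = photon_kev * 1e3
    rest_ev = mass_number * U_MEV * 1e6    # mass number used as approximate mass in u
    return photon_ev**2 / (2 * rest_ev)

def doppler_shift_ev(photon_kev, speed_m_per_s):
    """First-order doppler shift (eV) for a source moving at the given speed."""
    return photon_kev * 1e3 * speed_m_per_s / C

zn_recoil = recoil_energy_ev(93, 67)          # compare with the 4.8e-11 eV width
fe_shift = doppler_shift_ev(14, 0.01 / 60)    # source speed of 1 cm/min
print(f"Zn-67 recoil: {zn_recoil:.3f} eV; Fe-57 shift at 1 cm/min: {fe_shift:.1e} eV")
```

On these assumptions the Zn⁶⁷ recoil energy is some nine orders of magnitude larger than the quoted line width, which is why a free nucleus cannot resonantly absorb the photon emitted by another free nucleus.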


10.2 Beta Rays and the Antineutrino 


Fig. 10.4 β-ray spectrum of Bi²¹⁰ showing the relative number of
electrons emitted in a constant-energy interval ΔE as a function of
the energy of the electrons.

β particles are electrons which may be ejected from nuclei with energies
up to several MeV. If the intensity of the source suffices, the energies
of the β rays can be measured by a β-ray spectrometer. The spectrum
of the β rays from a ₈₃Bi²¹⁰ source, analyzed by use of a magnetic field,
is shown in Fig. 10.4. The curve shows a broad and continuous range
of β energies. One might argue that this distribution represents β
transitions to a large number of excited quantum states of the daughter
nucleus, but then the corresponding γ spectrum would exhibit a continuous
distribution whereas the γ rays are in fact discrete. Furthermore,
β spectra entirely unaccompanied by γ rays are not rare. It was
early suggested that the observed β spectrum was distorted in some way;
e.g., it could be conceived that the β particles were actually homogeneous
as they left the nucleus but that they lost varying amounts of energy
as they went through the atomic electron cloud. That this is not true
was established by calorimeter measurements by Ellis and Wooster in
1927. Using a calorimeter with walls sufficiently thick to ensure that
no radiation could be detected outside, they established that the energy
absorbed in the calorimeter per disintegration corresponds to the average
β-ray energy. Thus they were forced to conclude either that the β
transitions leading to values other than the average violate the principle
of conservation of energy or that some particle is emitted which is
neither absorbed in the calorimeter nor detectable outside. The first
suggestion was seriously discussed by Bohr on the assumption that the
conservation of energy might apply only in a statistical sense to nuclear
processes. This distasteful hypothesis met its fate in an analysis by
Ellis of the ThC → ThD disintegration energies. This transition, shown
in Fig. 10.3, can occur in two ways: ThC goes to ThC′ by β emission,
with a maximum energy of 2.253 MeV; the decay of ThC′ then proceeds
mainly by an α transition of 8.946 MeV (including the recoil energy),
giving a total (maximum) energy release along this path of 11.199 MeV.
On the other branch we have first an α transition, of energy 6.203 MeV,
followed by a β particle of maximum energy 1.798 MeV, followed by
γ transitions (in ThD) aggregating 3.197 MeV, giving a total energy
change of 11.198 MeV. Thus the energy balance is accurately accounted
for by the assumption that the energy release in a β decay process
corresponds always to the maximum β-ray energy. This result has been amply
confirmed in later experiments on artificially produced radioactive nuclei.
In the β decay of B¹² → C¹², for example, the known mass difference
implies an energy release of 13.370 MeV, in reasonable agreement with
the observed maximum β-ray energy of 13.43 ± 0.06 MeV. The average
β-ray energy in this case is less than 7 MeV. It has also been established
that not only energy but also linear momentum and angular momentum
are inadequately accounted for by the observed β particles.
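
The energy balance for the two branches can be laid out as a short tally. The numbers are those quoted in the paragraph above; the dictionary labels are ours.

```python
# Energies in MeV for the two routes from ThC to ThD, as quoted in the text
branch_beta_first = {
    "beta, maximum energy": 2.253,
    "alpha, including recoil": 8.946,
}
branch_alpha_first = {
    "alpha": 6.203,
    "beta, maximum energy": 1.798,
    "gamma transitions in ThD": 3.197,
}

total_beta_first = sum(branch_beta_first.values())
total_alpha_first = sum(branch_alpha_first.values())
print(f"beta first:  {total_beta_first:.3f} MeV")
print(f"alpha first: {total_alpha_first:.3f} MeV")
print(f"difference:  {abs(total_beta_first - total_alpha_first):.3f} MeV")
```

The two totals agree to about 0.001 MeV only when the maximum β energies are used; substituting average β energies, which are substantially smaller, would leave the two routes unbalanced.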

If some other particle were emitted simultaneously with the electron,
this particle would share the total energy available and the β
spectrum would be continuous. In 1930 Pauli suggested that there was
indeed an uncharged particle of zero or small rest mass which accompanied
each electron from a nucleus. This particle was originally called the
neutrino, meaning "little neutral one," but for reasons to be developed
later, it seems preferable to call it the antineutrino. Then the β spectrum
of Fig. 10.4 arises from the reaction

$${}_{83}\mathrm{Bi}^{210} \rightarrow {}_{84}\mathrm{Po}^{210} + {}_{-1}\beta^{0} + \bar{\nu}$$

where the antineutrino is represented by ν̄ (not to be confused with
wavenumber). For many years all evidence for the existence of such a 
particle was indirect, in that energy, momentum, and angular momentum 
could be conserved only if there were an elusive and undetected third 
particle on the right of the reaction above. However, in 1953 and subse- 
quently, certain experiments involving extremely sensitive detecting 
apparatus have strongly supported the antineutrino hypothesis. 


10.3 Masses of Atoms 


In the radioactive chain of Sec. 8.6 we saw that isotopes of two chemically
identical species may have entirely different radioactive properties.
Isotopes have the same value of nuclear charge Ze but different atomic
masses; thus the mass is of major importance in determining the properties
of the nucleus.

Fig. 10.5 Schematic diagram of Goldstein's discharge tube.

The first step in the development of direct, precision mass-measuring 
techniques for atomic masses came with the discovery of the canal rays 
in the Crookes discharge tube, by Goldstein in 1886. Goldstein’s tube 
is schematically illustrated in Fig. 10.5. In a glass envelope are provided
two electrodes: the anode A, a simple side connection, and the cathode 
K, which is perforated by a long canal. When the tube is evacuated 
to the point where the Crookes dark space is well developed, one sees 
the cathode rays streaming upward and, in addition, a stream of rays 
issuing downward from the canal in the cathode. If the residual gas 
in the tube is neon, the ionization due to the cathode rays produces a 
pale blue light, while the canal rays are a brilliant red. Unlike the 
cathode rays (electrons), the canal rays are quite difficult to deflect with 
a magnetic field. It was not, in fact, until 1898 that Wien exhibited the 
effect with a powerful electromagnet, and showed, by simultaneous 
application of electrostatic and magnetic deflections, that the values 
of e/m were of the order of those observed for ions in electrolytic cells. 
Because he operated at too high pressures, Wien was unable to make a 
precise determination, and found a continuous spread of e/m values. 

In a series of experiments initiated in 1906, J. J. Thomson made a 
careful study of the formation and properties of the canal rays. His 
apparatus is shown diagrammatically in Fig. 10.6. The large discharge 
tube B is provided with an anode (not shown) and a cathode K, perforated 
with an extremely fine hole. (In some of his later experiments, Thomson 
made this canal by ruling a scratch in a piece of flat steel and laying
another piece on top of it.) The canal rays, or, as Thomson came to
call them, the positive rays, stream out of the canal into the space C,
where they strike a screen covered with a scintillating material or a
photographic plate P. A magnet M₁M₂ with electrically insulated pole
pieces A₁A₂ provides a strong magnetic field, and by electrical connection
to A₁ and A₂, an electrostatic field parallel to the magnetic field can be
applied. Thomson used pressures of the order of 10-? mm Hg and 
discharge voltages from 1000 to 20,000 V. 

In passing through the magnetic and electrostatic fields at A₁A₂, the
particle suffers simultaneous deflections along the z axis (perpendicular
to the plane of the drawing) and along the y axis. The displacement at
the end of the gap due to the magnetic deflection is

$$z = \frac{1}{2}\,\frac{Bev}{m}\left(\frac{l}{v}\right)^2 \tag{10.1}$$

and the displacement due to the electrostatic deflection is

$$y = \frac{1}{2}\,\frac{Ee}{m}\left(\frac{l}{v}\right)^2 \tag{10.2}$$

where l is the length of the gap. Since the rays travel in straight lines
after leaving the gap, the displacement on the plate PP is proportional
to these quantities. To obtain the shape of the trace on the plate, we
eliminate the velocity, obtaining

$$z^2 = C\,\frac{B^2}{E}\,\frac{e}{m}\,y \tag{10.3}$$


where C is a constant depending on l and other geometrical features.
This equation represents a parabola, with vertex at the origin; ions of
various velocities but with the same value of m/e lie at various points on
the parabola, the highest velocities nearest the vertex. If we consider
ions of only a single charge, the highest velocity will be given by

$$\tfrac{1}{2}mv_{\max}^2 = Ve$$

where V is the potential across the discharge tube; inserting this in the
expression for y above, we see that the displacement at the end of the gap is

$$y_{\min} = \frac{El^2}{4V}$$

Thus all parabolas, regardless of the value of m/e which they represent,
cut off at the same value of y.

Fig. 10.6 Thomson's positive-ray spectrograph.

Fig. 10.7 Parabolic traces on a film are formed by positive ions of the
same charge-to-mass ratio but different velocities when deflected by parallel
electric and magnetic fields. Reversal of the magnetic field produced the
other half of the parabolas in the insert.
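
Both properties of the traces, the parabolic shape and the common cutoff, can be checked numerically. The sketch assumes the small-deflection displacements at the end of the gap, z = ½(Bev/m)(l/v)² for the magnetic field and y = ½(Ee/m)(l/v)² for the electric field; the field strengths, gap length, and charge-to-mass ratios below are illustrative values of our own choosing.

```python
def deflections(e_over_m, v, E, B, l):
    """Displacements (z, y) at the end of the gap for an ion of speed v."""
    t_sq = (l / v) ** 2
    z = 0.5 * B * e_over_m * v * t_sq    # magnetic deflection
    y = 0.5 * E * e_over_m * t_sq        # electrostatic deflection
    return z, y

def cutoff_y(e_over_m, V, E, l):
    """y for the fastest singly charged ions, for which (1/2) m v^2 = V e."""
    v_max = (2 * V * e_over_m) ** 0.5
    return 0.5 * E * e_over_m * (l / v_max) ** 2

E_FIELD, B_FIELD, GAP, VOLTS = 1.0e4, 0.1, 0.05, 2.0e4   # V/m, T, m, V (illustrative)
for name, e_over_m in [("H+", 9.58e7), ("Ne+ (mass 20)", 4.79e6)]:
    z1, y1 = deflections(e_over_m, 1.0e6, E_FIELD, B_FIELD, GAP)
    z2, y2 = deflections(e_over_m, 2.0e6, E_FIELD, B_FIELD, GAP)
    print(name, z1**2 / y1, z2**2 / y2, cutoff_y(e_over_m, VOLTS, E_FIELD, GAP))
```

For each ion the ratio z²/y is the same at both speeds, so the points lie on one parabola, while the cutoff value of y is identical for the two ions, as the text asserts.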

A photograph showing the traces obtained with various gases is
reproduced in Fig. 10.7. It shows clearly the parabolic form and the
abrupt termination at a constant value of y. The fact that these traces are
sharp constitutes the first experimental proof that atoms have discrete masses,
or at least discrete values of e/m, and are not spread over a continuous range.


10.4 Isotopes of Stable Elements 


In an investigation (1913) of the composition of liquid-air residues, 
Thomson discovered, in addition to the neon line at the position expected
for mass 20, a faint, just resolved line at mass 22. After a number of
tests to eliminate other possible sources of such a line, he concluded that 
this line was due to another component of neon, with mass 22. This was 
the first observation of the isotopic constitution of a stable element and 
verified the prediction made by Soddy that elements with atomic weights 
differing markedly from integral values would be found to have a com- 
plex isotopic constitution. 

The subsequent development of mass-measuring techniques in the 
hands of Aston, and later Dempster, Bainbridge, Mattauch, Nier, and 
others, has enormously improved the sensitivity and precision of the 
method. Fundamental to all these improvements has been the use of 
focusing principles which greatly increase the intensity available at the 
detector. The development of more efficient ion sources has also been 
a matter of great importance in increasing the usable ion currents and 
extending the techniques to materials which are not available in gaseous 
form. The applications of the mass spectrometer can be generally 
divided into three categories: the identification of isotopes, the deter- 
mination of the relative abundances, and the precise measurement of 
their masses. 

Coupled with the increase in sensitivity gained in the course of 
50 years’ development of the mass spectrograph has also come an enor- 
mous increase in resolution and accuracy. Like chemical atomic weights, 
the masses of isotopes are seldom expressed in absolute units but are 
given in terms of an adopted standard. The standard for the unified
atomic mass unit¹ is based on the most abundant isotope of carbon with
the mass of a C¹² atom taken as exactly 12 of these units (abbreviated
u by international agreement). The absolute value of this mass unit
in kilograms can be obtained from the Avogadro number (which is, in
turn, derived from several other basic physical constants). Thus

$$1\ \mathrm{u} = \tfrac{1}{12}M_{\mathrm{C}^{12}} = 1.6604 \times 10^{-27}\ \mathrm{kg}$$

and the rest energy of 1 u is 931.48 MeV.
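
Both numbers can be recovered from the Avogadro number with a few lines. The constants below are present-day values that we supply as assumptions; they differ slightly from those available when the text was written, so the results agree with the quoted figures only to about four significant figures.

```python
AVOGADRO = 6.02214076e23     # per mole (present-day value, assumed here)
C = 2.99792458e8             # speed of light, m/s
E_CHARGE = 1.602176634e-19   # joules per electron volt

# One mole of C-12 atoms has a mass of 12 g, so 1 u = (1 g) / N_A
u_kg = 1.0e-3 / AVOGADRO
u_mev = u_kg * C**2 / E_CHARGE / 1.0e6

print(f"1 u = {u_kg:.4e} kg, rest energy = {u_mev:.2f} MeV")
```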

The most precise direct mass measurements are now made by the
so-called doublet method, in which two ions of the same nominal mass
are compared. Figure 10.8 shows the (He⁴)⁺-(H₂²)⁺ doublet; the
left-hand line is due to singly ionized helium atoms (4 u and one charge)
and the right-hand line is from singly ionized deuterium molecules (also 4
u and one charge). The center-to-center spacing can be determined to an
accuracy of about 1 part in 3000. If the dispersion of the spectrometer
is known to the same accuracy, the value of the He⁴ mass relative to
that of the deuteron can be deduced with an accuracy of 0.000008 u.
By use of such other doublets as (H₂¹)⁺-(H²)⁺, (C¹²H₄¹)⁺-(O¹⁶)⁺, and
(H₃²)⁺-(C¹²)⁺⁺ the mass can be linked to the standard C¹², using only
mass-difference measurements. A few masses of common nuclides
obtained in this way are given in Table 10.2.

¹ The atomic mass unit was originally defined as one-sixteenth the mass of an
O¹⁶ atom and formed the basis for the physical scale of atomic weights. It differed
slightly from the chemical scale, which was based on natural oxygen with its small
amounts of O¹⁷ and O¹⁸. The unified atomic mass unit is 0.0318 percent larger than
the atomic mass unit based on O¹⁶.

Fig. 10.8 Mass spectrograph record of (He⁴)⁺-(H₂²)⁺ doublet; in unified atomic
mass units the doublet separation is 0.025596 u. [H. Ewald, Z. Naturforsch.,
5a, p. 1 (1950).]
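
The helium-deuterium doublet can be estimated directly from the atomic masses in Table 10.2. The sketch neglects the small molecular binding energy of the deuterium molecule, which is our simplification; the variable names are ours.

```python
M_H2 = 2.014102    # mass of the H-2 (deuterium) atom in u, from Table 10.2
M_HE4 = 4.002603   # mass of the He-4 atom in u, from Table 10.2

# Each ion lacks one electron, so the electron masses cancel in the difference
separation = 2 * M_H2 - M_HE4
uncertainty = separation / 3000     # spacing measured to about 1 part in 3000

print(f"separation = {separation:.6f} u, uncertainty = {uncertainty:.1e} u")
```

The separation comes out close to the measured 0.0256 u, and one part in 3000 of it is roughly 0.000008 u. This is why comparing two nearly equal masses gives far better absolute accuracy than measuring either mass alone.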

Between 1919, when Aston’s first spectrograph was completed, and 
1922, the isotopic constitution of 27 elements was investigated, and in 
the following 2 years 26 more were reported. A table published by 
Bainbridge in 1953 lists over 280 naturally occurring isotopes, most 
of which have been discovered by the mass-spectrographic techniques. 
In every case where the chemically determined atomic weight differs 
greatly from a whole number, the element is found to consist of more 
than one isotope. We may anticipate our later discussion of the exact 
masses by saying that the isotopic masses observed are not quite integral 
multiples of the atomic mass unit, but all stable isotopes have masses 
which lie within less than 0.1 u of an integer; it is this integer that we
call the mass number A.

Table 10.2 Masses in unified atomic mass units; C¹² = 12.000000

Nuclide    Mass         Nuclide    Mass          Nuclide    Mass
Neutron    1.008665     ₃Li⁶       6.015126      ₉F¹⁹       18.998405
Proton     1.007276     ₃Li⁷       7.016005      ₁₁Na²³     22.989771
₁H¹        1.007825     ₄Be⁹       9.012186      ₁₃Al²⁷     26.981539
₁H²        2.014102     ₅B¹⁰       10.012939     ₂₉Cu⁶³     62.929592
₁H³        3.016050     ₅B¹¹       11.009305     ₄₇Ag¹⁰⁷    106.905094
₂He³       3.016030     ₇N¹⁴       14.003074     ₇₉Au¹⁹⁷    196.966541
₂He⁴       4.002603     ₈O¹⁶       15.994915     ₉₂U²³⁸     238.050770


10.5 The Nucleons 


Since all nuclei have charges which are integral multiples of the protonic 
charge, it is reasonable and fruitful to assume that a nucleus of atomic 
number Z contains Z protons. As we saw above, all atomic masses are 
very nearly an integral number A of atomic mass units, but A is roughly 
twice Z. In 1920 Rutherford suggested that there might be a particle 
of roughly the proton mass but no charge. This particle, known as the 
neutron, was discovered in 1932, and we now think of any nucleus as 
being a collection of Z protons and A — Z neutrons. 

Protons and neutrons are referred to as nucleons. In theoretical 
treatment of nuclear structure it is common to regard the proton and 
neutron as different charge states of “the nucleon.” The proton and 
the neutron have comparable sizes with radii of the order of 2 × 10⁻¹⁵ m;
both have intrinsic spin angular momentum such that the component
in the direction of an applied external field is +ħ/2 or −ħ/2.


Associated with the spin of the proton there is a small magnetic
moment (Sec. 9.8)

$$\mu_p = 1.521 \times 10^{-3}\ \text{Bohr magneton} = 2.79275\,\frac{e\hbar}{2M_p} \tag{10.4}$$

where M_p is the proton mass. It is customary to measure nuclear
magnetic moments in terms of the unit eħ/2M_p, which is called the
nuclear magneton. Similarly the neutron, with no net charge, possesses
a magnetic moment comparable in magnitude but opposite in sign to
that of the proton;

$$\mu_n = -1.91315\ \text{nuclear magnetons} \tag{10.4a}$$

In addition to mass, each nucleon contributes spin angular momentum
to a nucleus, but this is a vector quantity, and the resultant nuclear
spin is usually not large—indeed, it is typically zero for nuclei with even 
numbers of both protons and neutrons. Similarly, each nucleon con- 
tributes to the magnetic moment of the nucleus; however, the resultant 
magnetic moment cannot be predicted with precision by adding the 
individual magnetic moments. 

The neutron is slightly more massive than the proton:

    M_p = 1.6725 × 10⁻²⁷ kg = 1.00728 u;    M_p c² = 938 MeV
    M_n = 1.6748 × 10⁻²⁷ kg = 1.00866 u;    M_n c² = 939.5 MeV

A free neutron is unstable, decaying by β emission with a half-life
of 12 min in the reaction

$${}_{0}n^{1} \rightarrow {}_{1}\mathrm{H}^{1} + {}_{-1}\beta^{0} + \bar{\nu} \tag{10.5}$$
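
The energy available in this decay follows from the masses in Table 10.2. The sketch uses the atomic mass of H¹, which already includes the electron, so no separate electron mass is needed; the antineutrino rest mass is taken as zero, which is an assumption.

```python
U_MEV = 931.48          # rest energy of 1 u in MeV
M_NEUTRON = 1.008665    # u, from Table 10.2
M_H1_ATOM = 1.007825    # u, from Table 10.2 (proton plus electron)

# Energy shared by the electron, the antineutrino, and the recoiling proton
q_value = (M_NEUTRON - M_H1_ATOM) * U_MEV
print(f"energy release = {q_value:.3f} MeV")
```

The result, a little under 0.8 MeV, is the maximum kinetic energy available to the β particle, apart from the very small proton recoil.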


Although nuclei are exceedingly small, several types of measurements
have been devised to determine nuclear radii. While there are
minor differences in the empirical values, all methods agree on the order
of magnitude. The volumes of all nuclei are roughly proportional to
the number of nucleons A. If we assume a roughly spherical shape, we
find that the radius of a nucleus of mass number A is given by

$$r = 1.2 \times 10^{-15} A^{1/3}\ \mathrm{m} = 1.2\,A^{1/3}\ \text{fermi} \tag{10.6}$$

The heaviest of all common nuclei is that of uranium 238, for which the
nuclear radius is somewhat less than 10⁻¹⁴ m, as compared with the
typical atomic radius of about 10⁻¹⁰ m. The densities of nuclei of all
kinds are roughly equal, about 2 × 10¹⁷ kg/m³.
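
A quick numerical illustration, taking Eq. (10.6) as r = 1.2 × 10⁻¹⁵ A^(1/3) m and approximating the nuclear mass as A unified mass units (both stated here as assumptions of the sketch):

```python
import math

U_KG = 1.6604e-27    # kg per unified atomic mass unit
R0 = 1.2e-15         # m, coefficient in the radius formula

def nuclear_radius(A):
    """Radius in metres of a nucleus of mass number A."""
    return R0 * A ** (1 / 3)

def nuclear_density(A):
    """Mean density in kg/m^3, with the nuclear mass approximated as A u."""
    volume = 4 / 3 * math.pi * nuclear_radius(A) ** 3
    return A * U_KG / volume

for A in (16, 107, 238):
    print(f"A = {A}: r = {nuclear_radius(A):.2e} m, density = {nuclear_density(A):.2e} kg/m^3")
```

Because the volume is proportional to A, the mass number cancels in the density, which is why every nucleus gives the same value of about 2 × 10¹⁷ kg/m³.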


10.6 The Positron 


In 1932 Anderson found tracks of a particle with the same mass as the
electron but positive charge in cloud-chamber studies of cosmic rays.
These particles, called positrons, appear when high-energy photons
($h\nu > 1.02$ MeV) interact with nuclei; each positron is accompanied
by an electron. The reaction may be written

$h\nu \to {}_{+1}m\,c^2 + {}_{-1}m\,c^2 + K^+ + K^-$    (10.6a)

where ${}_{+1}m$ and ${}_{-1}m$ are respectively the masses of the positron and
electron and $K^+$ and $K^-$ are their kinetic energies. The nucleus in the field
of which the interaction takes place does not appear in (10.6a), but the
involvement of another particle is required to satisfy the conservation
of momentum.

In this reaction we have the conversion of the electromagnetic 
energy of the photon into the rest masses of the positron and electron and 
their kinetic energies. The process is known as pair production. The
rest energy of the positron is 0.511 MeV, equal to that of the electron,
so that the threshold energy for pair production is 1.02 MeV. Only
photons with energy greater than 1.02 MeV can create pairs; any addi- 
tional energy of the photon is shared between the particles as kinetic 
energy. 
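
The energy bookkeeping of pair production can be illustrated with a minimal sketch. It assumes the recoil energy taken by the nucleus is negligible, so that everything above the threshold appears as kinetic energy of the pair; the function name is hypothetical.

```python
ELECTRON_REST_MEV = 0.511   # rest energy of the electron or positron (from the text)


def pair_kinetic_energy(photon_mev):
    """Total kinetic energy K+ + K- in MeV left over after the pair is created.

    Nuclear recoil energy is neglected (an assumption of this sketch).
    Raises ValueError if the photon is below the pair-production threshold.
    """
    threshold = 2 * ELECTRON_REST_MEV
    if photon_mev < threshold:
        raise ValueError("photon energy is below the pair-production threshold")
    return photon_mev - threshold


if __name__ == "__main__":
    print(pair_kinetic_energy(3.0))   # kinetic energy shared by the pair, in MeV
```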

Positrons exist for a short time. As they move through matter, 
they lose energy rapidly. Eventually they combine with an electron 
in a process called pair annihilation, with the conversion of their energies 
into electromagnetic energy. As the positron slows down, it may pick 
up an electron and form positronium, a hydrogen-like “atom” in which the
two particles “revolve” about their center of mass. Usually formed in 
a state of high angular momentum, the positronium makes successive 
radiative transitions until the ground state with energy of 6.7 eV is 
reached. If pair annihilation occurs when the linear and angular 
momenta of the positronium are zero, two 0.511-MeV photons are emitted 
in precisely opposite directions; only in that way can the conservation 
laws be satisfied for two-photon decay (see Sec. 10.10). 

The existence of the positron had been predicted by Dirac in 1928
in his development of a theory of the electron which was invariant under
Lorentz transformation. According to this theory an electron has energy
$\pm\sqrt{(m_0c^2)^2 + p^2c^2}$, where p is the momentum; thus negative as well
as positive energies are allowed. These ideas can perhaps be made a
little clearer by reference to the diagrams of Fig. 10.9. Here we represent
the possible energy states of a free electron by a continuum, extending
from $E = +m_0c^2$ upward and from $E = -m_0c^2$ downward. The
lower levels are all filled, and their effect merely adds a constant (infinite)
charge which is undetectable. In Fig. 10.9b a $\gamma$ ray of energy $h\nu$ ($> 2m_0c^2$)
raises an electron in a state of energy $-E_1$ to a state of positive energy
$+E_2$; the electron now has positive kinetic energy $K = h\nu - E_1 - m_0c^2$,
and the “hole” behaves like a positron of (positive) kinetic energy
$E_1 - m_0c^2$. Since momentum cannot be conserved in the interaction
of a $\gamma$ quantum and two electrons, the event must take place near another
particle (usually a nucleus) which can take up the excess momentum.
The interaction with a nucleus takes place through the coulomb field,
and calculation gives a roughly $Z^2$ dependence for the cross section for
pair production. [...]

Fig. 10.9 Transitions between positive and negative energy states of free electrons.

[...] electron at rest (Fig. 10.9c). Momentum is conserved
if two quanta, each of energy $m_0c^2$, are emitted. One-quantum annihilation,
resulting in a single $\gamma$ ray of energy $2m_0c^2$, is also possible if the electron
is strongly bound to a nucleus. Positrons can be annihilated in
flight through collisions with electrons, but the probability falls off
rapidly as the velocity increases. The lifetime of a very slow positron
in lead is about $10^{-10}$ s. The positron is called the antiparticle of the
electron or the antielectron, since a positron and an electron can annihilate
one another.


10.7 Muons and Pions 


In 1936 Anderson and Neddermeyer, in cloud-chamber photographs of 
cosmic-ray events, found evidence of particles of mass 207 times that of 
the electron. These particles, called muons, may bear a charge of +e or 
$-e$. Both the $\mu^+$ and $\mu^-$ particles are short-lived, decaying by the
reactions

$\mu^- \to e^- + \bar{\nu} + \nu_\mu$    (10.7a)
$\mu^+ \to e^+ + \nu + \bar{\nu}_\mu$    (10.7b)

where the overbar implies an antiparticle and $\nu_\mu$ is the mu neutrino, a
particle similar to the neutrino but distinguishable from it. A decade
later another particle of intermediate mass, called the pion or $\pi$ meson,
was discovered. Pions may be positive, negative, or neutral. The mass
of a charged pion is 273 electron masses, while that of the neutral pion
is 264 electron masses. Negative pions are strongly attracted to nuclei
and are quickly captured in dense materials; positive pions are repelled
by the nuclear charge and are less likely to be captured. Isolated pions
of all varieties are unstable and typically decay by the reactions

$\pi^- \to \mu^- + \bar{\nu}_\mu$    (10.8a)
$\pi^+ \to \mu^+ + \nu_\mu$    (10.8b)
$\pi^0 \to$ 2 67-MeV $\gamma$ rays    (10.8c)


Pions are intimately linked to the attractive force between nucleons. 


Other mesons, in addition to pions, have been discovered (see 
table inside rear cover). 


10.8 Particles and Antiparticles 


Dirac’s theory anticipating the positron (Sec. 10.6) could be interpreted
as implying a particle identical to the proton except for a negative
charge. The existence of this particle, the antiproton, was established in
1955 by Segrè, Chamberlain, and their collaborators. Antiprotons were
produced by bombarding protons in a target with 6-GeV protons,
thereby inducing the reaction


$p + p\ (+\ \text{energy}) \to p + p + p + \bar{p}$    (10.9)
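
The choice of roughly 6-GeV protons can be made plausible with a short calculation. The sketch below uses the standard relativistic threshold formula for a beam particle striking a stationary target, $K_{th} = [(\sum m_{final})^2 - (m_{beam} + m_{target})^2]c^2 / 2m_{target}$; this formula is not derived in the text and is introduced here as an assumption, as are the function name and the neglect of any motion of the target protons inside nuclei.

```python
PROTON_REST_MEV = 938.0   # proton rest energy in MeV (from the text)


def threshold_kinetic_energy(beam_mev, target_mev, final_rest_mev):
    """Minimum beam kinetic energy (MeV) for a reaction on a stationary target.

    beam_mev, target_mev: rest energies of the incident and target particles.
    final_rest_mev: iterable of rest energies of all final-state particles.
    """
    total_final = sum(final_rest_mev)
    return (total_final ** 2 - (beam_mev + target_mev) ** 2) / (2.0 * target_mev)


if __name__ == "__main__":
    # Reaction (10.9): the final state holds three protons and one antiproton.
    k_th = threshold_kinetic_energy(PROTON_REST_MEV, PROTON_REST_MEV,
                                    [PROTON_REST_MEV] * 4)
    print(k_th / 1000.0, "GeV")
```

For four final particles of equal mass the expression reduces to six proton rest energies, which is somewhat below 6 GeV.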


$p + \bar{p} \to \pi^+ + \pi^- + \pi^+ + \pi^- + \pi^0$    (10.9a)

Fig. 10.10 (a) Bubble-chamber photograph showing the path of an anti- 
proton entering from below. Near the bottom of the picture the antiproton 
annihilated a proton, forming two positive pions, two negative pions, and a 
neutral pion, which leaves no track. In the upper quadrant of the picture 
a negative pion interacted with a proton. Among the resulting products 
were a positron, which formed the counterclockwise spiral at top center, and 
an electron, which produced the smaller clockwise spiral. (b) Explanatory 
sketch. (Courtesy of Lawrence Radiation Laboratory, University of California.)


One year later the antineutron $\bar{n}$ was discovered at the same laboratory.
Since the neutron bears no charge, the antineutron is also neutral.
It is quickly annihilated, either by a proton or a neutron, usually with
the production of several pions. If an antineutron is not annihilated
by a nucleon, it decays by the reaction $\bar{n} \to \bar{p} + \beta^+ + \nu$.

When a beam of high-energy protons bombards a target, many 
kinds of particles may appear in addition to those already introduced. 
The more common “elementary” particles are listed in the table inside 
the rear cover. They fall into four natural divisions: (1) photons, with 
spin (quantum number) 1 and zero rest mass; (2) leptons (light particles),
with spin 1/2 and rest mass between 0 and 210 electron masses; (3) mesons,
with 0 or integer spin and rest mass greater than 260 electron masses;
(4) baryons (heavy particles), with masses greater than 1800 electron
masses and with half-integral spin. It appears that all baryons, leptons,
and charged mesons have distinct antiparticles. (Of course, any charged
antiparticle bears a charge opposite to that of the corresponding particle.)
Photons and the neutral $\pi^0$ and $\eta$ mesons do not possess distinct
antiparticles but are said to be their own antiparticles. However, there
are two distinct neutral K mesons which form a particle-antiparticle 
pair, just as the neutron and antineutron do. 

In atomic physics it has long been useful to consider an atom as 
composed of extranuclear electrons and a nucleus formed of protons 
and neutrons. There is reason to believe that a positron and an antiproton
could form an atom of antihydrogen, which would have a spectrum
similar to that of ordinary hydrogen. Indeed, from a collection
of antiprotons, antineutrons, and positrons a world of antimatter might
be constructed which could be indistinguishable from our world so long
as everything were made of antiparticles. However, if some of this
contraterrene matter were to come in contact with ordinary matter,
particle-antiparticle annihilation would occur with a tremendous release
of energy. At one time it was thought that if we knew a decay mode
of any particle, we could immediately write a corresponding possible
decay mode for its antiparticle by changing each particle in the reaction
to its antiparticle [just as was done for the muon decays in Eq. (10.7a, b)].
The replacement of all particles in a reaction by their antiparticles (including
neutrals) is known as charge conjugation. It is now known that the
principle of charge-conjugation invariance is violated in so-called weak
interactions.

All leptons and baryons have half-integral (1/2, 3/2, . . .) spin quantum
numbers, and all obey Fermi-Dirac statistics (Chap. 20). If we assign
to each baryon the baryon number A = 1, to each antibaryon A = -1,
and to other particles A = 0, we find that in every reaction the total
baryon number is conserved. Thus in Eq. (10.9) the total baryon
number remains 2, while in Eq. (10.5) it is 1. Note that for any nuclide,
such as C$^{12}$, the superscript represents the baryon number. As far as
is known, the conservation of baryons is always obeyed in every sort of
particle decay and nuclear reaction. Baryons more massive than the
proton are known as hyperons; all except neutrons in nuclei are unstable.

Similarly, if we assign a lepton number of 1 to the $\mu^-$, $e^-$, $\nu$, and $\nu_\mu$
particles and a lepton number of $-1$ to $\mu^+$, $e^+$, $\bar{\nu}$, and $\bar{\nu}_\mu$, we find that there
is a similar conservation of lepton number in all reactions. In Eq. (10.5)
we have total lepton number zero, while in Eqs. (10.7a, b) the lepton
numbers are 1 and $-1$ respectively. We now see why the particle
accompanying the electron in a $\beta$ decay is called an antineutrino. Before
the decay we have no leptons; after the decay there is one lepton and one
antilepton, or a net zero lepton.
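
The bookkeeping of charge, baryon number, and lepton number described above can be sketched as a small table lookup. The particle labels (such as "anti-nu") and the function names are hypothetical conventions chosen for this illustration, and passing the check is only a necessary condition for a reaction to occur, not a sufficient one.

```python
# Each entry is (charge in units of e, baryon number, lepton number),
# following the assignments given in the text.
QUANTUM_NUMBERS = {
    "p": (1, 1, 0), "n": (0, 1, 0),
    "anti-p": (-1, -1, 0), "anti-n": (0, -1, 0),
    "e-": (-1, 0, 1), "e+": (1, 0, -1),
    "mu-": (-1, 0, 1), "mu+": (1, 0, -1),
    "nu": (0, 0, 1), "anti-nu": (0, 0, -1),
    "nu_mu": (0, 0, 1), "anti-nu_mu": (0, 0, -1),
    "pi+": (1, 0, 0), "pi-": (-1, 0, 0), "pi0": (0, 0, 0),
    "gamma": (0, 0, 0),
}


def totals(particles):
    """Return the summed (charge, baryon number, lepton number)."""
    return tuple(sum(QUANTUM_NUMBERS[name][i] for name in particles)
                 for i in range(3))


def is_allowed(initial, final):
    """True if charge, baryon number, and lepton number all balance."""
    return totals(initial) == totals(final)


if __name__ == "__main__":
    print(is_allowed(["n"], ["p", "e-", "anti-nu"]))        # Eq. (10.5)
    print(is_allowed(["mu-"], ["e-", "anti-nu", "nu_mu"]))  # Eq. (10.7a)
    print(is_allowed(["n"], ["p", "e-", "nu"]))             # lepton number not balanced
```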

On the other hand, photons (spin 1) and mesons (spin quantum
number 0 or integer) are not conserved in reactions. They obey different
statistics (Bose-Einstein) than baryons and leptons, as we shall see in
Chap. 20.

In addition to listing many of the particles of modern physics, the 
table inside the rear cover shows the dominant decay modes for the 
unstable particles. Since neutral particles do not leave a trail of ions 
in cloud chambers or bubble chambers, their paths are not visible in 
photographs of the chamber. The presence of such particles is inferred 
from indirect evidence such as the subsequent appearance of charged
particles (arising either from the decay of the neutral one or from a 
reaction induced by the latter) or from the failure of one or more of the 
conservation laws. In view of the important role of conservation prin- 
ciples in physics, we set out below those laws which we shall hereafter 
assume are applicable to all processes. 


10.9 The Conservation Laws 
From classical physics come the following laws: 


1. Conservation of momentum
2. Conservation of energy
3. Conservation of angular momentum
4. Conservation of charge


For any isolated system each of these four quantities remains constant. 
(As we saw in Chap. 3, the conservation of momentum and of energy 
may be regarded as a single law if energy/c is regarded as the time 
component of the momentum four-vector for the system.) 

Further, we introduce the particle-conservation laws: 


5. Conservation of baryons. In any reaction the total baryon number is constant. 


6. Conservation of leptons. In any reaction the total lepton number remains
constant. 


There are, in addition, more sophisticated conservation laws for 
which additional background must be supplied before they can be prop- 
erly introduced. 


10.10 Interactions between Particles 


The particles of modern physics interact with one another to form 
composites, some stable (like an oxygen atom) and others transient (like 
positronium). It appears that the forces involved in the operations of 
the physical universe can be classified into four types; in the order of 
increasing relative strength they are (1) gravitational interactions, (2) 
the weak interaction, which governs the decay of many unstable nuclei 
and other particles, such as muons, (3) the electromagnetic interactions, 
which act on all charged particles and which involve the photon, the 
fundamental unit of electromagnetic radiation, and (4) the nuclear (or 
strong) interaction. 

It is interesting that it is the weakest force which was first understood
quantitatively. The gravitational force is always attractive, its
range is in principle unlimited, and the gravitational field at the earth’s
surface is a dominant factor in the motions of macroscopic bodies. The
gravitational force $F_G$ between two particles of masses $m_1$ and $m_2$ is
given by Newton’s law of universal gravitation $F_G = Gm_1m_2/r^2$, where
$G = 6.670 \times 10^{-11}$ N-m$^2$/kg$^2$ and r is the distance between [...]
$= q_1q_2/4\pi\epsilon_0r^2$, where [...] The electrostatic interaction has
[...] as the gravitational, but for [...] Clearly, gravitational forces between
[...] are quite negligible. Two charges [...] through the [...] Thus, between
[...] a velocity-dependent [...] Furthermore, [...] been developed [...]
force laws are [...]
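
To give a rough sense of scale for the gravitational and electrostatic force laws, the following sketch compares the two forces for a pair of protons. The values of the permittivity of free space and the elementary charge are assumed here and do not come from the text; the function names are illustrative.

```python
import math

G = 6.670e-11          # gravitational constant, N-m^2/kg^2 (from the text)
EPS0 = 8.854e-12       # permittivity of free space, F/m (assumed value)
E_CHARGE = 1.602e-19   # elementary charge, C (assumed value)
M_PROTON = 1.6725e-27  # proton mass, kg (from the text)


def gravitational_force(m1, m2, r):
    """Magnitude of Newton's gravitational force, G m1 m2 / r^2."""
    return G * m1 * m2 / r ** 2


def coulomb_force(q1, q2, r):
    """Magnitude of the electrostatic force, q1 q2 / (4 pi eps0 r^2)."""
    return abs(q1 * q2) / (4.0 * math.pi * EPS0 * r ** 2)


def proton_force_ratio(r=1.0e-15):
    """Ratio of coulomb repulsion to gravitational attraction for two protons."""
    return coulomb_force(E_CHARGE, E_CHARGE, r) / gravitational_force(
        M_PROTON, M_PROTON, r)


if __name__ == "__main__":
    print(proton_force_ratio())
```

Since both forces vary as the inverse square of the separation, the ratio does not depend on the distance chosen.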
Some particles are coupled to all four interactions. For example,
the proton is a strongly interacting nucleon which, by virtue of its
charge, also participates in the electromagnetic interaction and, by virtue
of its mass, in the gravitational interaction. Further, it is involved in
positron emission from a radioactive nucleus, whereby it is transformed
into a neutron, neutrino, and a positron through the weak interaction.
The electron and the charged muon participate in all but the strong
interaction.


10.11 Preternatural Atoms 


The hydrogen atom consists of an electron bound to a proton through 
the electromagnetic interaction; other atoms are composed of a positive 
nucleus binding enough electrons to balance the nuclear charge. However,
all charged particles participate in the electromagnetic interaction,
and there exist a number of preternatural atoms formed by various
combinations of charged particles.

Positronium (Sec. 10.6) is an example of such an “atom”; it consists
of a positron and an electron bound in a hydrogen-like system. There
are two forms of positronium: parapositronium, in which the z components
of spin angular momenta of the particles are opposite, and orthopositronium,
in which they are in the same direction. In the ground state
neither form has any orbital angular momentum. Consequently, in the
ground state parapositronium has no angular momentum, while orthopositronium
has one unit. The former, with a half-life of $8 \times 10^{-11}$ s,
usually decays by two-photon emission. In a frame in which the center
of mass is at rest these photons must be emitted in opposite directions
with equal momenta and energies of 0.511 MeV. Each photon bears
one unit of angular momentum, and these add vectorially to give zero. For
orthopositronium it is impossible to satisfy the conservation laws by
two-photon decay, and annihilation ordinarily occurs with three photons
produced.


In 1949 Chang reported that for every negative muon stopped in
lead, an average of roughly three photons were emitted with energies
between 1 and 5 MeV. These photons arise from muon transitions
between states directly analogous to the Bohr-Sommerfeld levels. As a
muon passes through matter, it loses kinetic energy and may be captured
in a Bohr orbit about the nucleus, thus forming a muonic atom with a
nucleus of charge Z, a muon, and Z - 1 electrons. The captured muon
makes a series of transitions, reaching the ground state (n = 1) in about
$10^{-13}$ s. Thereafter the muon either decays, Eq. (10.7a), or is captured
by a nucleon in the reaction $\mu^- + p \to n + \nu_\mu$.

If for a moment we consider the nucleus as a point charge, the
radius r of a muonic Bohr orbit would be 1/207 the radius of the corresponding
electron orbit. For lead one finds $r = 3 \times 10^{-15}$ m for n = 1,
while Eq. (10.6) gives a radius for the lead nucleus just under $10^{-14}$ m.
If this value is accepted, the muon n = 1 orbit lies well within the
nucleus, while the n = 2 orbit lies just outside. Experimentally one
finds that the n = 2 to n = 1 muon transition in lead leads to a photon
of about 6 MeV energy, whereas for a point nucleus the predicted energy
is 16 MeV. Thus the finite nuclear size results in a large energy reduction
for what might be called the muonic $K_\alpha$ line for lead. Study of
muonic atoms can yield important information regarding (1) nuclear
radii, (2) nuclear quadrupole moments, (3) the magnetic moment of the
muon, (4) possible nonuniformity of nuclear charge distribution, and (5)
nuclear polarizability and compressibility. For probing the nuclear
electric field the $\mu^-$ is an ideal test particle. Its interaction with nucleons
is exceedingly weak, and in its brief half-life the $\mu^-$ may traverse several
meters of nuclear matter with average density of $10^{17}$ kg/m$^3$.
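
The comparison of the muonic orbit radii with the nuclear radius can be reproduced with a short sketch. It assumes a point nucleus, a Bohr radius of 5.29e-11 m, a muon mass of 207 electron masses with no reduced-mass correction, and the isotope lead 208; these choices and the function names are illustrative assumptions.

```python
BOHR_RADIUS_M = 5.29e-11   # first Bohr radius of hydrogen, m (assumed value)
MUON_MASS_RATIO = 207      # muon mass in electron masses (from the text)
R0_M = 1.2e-15             # radius constant of Eq. (10.6), m


def muonic_orbit_radius(z, n=1):
    """Bohr-model radius (m) of the nth muon orbit about a point charge Z."""
    return n ** 2 * BOHR_RADIUS_M / (MUON_MASS_RATIO * z)


def nuclear_radius(mass_number):
    """Nuclear radius (m) from Eq. (10.6)."""
    return R0_M * mass_number ** (1.0 / 3.0)


if __name__ == "__main__":
    z_lead, a_lead = 82, 208   # lead 208 is an assumed choice of isotope
    print("n = 1 orbit:", muonic_orbit_radius(z_lead, 1))
    print("n = 2 orbit:", muonic_orbit_radius(z_lead, 2))
    print("nucleus:    ", nuclear_radius(a_lead))
```

With these inputs the n = 1 radius falls inside the nuclear radius and the n = 2 radius outside it, in line with the statement in the text.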

X-rays have been observed not only from muonic atoms but also
from the capture of $\pi^-$ and $K^-$ particles. Since pions and kaons participate
in the strong interaction, they are quickly absorbed by nuclei,
so that a $\pi^-$ meson, for example, is ordinarily absorbed before it reaches
the n = 1 level except in the case of very light nuclei.

A positive muon may capture an electron to form an “atom” of
muonium, a light isotope of hydrogen with the muon replacing the
proton. The energy levels are very close to those of hydrogen, differing
slightly because of the smaller reduced mass (and, in detail, because the
muon magnetic moment is 3.183 times that of the proton). Still other
“atoms” can be formed with an electron and one of the other positive
particles, for example, $\pi^+$, $K^+$, $\Sigma^+$; in each case the lifetime of the atom
is essentially that of the positive particle.


Problems 


1. A collimated beam of singly ionized carbon atoms with 4 keV
kinetic energy enters a region where there is a uniform magnetic induction
of 0.25 Wb/m$^2$ perpendicular to the velocity. If the ions travel a
semicircular path to a photographic plate, find the radii of the paths of
C$^{12}$ and C$^{13}$ ions and their separation at the plate.

Ans: 12.62 cm; 13.14 cm; 1.04 cm

2. Imagine a model of an electron as a solid sphere of radius $5 \times 10^{-14}$ m.
Find the angular velocity and tangential velocity of such a sphere if
its angular momentum is $\sqrt{3}\hbar/2$.

Ans: $1.0 \times 10^{23}$ rad/s; $5 \times 10^9$ m/s (~17c)
3. Show that pair production and pair annihilation with only a single
photon emitted cannot occur unless some other particle is involved in
addition to the pair and the photon.


4. Take mass-spectrographic doublets

C$_3$H$_8$ - CO$_2$ = a
CO$_2$ - CS = b
C$_6$H$_4$ - CS$_2$ = d

Show that if the mass of C$^{12}$ is 12 exactly, the mass of hydrogen is
H = 1 + (2a + 2b - d)/12.

5. The triton (${}_1$H$^3$) is unstable, emitting a $\beta$ ray with a maximum energy
of 18.5 keV. Compute the expected maximum energy from the masses
of H$^3$ and He$^3$ in Table 10.2.

6. Assuming that a deuteron can be crudely approximated as a proton
and a neutron with centers $2.5 \times 10^{-15}$ m apart and a binding energy
of 2.2 MeV, compare the energy associated with the strong (nuclear)
interaction (a) to the electrostatic potential energy of two protons the
same distance apart and (b) to the gravitational potential energy of
two nucleons. What is the ratio of the coulomb repulsive force between
two protons to the gravitational attractive force?

7. A cosmic-ray photon with $\lambda = 0.00400$ Å interacts with a lead
nucleus to form a pair in a cloud chamber. There is a uniform magnetic
field of 0.050 Wb/m$^2$, which happens to be normal to the velocity vector
of the electron.

(a) If the electron ends up with 40 percent of the total kinetic
energy of the pair, find the radius of its path in the field.

(b) If, as the positron loses its kinetic energy, it captures an electron
in the fourth Bohr orbit, calculate the energy of the photons emitted
if the ground state is reached in three transitions.

Ans: (a) 8.3 cm; (b) 0.33, 0.94, 5.1 eV


8. An excited Fe$^{57}$ nucleus emits a $\gamma$ photon with an energy of 14.4 keV.
If it was initially at rest and free, calculate the recoil energy of the
nucleus. How many times as great is this recoil energy as the width
of the $\gamma$-ray line, which is found to be $4.6 \times 10^{-9}$ eV? At what speed
would a second Fe$^{57}$ nucleus in its ground state have to approach the
emitting nucleus if the second nucleus is to absorb the photon and end
up at rest in its excited state?

9. A nucleus is unstable against $\alpha$ decay if its mass exceeds the sum
of the mass of an $\alpha$ particle and the mass of the residual nucleus.

(a) If the mass of Th$^{234}$ is 234.043583 u, calculate the maximum
kinetic energy of the $\alpha$ particles from U$^{238}$.

(b) Show that the maximum kinetic energy in MeV of the $\alpha$
particles from any $\alpha$ emitter initially at rest is given by

$K = 931.48\,\left({}_ZM^A - {}_{Z-2}M^{A-4} - M_{\mathrm{He}}\right)\left(\dfrac{A-4}{A}\right)$

where the masses of the atoms are in unified atomic mass units.

10. A positron and an electron form a positronium “atom” in a state of
high excitation. The positronium emits photons as it undergoes transitions
to lower energy states. Calculate the energies of photons from
transitions in positronium from n = 5 to n = 3 and from n = 3 to n = 1.
11. Show that the conservation laws forbid the annihilation of isolated 
orthopositronium in a two-photon process but that two-photon annihila- 
tion of parapositronium satisfies the conservation laws. 

12. In a three-photon annihilation of positronium it is observed that 
a 0.42-MeV photon goes in the —z direction and a second photon goes 
off at 37° with the z axis. If the positronium was initially at rest, find 
the energies of the second and third photons and the angle the latter 
makes with the z axis. 


Ans: 0.35 and 0.25 MeV; 56° 


13. In a cloud chamber a positron with kinetic energy $K^+$ and momentum
$p^+$ and an electron with kinetic energy $K^-$ and momentum $p^-$ are
produced when a high-energy y photon interacts with a nucleus. Find 
the velocity of the reference frame S’ in which the pair has zero momen- 
tum. Compute the kinetic energy of the positron in the S’ frame. 
14. A negative muon is captured by the electric field of a lead nucleus 
to form a muonic atom. If the muon is in a state with n = 5 and the 
Bohr model is assumed, calculate the energy of the state and the orbit 
radius. If the muon makes a transition to the state with n = 2, find 
the energy of the photon emitted and the new orbit radius. 

15. For what nuclide would the Bohr radius for a negative pion be 
most nearly equal to the radius of the nuclide if the latter is given by 
Eq. (10.6)? (Assume that A = 2Z, a fair approximation for the case
in point.) On the basis of the Bohr model what is the energy of the pion
for this situation? 


16. Compute the maximum energy of a positron arising from the decay 
of a positive muon at rest. Assume the rest mass of both neutrinos to 
be zero. 

17. Find the momentum carried off by the mu neutrino in the decay 
of a pion at rest. 


Ans: $1.57 \times 10^{-20}$ kg-m/s
18. Pions created in a high-energy accelerator have a speed v = 0.800c
in the laboratory frame. In this frame (a) what is the apparent half-life
of the pions and (b) what is the distance traversed by a pion during
this time?

19. If the decaying particle is initially at rest, calculate the energies
of the $\gamma$ photons emitted in each of these reactions: (a) $\pi^0 \to \gamma + \gamma$;
(b) $\eta \to \gamma + \gamma$; (c) $\Sigma^0 \to \Lambda + \gamma$.

20. Stars resulting from a nuclear disintegration in a photographic
emulsion often involve the emission of positive and negative pions.
Assuming that the pions originate at the surface of an Ag nucleus
(radius ~ $5 \times 10^{-15}$ m), estimate the minimum energy a $\pi^-$ must have
to escape from the nucleus. Explain why positive pions observed in the
emulsion have energies above 4 MeV, while many negative pions have
much lower energies.

21. Among the many possible reactions when an antiproton is captured
by a deuteron is $\bar{p} + d \to n + \pi^0$. If both the $\bar{p}$ and the deuteron were
initially at rest, find the total energies of the $\pi^0$ and the neutron after
the interaction.


Ans: 1.25 GeV, 1.56 GeV 


chapter eleven 


Wave Properties of Particles 


One must approach the subject of this chapter philosophically, prepared
to accept conclusions which at first thought seem at variance with our senses
and with a belief that has persisted almost unquestioned from the time of
the Greeks, viz., that matter is made up of particles. We have seen in Chap. 7
that light possesses both undulatory and corpuscular characteristics. But
even so, we might say, light differs from matter; whereas we can determine
the nature of light only by indirect observation, matter we can see. We
observe directly that a handful of sand is made up of real particles. In
ordinary experience they certainly do not exhibit wavelike characteristics.
The particles of sand which we see so clearly are, however, made up of
molecules or atoms, and these of electrons and protons and neutrons, none
of which we can see directly any more than we can light waves or photons.
It is with these so-called particles, evidence concerning which is almost as
indirect as with photons, that the wave theory of matter is primarily concerned.
We attempt in this chapter to give a brief introduction to this most
important subject. Our purpose will be to show how the concept of matter
waves can be developed from previous concepts of classical and quantum
physics and to summarize the pertinent experimental evidence.


11.1 Matter Waves 


With the discovery of the law of the conservation of energy toward the 
middle of the nineteenth century, it became accepted by physicists that 
the physical universe is made up of two great entities, viz., matter and 
energy, each of which is conserved. These two great conservation laws 
provided much of the foundation upon which classical physics was 
built. By 1900, the corpuscular nature of matter had become firmly 
established; likewise the undulatory nature of light. By 1910, further- 
more, Planck’s quantum theory and the Einstein photoelectric equation 
together with various lines of experimental evidence had made it clear 
that light itself possesses corpuscular characteristics. By 1920 the dual 
nature of radiant energy was generally recognized, although not 
understood. 

During all this time, there was no suggestion that matter was 
anything but corpuscular. But, in 1924, Louis de Broglie made the 
bold suggestion that particles of matter, and in particular electrons, 
might possess certain undulatory characteristics, so that they, too, might 
exhibit a dual nature. He suggested a way in which the undulatory 
characteristics of electrons might furnish a new basis for the quantum 
theory. The reasoning used might be paraphrased as follows. (1) 
Nature loves symmetry. (2) Therefore the two great entities, matter 
and energy, must be mutually symmetrical. (3) If (radiant) energy is 
undulatory and/or corpuscular, matter must be corpuscular and/or 
undulatory. 

In his first paper, de Broglie was concerned primarily with develop- 
ing a theory of light in terms of light quanta, or photons. If the energy 
of the light is concentrated in photons, how are the phenomena of inter- 
ference to be understood? There must be waves of some sort associated
with the photons, in order to account for the observed interference effects. 
We can no longer suppose that the energy is spread out over these waves, 
as in classical theory; nevertheless, the waves must somehow determine 
where, in an interference pattern, the photons can produce effects by 
being absorbed. The details of interference patterns depend largely 
upon the phase relations of the waves; hence, de Broglie called the 
latter phase waves. He assumed their frequency v to be such that the 
energy in a photon equals hv. 
A material particle carries energy; according to de Broglie, the
basic idea of the quantum theory is the impossibility of imagining an
isolated quantity of energy without associating with it a certain frequency.
Material particles ought, therefore, like photons, to be accompanied
by phase waves of some sort; and these waves ought, under
suitable circumstances, to give rise to interference effects. Furthermore,
the waves associated with a particle should have a frequency equal to
the energy of the particle divided by h. De Broglie proceeded to extend
wave-particle dualism to material particles in the only relativistically
invariant way possible. He included the rest energy $m_0c^2$ in the energy,
but we, following Schrödinger, construct first a nonrelativistic theory.
We assume that any material particle of mass m moving at speed v has
associated with it waves of frequency $\nu$ given by

$h\nu = \tfrac{1}{2}mv^2 + P = E$    (11.1)

where h is Planck’s constant and P is the potential energy of the particle.
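
A minimal numerical sketch of Eq. (11.1) follows. It assumes a value of 6.626e-34 J-s for Planck's constant and, in the example, an electron mass of 9.109e-31 kg; neither value is given in this passage, and the function name is illustrative.

```python
PLANCK_H = 6.626e-34   # Planck's constant, J-s (assumed value)


def matter_wave_frequency(mass_kg, speed_m_s, potential_j=0.0):
    """Frequency (Hz) from Eq. (11.1): h*nu = (1/2) m v^2 + P.

    This is the nonrelativistic form; the rest energy is not included.
    """
    energy = 0.5 * mass_kg * speed_m_s ** 2 + potential_j
    return energy / PLANCK_H


if __name__ == "__main__":
    # A free electron (P = 0) moving at 1.0e6 m/s; the mass is an assumed value.
    print(matter_wave_frequency(9.109e-31, 1.0e6))
```

Because only energy differences are observable, a different choice of the zero of potential energy shifts this frequency by a constant.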

The physical nature of these waves was left indefinite by de Broglie. 
We cannot suppose that a material particle is just a group of waves; 
for then its mass and energy (and also its charge if it has one) would be 
spread out over these waves and would, in consequence, soon become 
scattered widely in space. It is of the nature of waves to diverge toward 
all sides. We should not be disturbed, however, by the impossibility 
of visualizing the waves. We should remember our experience in optics. 
Using classical theory, it was easy to picture light as wave motion; but 
if we retain this concept, it is very difficult to picture a beam of light as 
a moving stream of photons. Similarly, as long as we retain the particle 
concept of an electron or proton, we cannot form a concrete picture of 
the accompanying waves. Perhaps, even, they are only mathematical 
waves, so to speak, a device that we employ for the purpose of making 
calculations and predicting the results of observation. In adopting 
this standpoint, we are doing only what has been done, in various ways, 
many times previously in physical science. Strictly speaking, we have 
no very exact knowledge of the fundamental nature of a magnetic field, 
and yet we do not hesitate to use the symbol B with all due familiarity. 


11.2 Mechanics as Geometrical Optics of the Waves 


If waves are associated with all material particles and play a part in 
determining their motion, there should be a parallelism between the 
laws of mechanics and the laws of wave motion. For the particles 
certainly obey the laws of mechanics, at least in some cases. It was
pointed out long ago by Hamilton that there does exist a close parallelism
between the laws of mechanics and the laws of ordinary geometrical
optics. De Broglie suggests, accordingly, that the familiar laws of particle
mechanics may represent an approximation which is valid under such
circumstances that the laws of geometrical optics hold for the matter
waves.

One of the fundamental laws of ordinary geometrical optics is that, 
in a homogeneous medium, light travels in straight lines, or rays. This
assumption is very nearly correct as long as the lateral dimensions of the 
beam are large compared with the wavelength of the light. Under these 
conditions one might say that the particle characteristics of light appear 
to predominate, although the rectilinear propagation of light is also 
consistent with the wave theory. When, however, the cross section of
the beam is of the same order of magnitude as the wavelength of the light, 
rectilinear propagation no longer holds, diffraction phenomena are 
observed, and undulatory characteristics of light predominate. If we 
carry the similarity over to mechanics, might we not expect that, for 
very small particles of matter, the ordinary laws of mechanics—found 
by Newton to be applicable to the phenomena of macromechanics— 
would fail, and, by analogy with light, we should find that matter shows 
undulatory properties in the realm of micromechanics? 

If we assume that the phenomena of ordinary mechanics constitute 
those of geometrical optics for matter waves, we can find the properties 
these waves must have in order that, under suitable conditions, the laws 
of mechanics may hold. Geometrical optics is based upon two laws, 
those of reflection and of refraction. The law of reflection as applied to 
matter waves presents no difficulty, for it is essentially the same as the 
law governing the rebound of an elastic body from a hard wall. Refraction
corresponds to the deflection of a particle by the action of forces. 

To take a mechanical phenomenon which imitates a simple case 
of refraction in geometrical optics, let a pencil of electrons from a suitable 
gun G (Fig. 11.1) enter through orifice a an enclosed metal box A, the 
potential of which relative to the filament of the gun is V. Let these 
electrons emerge from A through orifice b and enter through c another 
box B, which is maintained at a potential V + AV. The electric field 
between the two boxes changes the component of velocity of the elec- 
trons perpendicular to the adjacent surfaces, and the electrons enter B 
with a change in their direction of motion. Let $v_A$ and $v_B$ (Fig. 11.1)
be the velocities of the electrons in A and B, respectively, and $\theta_A$, $\theta_B$ the
angles between these directions and the normal to the box faces at b
and c. Since the electric field does not change the horizontal component
of velocity,

$$v_A \sin\theta_A = v_B \sin\theta_B$$

Now if we were dealing with light waves undergoing refraction, or any
other kind of wave, the relation would be

$$\frac{\sin\theta_A}{\sin\theta_B} = \mu = \frac{u_A}{u_B}$$

where $\mu$ is the relative refractive index of the two media and $u_A$, $u_B$ are
the corresponding phase velocities. Comparison of the last two equations
gives the result $u_A/u_B = v_B/v_A$. We may conclude that if matter
waves follow the electron along its path, the wave speed u is inversely
proportional to the speed v of the electron, or

$$u = \frac{b}{v} \tag{11.2}$$

where b is a constant during the motion that remains to be discovered.
Since, if E is the total energy of the electron and P its potential energy
(here $-eV$), we have $\tfrac{1}{2}mv^2 = E - P$, and since for matter waves we
assume that $E = h\nu$, we can also write (11.2) in the form

$$u = b\left[\frac{m}{2(E - P)}\right]^{1/2} = b\left[\frac{m}{2(h\nu - P)}\right]^{1/2} \tag{11.2a}$$


Here b may be different for different values of E or $\nu$.
The same equation must hold for matter waves accompanying any
kind of particle, although perhaps with a different value of b. It is
evident that the wave speed u will vary along the path whenever P
varies. Furthermore, u will vary with the frequency $\nu$, so that the waves
will exhibit the phenomenon of dispersion, even in free space, where
P = 0.
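
As a rough numerical illustration of the refraction analogy, the following Python sketch computes the direction of the electrons in box B from $v_A \sin\theta_A = v_B \sin\theta_B$. It assumes nonrelativistic electrons that leave the filament at rest, so that the speed in each box is proportional to the square root of the box potential; the function names and the sample potentials are illustrative choices, not taken from the text.

```python
import math

def refracted_angle_deg(theta_a_deg, box_a_volts, delta_volts):
    """Direction in box B from v_A sin(theta_A) = v_B sin(theta_B).

    Assumption: nonrelativistic electrons starting from rest at the
    filament, so v is proportional to sqrt(potential) and
    v_A / v_B = sqrt(V / (V + dV)).
    """
    box_b_volts = box_a_volts + delta_volts
    if box_a_volts <= 0 or box_b_volts <= 0:
        raise ValueError("both box potentials must be positive")
    speed_ratio = math.sqrt(box_a_volts / box_b_volts)  # v_A / v_B
    sin_b = speed_ratio * math.sin(math.radians(theta_a_deg))
    if abs(sin_b) > 1.0:
        raise ValueError("no transmitted direction: the electron is turned back")
    return math.degrees(math.asin(sin_b))

def phase_velocity_ratio(box_a_volts, delta_volts):
    """u_A / u_B = v_B / v_A, the effective refractive index of B relative to A."""
    return math.sqrt((box_a_volts + delta_volts) / box_a_volts)

if __name__ == "__main__":
    # Illustrative values: V = 100 V, dV = 300 V, incidence at 30 degrees.
    print(refracted_angle_deg(30.0, 100.0, 300.0))
    print(phase_velocity_ratio(100.0, 300.0))
```

With these sample values the electron speed doubles on entering B, so the beam is bent toward the normal, just as light is on entering an optically denser medium.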


Parenthetically, it should be remarked that the well-known
electron microscope does not depend for its operation upon such wave
properties of the electrons. In this microscope the electrons move
essentially according to classical mechanics, although their motion is
sometimes treated in terms of the equivalent refractive index described
above. The only connection that matter waves have with the electron
microscope is that their wavelength sets a limit to the possible
resolving power that can be attained, just as the resolving power of
ordinary microscopes is limited by the finite wavelength of light.
The wavelength of the electron waves, however, we shall presently
find to be so small that the resolving power of existing electron microscopes
is limited by other factors.


11.3 Phase and Group Velocity 


The wavelength to be expected for matter waves can be discovered by 
developing another suggestion—likewise due to de Broglie. It follows 
from Eq. (11.2) that an individual wave cannot stay with the particle 
permanently, since u and v usually differ. Whenever the wave velocity 
(or phase velocity) varies with frequency, a finite group of waves moves
with a velocity different from the phase velocity. This phenomenon is 
easily observed on water. Close inspection of a group of waves advancing 
over a water surface will show that the individual waves advance twice 
as fast as the group as a whole; new waves continually arise at the rear 
of the group, pass through it, and die out at the front. Let us assume, 
therefore, with de Broglie, that the group velocity of matter waves is 
equal to the particle velocity, so that a group of them can accompany 
the particle in its motion. 

An ideal plane wave traveling in the x direction can be represented
by the equation

$$\Psi = A_0 \cos 2\pi\left(\nu t - \frac{x}{\lambda}\right) = A_0 \cos(\omega t - kx) \tag{11.3}$$

where $A_0$ is the amplitude, $\omega = 2\pi\nu$, and k, the propagation number,
is $2\pi/\lambda$. The phase velocity is $u = \nu\lambda = \omega/k$ for this wave.
When the phase velocity varies with wavelength, a finite group
of waves and energy is propagated at the group velocity v. To find
the group velocity for waves of angular frequency near $\omega$, it is sufficient
to consider two angular frequency components which we select to be
$\omega - \Delta\omega$ and $\omega + \Delta\omega$. Let the two components have the same amplitude
A and be given by

$$\Psi_1 = A \cos[(\omega - \Delta\omega)t - (k - \Delta k)x]$$
$$\Psi_2 = A \cos[(\omega + \Delta\omega)t - (k + \Delta k)x]$$

Adding gives the resulting displacement

$$\Psi = A\{\cos[(\omega - \Delta\omega)t - (k - \Delta k)x] + \cos[(\omega + \Delta\omega)t - (k + \Delta k)x]\}$$
$$= 2A \cos(\omega t - kx)\cos(\Delta\omega\,t - \Delta k\,x) \tag{11.4}$$

where we have made use of the fact that

$$\cos(\theta \pm \varphi) = \cos\theta\cos\varphi \mp \sin\theta\sin\varphi$$

letting $\theta = \omega t - kx$ and $\varphi = \Delta\omega\,t - \Delta k\,x$.

The two cosine terms show the presence of beats in the resulting
wave (Fig. 11.2). For a constant phase point, $\omega t - kx = \text{const}$ and
$dx/dt = \omega/k = \nu\lambda$ is the phase velocity u.

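Fig. 11.2 The phase and group velocities of two waves differing slightly in wavelength
and frequency.

Setting the argument of the second cosine equal to a constant
gives $\Delta\omega\,t - \Delta k\,x = \text{const}$, and $dx/dt = \Delta\omega/\Delta k = \Delta\nu/\Delta(1/\lambda)$ is the velocity
of the wave envelope, or the group velocity v. For a particular frequency
$\nu$, the group velocity v is

$$v = \lim_{\Delta k \to 0} \frac{\Delta\omega}{\Delta k} = \frac{\partial\omega}{\partial k} = \frac{\partial\nu}{\partial(1/\lambda)} = -\lambda^2 \frac{\partial\nu}{\partial\lambda} \tag{11.5}$$

Since $\omega = ku$, we may also write

$$v = \frac{\partial(ku)}{\partial k} = u + k\frac{\partial u}{\partial k} = u - \lambda\frac{\partial u}{\partial\lambda} \tag{11.6}$$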


Here partial derivatives are written to indicate that u and v may vary
with other factors than $\lambda$; for example, the refractive index may not be
uniform.

In Fig. 11.2 the wavelets travel with phase velocity u and have
wavelength essentially $\lambda$ and frequency essentially $\nu$. The modulation
envelope, on the other hand, has the group velocity v and the number
of beats per unit time is the frequency difference $2\Delta\nu$, so that the
modulation-wave frequency is $\Delta\nu$. Consequently, the wavelength $\lambda_m$ of the
modulation wave is $v/\Delta\nu$, and the modulation wavenumber $k_m = 2\pi/\lambda_m$
is just $\Delta k$ by Eq. (11.4). Therefore, $\lambda_m = 2\pi/\Delta k$.
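
The relations (11.5) and (11.6) can be checked numerically for the water waves mentioned at the start of this section. The sketch below assumes the deep-water dispersion relation $\omega = \sqrt{gk}$, which is not derived in the text and is introduced here only as a convenient example; the function names and the 10-m wavelength are likewise illustrative.

```python
import math

G = 9.81  # m/s^2; used in the assumed deep-water relation omega = sqrt(g k)

def omega(k):
    """Angular frequency for the assumed dispersion relation."""
    return math.sqrt(G * k)

def phase_velocity(k):
    """u = omega / k."""
    return omega(k) / k

def group_velocity(k, dk=1e-6):
    """Eq. (11.5): v = d(omega)/dk, estimated by a central difference."""
    return (omega(k + dk) - omega(k - dk)) / (2.0 * dk)

def group_velocity_from_u(k, dk=1e-6):
    """Eq. (11.6): v = u + k du/dk, estimated by a central difference."""
    du_dk = (phase_velocity(k + dk) - phase_velocity(k - dk)) / (2.0 * dk)
    return phase_velocity(k) + k * du_dk

if __name__ == "__main__":
    k = 2.0 * math.pi / 10.0  # waves of 10-m wavelength (illustrative)
    print("phase velocity:", phase_velocity(k))
    print("group velocity, Eq. (11.5):", group_velocity(k))
    print("group velocity, Eq. (11.6):", group_velocity_from_u(k))
```

Under this assumption the two estimates of v agree and the phase velocity comes out twice the group velocity, in line with the observation that individual water waves advance twice as fast as the group.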


11.4 The de Broglie Wavelength 

Following de Broglie, we seek a relationship between the velocity v of a
particle of mass m and its wavelength $\lambda$, assuming that v corresponds
to the group velocity. Differentiating Eq. (11.1) with respect to $\lambda$,
with P held constant, yields

$$h\frac{\partial\nu}{\partial\lambda} = mv\frac{\partial v}{\partial\lambda}$$

Eliminating $\partial\nu/\partial\lambda$ with the aid of Eq. (11.5) yields

$$\frac{\partial v}{\partial\lambda} = -\frac{h}{m\lambda^2}$$

and integrating,

$$v = \frac{h}{m\lambda} + C$$

The value of C cannot be uniquely determined. Making the simplest
assumption, that C = 0, we find

$$\lambda = \frac{h}{mv} = \frac{h}{p} \tag{11.7}$$

where p = mv, the momentum of the particle.


Wavelengths given by Eq. (11.7) are known as de Broglie wavelengths.
The equation holds for photons as well as for material particles;
for the momentum of a photon is $h\nu/c = h/\lambda$. Equation (11.7) combines
corpuscular and undulatory concepts in a very intimate way; for $\lambda$ has
a clean-cut meaning only in connection with a wave theory, and p,
the momentum, is most naturally associated with a moving particle.

We can now compute the wavelengths to be expected for electron
waves, atom waves, or molecule waves. For an electron starting from
rest and accelerated through a modest potential difference V, the kinetic
energy $K = Ve = \tfrac{1}{2}mv^2$ and the momentum $p = mv = \sqrt{2meV}$, from
which

$$\lambda_e = \frac{h}{\sqrt{2meV}} = \frac{12.27}{\sqrt{V}}\ \text{Å} \tag{11.8}$$

with V in volts.

For 100-V electrons, $\lambda_e = 1.23$ Å; for 10,000-V electrons, $\lambda_e = 0.123$ Å.
When K = Ve is not negligible compared to $m_0c^2$, Eq. (11.8) is no longer
valid and must be replaced by

$$\lambda_e = \frac{h}{\sqrt{2m_0K\left(1 + K/2m_0c^2\right)}} \tag{11.8a}$$
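
The electron wavelengths quoted above can be reproduced with a few lines of Python. This is only a sketch: the constants are standard SI values supplied here rather than taken from the text, the function names are illustrative, and the relativistic form uses $(pc)^2 = K^2 + 2Km_0c^2$, which is the usual relation between momentum and kinetic energy.

```python
import math

H = 6.62607015e-34           # Planck's constant, J s
M_E = 9.1093837015e-31       # electron rest mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C
C_LIGHT = 2.99792458e8       # speed of light, m/s

def wavelength_nonrelativistic(mass_kg, kinetic_j):
    """Eq. (11.7) with p = sqrt(2 m K); adequate when K << m c^2."""
    return H / math.sqrt(2.0 * mass_kg * kinetic_j)

def wavelength_relativistic(rest_mass_kg, kinetic_j):
    """lambda = h / p with (p c)^2 = K^2 + 2 K m0 c^2."""
    pc = math.sqrt(kinetic_j**2 + 2.0 * kinetic_j * rest_mass_kg * C_LIGHT**2)
    return H * C_LIGHT / pc

def electron_wavelength_angstrom(volts, relativistic=False):
    """Wavelength in angstroms of an electron accelerated from rest through `volts`."""
    kinetic_j = volts * E_CHARGE
    formula = wavelength_relativistic if relativistic else wavelength_nonrelativistic
    return formula(M_E, kinetic_j) * 1e10

if __name__ == "__main__":
    for volts in (100.0, 1.0e4, 1.0e6):
        print(volts,
              electron_wavelength_angstrom(volts),
              electron_wavelength_angstrom(volts, relativistic=True))
```

At 100 V the two formulas are practically indistinguishable, while at accelerating potentials approaching a million volts the nonrelativistic expression noticeably overestimates the wavelength.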


The de Broglie wavelength can be calculated from Eq. (11.8a)
[or (11.8) when $K \ll m_0c^2$] for any particle, such as a proton, a helium
atom, or even a baseball. The larger the mass, the shorter the wavelength
at given speed. De Broglie’s speculations aroused great interest. Soon
experiments designed to test the hypothesis that electrons exhibit wave
properties were launched by several physicists. On the theoretical
front Heisenberg and Schrödinger discovered how to develop mathematical
theories which, with additions by Born, Dirac, and others,
have become the highly successful quantum mechanics of the present
day.


11.5 Experiments on Electron Waves


a. Reflection from a Crystal Experiments on electron diffraction were 
reported by Davisson and Germer 3 years after de Broglie’s first paper 
appeared. They were studying the reflection of electrons from a nickel 
target and accidentally subjected a target to heat treatment that trans- 
formed it into a group of large crystals. Anomalies then appeared in 
the reflection from it. Following up this lead, they prepared a target 
consisting of a single crystal of nickel and bombarded its surface at 
normal incidence by a narrow pencil of low-voltage electrons; by means 
of a suitable collector of small aperture, they studied the distribution 
in angle of the electrons reflected from the crystal. In this reflected 
beam, they found striking maxima and minima, which they were able 
to explain in terms of diffraction of the electron waves. 

The diffraction of such waves by a crystal is very similar to the
diffraction of x-rays. Crystallographic studies by means of x-rays show
that the nickel crystal is of the face-centered cubic type, as shown in
Fig. 11.3; i.e., the crystal can be imagined to be constructed of cubical
unit cells having an atom at each corner and one in the center of each
face, with none inside the cube. The atoms are indicated by circles in
the figure, certain ones being joined by lines in order to outline the unit
cells. The length $a_0$ of the edge of the unit cube is 3.51 Å. Figure
11.3b shows a face cut on the crystal at right angles to one of the diagonals
of the cube.

Fig. 11.3 (a) A nickel crystal showing face-centered cubic structure. (b) Same,
with a face cut at right angles to a diagonal. (c) An incident beam of electrons is
scattered backward.

From the surface layer of atoms in the triangular face of Fig. 11.3c,
it is readily seen that these atoms are arranged in rows parallel to one
side of the triangle. The distance D between these rows is 2.15 Å.
We may regard these rows of atoms as equivalent to the lines of a plane
grating of grating space D. Radiation of wavelength $\lambda$ incident normally
on such a grating, the plane of incidence being taken normal to one side
of the triangle, should be diffracted, as is light from a reflection grating,
according to the well-known grating law

$$n\lambda = D \sin\varphi \tag{11.9}$$

where $\varphi$ is the angle between the normally incident beam and the diffracted
beam and n is the order of diffraction. The crystal is equivalent
to a series of such plane gratings piled one above the other. Radiation
penetrating to, and diffracted backward from, any one of these underlying
layers is combined with that diffracted from other layers, with the result
that a diffracted beam is observed at the angle $\varphi$ unless the beams
from the several layers happen to destroy each other completely.

In the apparatus used by Davisson and Germer, the electron gun
consists of a heated filament emitting electrons, an accelerating field to
give the electrons any desired velocity, and a series of collimating apertures
to produce a (nearly) unidirectional beam. This monovelocity
beam of electrons strikes the surface of the crystal at normal incidence,
and the electrons are reflected, or scattered, in all directions. A collector
for measuring the reflected electrons is so arranged that it can be adjusted
to any angular position with respect to the crystal. The collector has
two walls insulated from each other, between which a retarding potential
is applied so that only the fastest electrons, those possessing nearly the
incident velocity, can enter the inner chamber and be measured by the
galvanometer. The crystal can be turned about an axis parallel to the
axis of the incident beam, and thus any azimuth of the crystal may be
presented to the plane defined by the incident beam and the beam entering
the collector.


If a beam of low-voltage electrons is incident on the crystal, turned 
at any arbitrary azimuth, and the distribution of the scattered beam is 
measured as a function of the colatitude—the angle between the incident 
beam and the beam entering the collector—a curve similar to that in 
Fig. 11.4a is obtained, which refers to incident 36-V electrons. If now 
the crystal is turned to the A azimuth, the distribution curve for 40-V 
electrons (Fig. 11.4b) shows a slight hump at about colatitude 60°.
With increasing voltage this hump moves upward and develops into a 
spur which becomes most prominent at 54 V (Fig. 11.4e) at which 
voltage the colatitude of the spur is 50°. At higher voltages the spur 
gradually disappears. 

The spur in its most prominent state of development offers convincing
evidence for the existence of electron waves. From Eq. (11.8)
the de Broglie wavelength of 54-V electrons should be

$$\lambda_e = \frac{12.27}{\sqrt{54}} = 1.67\ \text{Å}$$

From Eq. (11.9) we find as the observed wavelength

$$\lambda_e = D \sin\varphi = 2.15 \sin 50^\circ = 1.65\ \text{Å}$$

The two values of $\lambda_e$ are in excellent agreement.

Fig. 11.4 The development of the diffraction beam in the A azimuth, showing
maximum length of spur for 54-eV electrons at colatitude 50°.

A spur was also found at 55° colatitude with 181-V electrons.
This was interpreted as a second-order beam. For 181-V electrons,
$\lambda_e = 0.91$ Å, whereas second-order diffraction gives 0.88 Å. Over 20
such beams were reported, in three different azimuths. With improved
apparatus, the agreement between observed and de Broglie values of
wavelength was better than 1 percent.
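
The comparison just made between the de Broglie wavelength and the wavelength inferred from the grating law is simple arithmetic, and may be sketched in Python as follows. The numerical inputs (2.15 Å row spacing, 54 V at 50°, 181 V at 55° in second order, and the factor 12.27) are those quoted in the text; the function names are illustrative.

```python
import math

ROW_SPACING_ANGSTROM = 2.15  # distance D between rows of nickel atoms

def de_broglie_angstrom(volts):
    """Nonrelativistic electron wavelength, 12.27 / sqrt(V) angstroms."""
    return 12.27 / math.sqrt(volts)

def grating_wavelength_angstrom(colatitude_deg, order=1,
                                spacing=ROW_SPACING_ANGSTROM):
    """Grating law n * lambda = D * sin(phi), solved for lambda."""
    return spacing * math.sin(math.radians(colatitude_deg)) / order

if __name__ == "__main__":
    for volts, angle_deg, order in ((54.0, 50.0, 1), (181.0, 55.0, 2)):
        predicted = de_broglie_angstrom(volts)
        observed = grating_wavelength_angstrom(angle_deg, order)
        print(f"{volts:.0f} V: de Broglie {predicted:.2f} A, "
              f"grating law {observed:.2f} A")
```

The few-percent discrepancy that remains in these early figures is consistent with the statement that improved apparatus later brought the agreement to better than 1 percent.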


b. Transmission through a Crystal Given sufficient energy, electrons
pass readily through thin films of matter such as metal foil or thin mica. 
If an electron beam has wave properties, then in its passage through 
matter we should observe some or all of the phenomena characteristic 
of the similar passage of x-rays. 

Soon after the discovery of electron waves, Kikuchi succeeded in
obtaining electron diffraction patterns by passing a pencil of 68-kV
electrons through a thin mica crystal. These observations are the exact
analog of the first experiments on the diffraction of x-rays by Friedrich,
Knipping, and Laue. About the same time G. P. Thomson (son of
J. J. Thomson) passed a comparable pencil of electrons through thin
foils of polycrystalline materials. Such foils are composed of a large
number of randomly oriented microcrystals, and the observed diffraction
patterns consist of a series of concentric rings instead of spots
(Fig. 11.5). The similarity between the patterns for electrons and those
for x-rays is striking.

Fig. 11.5 (a) Diffraction pattern produced by passing x-rays through
a thin film of aluminum. (Courtesy of Mrs. M. I. Read, Bell Telephone
Laboratories.) (b) Diffraction pattern produced by passing
48-keV electrons through a silver foil 500 Å thick. (Courtesy of Dr.
L. H. Germer, Bell Telephone Laboratories.)

Measurements of electron wavelengths have now attained such a 
degree of precision that they are regarded as a valuable source of informa- 
tion on the values of the fundamental constants. If the speed v of 
the electrons is measured by a kinematical device, observations of $\lambda$,
or of h/mv, furnish values of h/m; whereas if the speed is calculated from
an accelerating voltage V, so that $mv^2/2 = eV$, observations of $\lambda$ are
best regarded as furnishing values of $h/\sqrt{em}$.


11.6 Diffraction of Neutrons and Molecules 


Even molecules exhibit wave properties. A molecular beam may be 
formed by allowing molecules of a gas to stream out of an enclosure 
through a small hole or slit into an evacuated chamber. Two difficulties 
have to be overcome, however, which do not arise in working with 
electrons. The molecules issue with a maxwellian distribution of velocities,
whereas for diffraction experiments a beam of uniform velocity is
desirable, corresponding to monochromatic waves. Then, too, neutral
molecules are very much harder to detect than charged particles. 
Especially interesting is an experiment performed in 1931 by 
Estermann, Frisch, and Stern. They managed to select a beam of 
helium molecules or atoms having fairly uniform velocities by passing 
the beam through narrow slits in two parallel circular disks placed 3 cm 
apart, rigidly connected together and rotated about their common 
axis. The slits in the disks were adjusted so as to be opposite each other. 
An atom, after passing through a slit in the first disk, would arrive at 
the second disk too late to pass through the corresponding slit in that 
disk; but, if its velocity was just right, it would be in time to pass through 
the next following slit. Atoms moving faster or slower would arrive too
soon or too late and would be stopped by the disk. After leaving the 
second disk, the beam fell upon the surface of a lithium fluoride crystal, 
by which it was reflected or diffracted. To measure the intensity of 
the beam diffracted in a given direction, the helium atoms were allowed 
to pass through a small hole into a chamber where they accumulated 
until a certain pressure of helium was reached (of the order of $10^{-6}$ cm Hg).
This pressure was measured by the cooling effect of the helium on an 
electrically heated metal strip, owing to conduction of heat through 
the helium; the electric resistance of the strip was measured with a 
Wheatstone bridge as an indication of its temperature. 

A strong diffracted beam of helium atoms was observed in addition 
to the regularly reflected beam. In the most precise measurement, the 
maximum in the diffracted beam was found at 19.45°, corresponding to a 
wavelength of $\lambda = 0.600$ Å as calculated from the usual formula for
diffraction by a crystal. From the dimensions and rate of rotation
of the disk, the velocity of the helium atoms was calculated to be
$1.635 \times 10^3$ m/s; the corresponding de Broglie wavelength is, by Eq. (11.7),
0.609 Å.

Neutrons in nuclear reactors which have experienced a large number
of collisions have a maxwellian velocity distribution characteristic of the
reactor temperature. A typical neutron at room temperature has an
energy of about 0.025 eV and a wavelength of about 1.8 Å. Such
neutrons undergo Bragg reflection at the crystal planes of typical crystals
at appropriate angles. The de Broglie wavelength of the neutron makes
possible the use of neutron spectrometers and permits the physicist to
obtain beams of almost monoergic neutrons.
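
The wavelengths quoted in this section for helium atoms and thermal neutrons follow directly from Eq. (11.7). The sketch below is one way to check them; the particle masses and constants are standard values supplied here, not given in the text, and the function names are illustrative.

```python
import math

H = 6.62607015e-34              # Planck's constant, J s
ATOMIC_MASS_UNIT = 1.66053906660e-27  # kg
EV = 1.602176634e-19            # joules per electron volt
M_HELIUM = 4.002602 * ATOMIC_MASS_UNIT  # helium-4 atom, kg
M_NEUTRON = 1.67492749804e-27   # kg

def wavelength_from_speed_angstrom(mass_kg, speed_m_s):
    """lambda = h / (m v), expressed in angstroms."""
    return H / (mass_kg * speed_m_s) * 1e10

def wavelength_from_energy_angstrom(mass_kg, kinetic_ev):
    """lambda = h / sqrt(2 m K) for a nonrelativistic particle, in angstroms."""
    return H / math.sqrt(2.0 * mass_kg * kinetic_ev * EV) * 1e10

if __name__ == "__main__":
    # Helium atoms at the speed selected by the rotating disks.
    print("helium, 1635 m/s:", wavelength_from_speed_angstrom(M_HELIUM, 1635.0))
    # A typical room-temperature neutron.
    print("neutron, 0.025 eV:", wavelength_from_energy_angstrom(M_NEUTRON, 0.025))
```

Both results are comparable with interatomic spacings in crystals, which is why crystals serve as natural diffraction gratings for these beams.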


11.7 Electrons and the Wave Function 


As we saw in Sec. 11.5, beams of electrons and of photons behave in
essentially identical ways when they pass through thin foils of various
materials. To further emphasize the common behavior of electron
waves and photons of the same wavelength, let us consider the following
experiment. Imagine a pair of slits (Fig. 11.6) through which we can
alternatively send beams of photons and of electrons of comparable
wavelength from either an electron gun or a photon source. Assuming
that the wavelengths and dimensions of the apparatus have been suitably
chosen to emphasize the wave aspects of the electrons and photons, we
observe on a photographic plate (or other suitable detector) the pattern
shown in Fig. 11.6b for both electrons and photons. In terms of the
classical particle picture of an electron the expected pattern would
consist simply of spots at $P_1$ and $P_2$, corresponding to electrons which
pass directly through the slits and impinge on the detecting plate.
Instead, we find photons and electrons exhibiting the same wave
properties.

We should obtain identical distribution of intensities if we used 
a little scintillator which moved along the plate in Fig. 11.6a and 
flashed each time an electron or photon struck it. The individual 
scintillations would be clear evidence that the photons or electrons had 
particle characteristics, since classical waves would produce a uniform 
glow proportional to the intensity, not individual flashes of light. 

The similarity of behavior of electrons and photons is further
illustrated when we consider what happens if we close one of the slits.
If only slit 1 is open, for either electrons or photons, the observed intensity
pattern corresponds to curve $I_1$ of Fig. 11.6c. Similarly if slit 1 is closed
and slit 2 open, the observed pattern resembles $I_2$. The pattern with
both slits open is not the sum of the patterns of the two individual slits.
At this point there arises a grievous dilemma for the person accustomed
to thinking in terms of classical physics. He naturally assumes that
if an electron arrives at a detector when both slits are open, the electron
must have gone either through the first slit or through the second.
Even if one counts the arrival of individual particles by a scintillation
detector, one obtains the curve of Fig. 11.6b, not the sum of the curves
of Fig. 11.6c. If the electron or photon goes through only one slit, how
can opening or closing the second slit possibly affect the trajectory?
How could opening the second slit reduce the probability of any particle’s
arriving at a specific point (such as $P_1$ on Fig. 11.6b) to zero, when
each slit alone makes a positive contribution to the intensity at this
point?

Fig. 11.6 (a) If electrons behaved like classical particles, the pattern expected
for electrons from the source passing through two slits would be spots at $P_1$ and
$P_2$. (b) The actual pattern observed on the plate is that expected from classical
waves. (c) Patterns expected from waves through each slit if the other slit is
closed.

This result is paradoxical only when we insist on a rigid classical 
view that electrons are particles. What is important is not what name 
they bear but how they react in experiments. “Particle” is a convenient 
name for an object which we can localize, but we use another name to 
describe its behavior in a diffraction experiment. The fact confronting
us here is that an entity which can be located in space in some experiments
can exhibit wave properties in others. Indeed, there is a continuum
between the two where neither property is well defined. The
entire manifold has its analogy in classical optics, where one speaks of 
long wave trains with precisely defined wavelengths, or short light 
pulses, which can be located in space but for which wavelength cannot 
be defined. That this analogy is relevant to matter is the result of experiments;
one makes progress in physics by explaining what actually happens,
not by explaining what one thinks ought to happen but does not. 

Classical wave theory explains the interference pattern of Fig. 11.6 
satisfactorily. Since electrons produce an analogous pattern, we shall 
tentatively assume that the explanation of the electron “interference” 
pattern can be developed in close analogy with the explanation for 
electromagnetic waves. For such waves the instantaneous intensity is 
proportional to the square of the instantaneous electric field strength. 
When two waves interfere, the resulting intensity is not the sum of the
two intensities; rather the resulting displacement is the resultant of the
individual displacements. Consider two electromagnetic waves with the
electric vector E given by

$$E_1 = A_1 \cos 2\pi\left(\frac{x}{\lambda} - \nu t\right)$$
$$E_2 = A_2 \cos\left[2\pi\left(\frac{x}{\lambda} - \nu t\right) - \varphi\right] \tag{11.10}$$

which interfere to give a resultant displacement

$$E = E_1 + E_2 = A_1 \cos 2\pi\left(\frac{x}{\lambda} - \nu t\right) + A_2 \cos 2\pi\left(\frac{x}{\lambda} - \nu t\right)\cos\varphi + A_2 \sin 2\pi\left(\frac{x}{\lambda} - \nu t\right)\sin\varphi$$
$$= \sqrt{A_1^2 + A_2^2 + 2A_1A_2\cos\varphi}\ \cos\left[2\pi\left(\frac{x}{\lambda} - \nu t\right) - \delta\right] \tag{11.11}$$

where $\tan\delta = A_2 \sin\varphi/(A_1 + A_2 \cos\varphi)$. Thus the time average of the
intensity over a full cycle is proportional to $A_1^2 + A_2^2 + 2A_1A_2\cos\varphi$.
The corresponding intensity of wave 1 alone is proportional to $A_1^2$, and
that due to wave 2 is proportional to $A_2^2$. We see that the resulting
intensity differs from the sum of the individual intensities by $2A_1A_2\cos\varphi$,
which is known as the interference term.
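
The appearance of the interference term can be confirmed numerically by averaging the square of the resultant field over one cycle. The following Python sketch does this by direct sampling; the function names and the sample amplitudes are illustrative, and the factor of one-half in the prediction is simply the cycle average of a squared cosine.

```python
import math

def mean_square_resultant(a1, a2, phi, samples=10000):
    """Average of (E1 + E2)^2 over one full cycle, by uniform sampling.

    `phase` stands for 2*pi*(x/lambda - nu*t); wave 2 lags wave 1 by phi.
    """
    total = 0.0
    for i in range(samples):
        phase = 2.0 * math.pi * i / samples
        e1 = a1 * math.cos(phase)
        e2 = a2 * math.cos(phase - phi)
        total += (e1 + e2) ** 2
    return total / samples

def predicted_mean_square(a1, a2, phi):
    """One-half of A1^2 + A2^2 + 2 A1 A2 cos(phi)."""
    return 0.5 * (a1**2 + a2**2 + 2.0 * a1 * a2 * math.cos(phi))

if __name__ == "__main__":
    for phi in (0.0, math.pi / 3.0, math.pi / 2.0, math.pi):
        print(phi,
              mean_square_resultant(1.0, 0.5, phi),
              predicted_mean_square(1.0, 0.5, phi))
```

For equal amplitudes and a phase difference of $\pi$ the average falls to zero, although each wave alone would give a positive intensity.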

In the wave description of electromagnetic waves the intensity is
proportional to the square of the electric intensity E, while in the particle
description the intensity is proportional to the number of photons N
passing through a unit area in unit time. In free space the intensity
I of a beam with a single frequency $\nu$ is

$$I = c\epsilon_0 E^2 = Nh\nu \tag{11.12}$$

where c = speed of light
      $\epsilon_0$ = permittivity of free space
      h = Planck’s constant

Clearly the photon flux N is proportional to $E^2$, since all other quantities
are constant. When N is very large, we can measure E and expect to
find $E^2$ a valid measure of N. Under this high-flux condition a scintillation
detector would show continuous and essentially constant response
with the discrete nature of the photons washed out. However, as the
flux is reduced until only one photon arrives each second, the particle
nature becomes prominent and the wave treatment ceases to give a
useful description of the beam. However, the average value of $E^2$ over
a reasonable time interval is a meaningful measure of the photon flux N;
or alternatively, the value of $E^2\,dV$ is proportional to the probability
of finding a photon in an infinitesimal volume element dV.
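
To get a feeling for the magnitudes involved in Eq. (11.12), the photon flux N can be computed from the intensity and the wavelength, since $h\nu = hc/\lambda$. The sketch below assumes SI units throughout; the function name and the sample beam (1 W/m² of 500-nm light) are illustrative and not taken from the text.

```python
H = 6.62607015e-34     # Planck's constant, J s
C_LIGHT = 2.99792458e8 # speed of light, m/s

def photon_flux(intensity_w_per_m2, wavelength_m):
    """N = I / (h nu) = I * lambda / (h c), in photons per m^2 per second."""
    return intensity_w_per_m2 * wavelength_m / (H * C_LIGHT)

if __name__ == "__main__":
    # Illustrative beam: 1 W/m^2 of 500-nm light.
    print(photon_flux(1.0, 500e-9))
```

Even this modest beam carries more than $10^{18}$ photons per square meter per second, which is why the discreteness of the photons is ordinarily washed out.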

We develop the analogy between photon and electron beams by
assuming that the electrons in a beam traveling in the x direction with
constant momentum $p_x$ are controlled by “pilot” waves with a wave
function $\Psi$ which plays the same role for electrons that the electric
intensity plays for photons. Following de Broglie, we take

$$\Psi = Ae^{2\pi i(x/\lambda - \nu t)} = Ae^{i(p_x x - Et)/\hbar} \tag{11.13}$$

where we have made use of $\lambda = h/p$ and $E = h\nu$. This relation is
analogous to Eq. (11.10), which is just the real part of Eq. (11.13).
For the electron beam we are discussing $\Psi$ is a complex number, while
the electric intensity E is an observable quantity and hence real. That
there should be differences in the wave descriptions of electrons and
photons is not at all surprising, since the electron has rest mass, charge,
and a different spin than the photon. Electrons and photons have
analogous behavior in some ways but are very different in others.

The question at once arises: What is the physical significance of
the mysterious $\Psi$? An answer sometimes given to this question is that
$\Psi$ is merely an auxiliary mathematical quantity that is introduced to
facilitate computations relative to the results of experiment. For example,
in the experiments described above on the diffraction of electrons
by a crystal, the experimenter sets up an electron gun that fires electrons
of a certain energy at a crystal. A detector placed at a certain angle
gives indications which are taken to mean that electrons are being
received. In order to develop a quantitative theory for such observations,
we assume that a beam of electron waves of frequency $\nu = E/h$
falls on the crystal, and we calculate the intensity of the waves scattered
in the direction of the detector. From this we infer, following certain
rules, what the indication of the detector should be. In order to make
such calculations, it is not really necessary to attach any physical significance
at all to the mathematical symbol $\Psi$.

There is much to be said for such a view. After all, observational 
results are the primary material of physics; the purpose of theories is to 
correlate these results, to group them into those regularities of experience 
which we call laws, and to predict the results of new experiments. Even 
the notion of the electron as a particle is only an auxiliary concept,
introduced for convenience in describing and interpreting observational 
results. Such auxiliary concepts as particles constitute convenient aids 
to thinking, and most physicists find it advantageous to make use of 
them. Whereas some experimental results are conveniently understood 
in terms of the electrons as particles, however, others can be understood 
only in terms of the interference of waves. It is worthwhile, therefore,
to go as far as we can toward assigning some physical significance to
these waves or to the quantity $\Psi$ in terms of which the mathematical
theory of the waves is expressed. It turns out that this can be done,
according to circumstances, in two rather different ways.

1. In dealing with an experiment on scattering such as that just
described, a beam of waves of indefinite total length is commonly assumed,
represented by a $\Psi$ like that in Eq. (11.3). The square of the absolute
value of $\Psi$, $|\Psi|^2$, is then taken to be proportional (for given particle energy)
to the number of particles in the beam that cross unit area per second, the
unit area being taken perpendicular to the direction of motion.¹ Thus,
in an experiment on the diffraction of electrons, let $N_0$ electrons cross
unit area per second in the incident beam and let $\Psi_0$ denote the mathematical
expression assumed for $\Psi$ in the waves representing this beam.
Then, if calculation by means of wave mechanics gives $\Psi_d$ as the value
of $\Psi$ in a certain direction of diffraction, the theoretical value for the
intensity of the diffracted beam in that direction, defined as the number
of diffracted electrons crossing unit area per second, is

$$N_d = \frac{|\Psi_d|^2}{|\Psi_0|^2} N_0 \tag{11.14}$$

[provided the incident and the diffracted beams move in regions where P
in (11.1) has the same value]. This procedure corresponds exactly to
the method of handling similar problems in optics. The intensity of a
beam of light, which might be defined as the number of photons crossing
unit area per second, is proportional (in a given medium) to $E^2$, the square
of the electric vector. The principal difference here is that, $\Psi$ being
a complex number, we must use $|\Psi|^2$ instead of $\Psi^2$. Usually $\Psi^*\Psi$ is
written instead of $|\Psi|^2$, $\Psi^*$ denoting the complex conjugate of $\Psi$.

Returning now to the problem of the two-slit-interference pattern,
we see that if Eq. (11.13) is indeed applicable to electrons, we should
expect the curve of Fig. 11.6. At any point on the screen there is a
contribution Ψ_1 from the first slit and a contribution Ψ_2 from the
second slit, so that the resulting wave-function probability amplitude is
Ψ = Ψ_1 + Ψ_2. The probability of an electron's arriving at this point
on the screen is proportional to Ψ*Ψ. If there are places where
Ψ_1 = −Ψ_2, Ψ*Ψ goes to zero and an electron interference pattern is to
be expected.
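The superposition rule just described can be illustrated numerically. The following is a minimal sketch, not taken from the text: it assumes two contributions of equal magnitude that differ only by a phase angle, and the function name `two_slit_probability` and its parameters are hypothetical names chosen for this illustration.

```python
import cmath
import math


def two_slit_probability(delta_phase, amp1=1.0, amp2=1.0):
    """Relative probability Psi* Psi for Psi = Psi_1 + Psi_2.

    Psi_1 is taken as a real amplitude and Psi_2 as an amplitude that
    lags or leads it by delta_phase (an assumption of this sketch).
    """
    psi1 = complex(amp1, 0.0)
    psi2 = amp2 * cmath.exp(1j * delta_phase)
    psi = psi1 + psi2
    return (psi.conjugate() * psi).real


# Scan the phase difference over one full cycle.
for fraction in (0.0, 0.25, 0.5, 0.75, 1.0):
    phase = 2.0 * math.pi * fraction
    print(f"phase = {fraction:4.2f} x 2pi   Psi*Psi = {two_slit_probability(phase):.3f}")
```

Where the two contributions are in phase the relative probability is four times that of one slit alone, and where Ψ_1 = −Ψ_2 it vanishes, which is the origin of the dark fringes.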

¹ This flux relation is special for the case in hand. See Sec. 12.7 for
a general relation.

2. There are other cases, as in the photoelectric effect, in which
one wishes to follow the flight of a single electron or other particle.
Then Ψ is taken to refer to a single particle. Usually, in a given case,
values of Ψ appreciably different from zero occur only within some finite
region. A solution of the wave equation of this latter type is sometimes
called a wave packet. In such a case it is natural to ask: Where is the
particle in relation to the wave packet? The accepted answer may be
stated in terms of probabilities. The position of the particle is usually
considered not to be defined any more closely than is indicated by the
values of Ψ. At any given instant, if a suitable observation were made,
the particle might be found at any point where Ψ is different from zero;
the probability of finding it in the neighborhood of a given point is
proportional to the value of |Ψ|² at that point. More exactly, the
probability density at any point is represented by |Ψ|²; the probability
of finding the particle within any element of volume dx dy dz is

    \Psi^*\Psi \, dx \, dy \, dz    (11.15)

The wave scalar Ψ itself is sometimes called a probability amplitude for
position of the particle.

The interpretation just stated imposes upon Ψ a certain mathematical
requirement. For the total probability of finding the particle
somewhere is, of course, unity. Hence Ψ must satisfy the condition that

    \iiint \Psi^*\Psi \, dx \, dy \, dz = 1    (11.15a)

the triple integral extending over all possible values of x, y, and z.
A Ψ satisfying this requirement is said to be normalized (to unity).
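The normalization requirement can be illustrated in one dimension. The sketch below is only an illustration under stated assumptions: the Gaussian envelope, the grid spacing, the carrier wave number `k0`, and the helper name `normalize` are all choices made here, and the integral is replaced by a simple Riemann sum.

```python
import cmath
import math


def normalize(psi_values, dx):
    """Scale sampled values of Psi so that the sum of Psi* Psi dx equals 1."""
    total = sum(abs(p) ** 2 for p in psi_values) * dx
    scale = 1.0 / math.sqrt(total)
    return [scale * p for p in psi_values]


dx = 0.01
xs = [-10.0 + i * dx for i in range(2001)]   # grid from -10 to +10
sigma = 1.0                                  # assumed packet width
k0 = 5.0                                     # assumed carrier wave number

# An unnormalized packet: a plane wave times a Gaussian envelope.
raw = [cmath.exp(1j * k0 * x) * math.exp(-x * x / (4.0 * sigma ** 2)) for x in xs]
psi = normalize(raw, dx)

total_probability = sum(abs(p) ** 2 for p in psi) * dx
print("total probability after normalization:", total_probability)
```

Only the overall constant multiplying Ψ is changed by this step; the shape of the packet, and hence the relative probabilities, are untouched.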

There are also interesting relations between a wave packet and the
velocity or momentum of the particle. A detailed study brings out the
following features. Suppose the particle is in free space with P = 0,
so that E = p²/2m. Such a particle cannot be represented by Eq.
(11.13), which denotes an infinite train of waves, since it makes

    \iiint \Psi^*\Psi \, dx \, dy \, dz = \infty

unless A = 0. Thus we conclude that it is not possible for a particle
to have a perfectly definite momentum or velocity. We can, however,
have as close an approach as we please to a definite momentum, for Ψ
may be assumed to be indistinguishable from (11.13) over as large a
region as desired, sinking to zero outside the boundary of this region;
the constant A can then be chosen small enough so that
∫∫∫ Ψ*Ψ dx dy dz = 1. But then the position of the particle is not well
defined, since Ψ*Ψ is finite over a considerable volume.


11.8 The Heisenberg Uncertainty Principle 


De Broglie's proposal that the motion of a particle with velocity v is
controlled by pilot waves with group velocity v suggests that there is
a limit beyond which we cannot determine simultaneously both the
momentum and the position of the particle. Suppose that we use the
two waves of Fig. 11.2 to represent the pilot wave for a particle,
assuming that the particle is somewhere in the shaded half wave of the
envelope. Then the uncertainty in position Δx corresponds to half the
wavelength of the envelope, from which Δx = π/Δk (Fig. 11.2). From
k = 2π/λ = 2πp/h, we have Δk = (2π Δp)/h. The propagation numbers of
the two waves differ by 2Δk, so that the uncertainty in the x component
of the momentum is Δp_x = (2h Δk)/2π, whence

    \Delta x \, \Delta p_x = h    (11.16)
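The arithmetic behind Eq. (11.16) can be checked directly. This is a small sketch under the assumptions stated in the text (position uncertain by half an envelope wavelength, propagation numbers differing by 2Δk); the function name `beat_uncertainties` and the sample values of Δk are hypothetical.

```python
import math

H = 6.62607015e-34  # Planck constant in J s


def beat_uncertainties(delta_k):
    """Return (delta_x, delta_p) for two beating waves whose
    propagation numbers are k + delta_k and k - delta_k."""
    delta_x = math.pi / delta_k                     # half wave of the envelope
    delta_p = H * (2.0 * delta_k) / (2.0 * math.pi)  # from p = h k / (2 pi)
    return delta_x, delta_p


for delta_k in (1.0e6, 1.0e9, 1.0e12):              # in rad/m, arbitrary samples
    dx, dp = beat_uncertainties(delta_k)
    print(f"delta_k = {delta_k:.1e}  delta_x = {dx:.3e} m  "
          f"delta_p = {dp:.3e} kg m/s  product/h = {dx * dp / H:.6f}")
```

The product is independent of Δk: narrowing the envelope (larger Δk) widens the spread in momentum in exact proportion.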


It is not surprising that our choice of two beating waves of slightly
different frequency is far from an optimum way in which to choose the
pilot waves for the motion of a particle. It gave us (Fig. 11.2) a beat
pattern which resembles a row of beads. It is possible (Sec. 12.5) to
eliminate all but one "bead" by adding other frequencies (or k's) and
thus devise a wave packet which can be localized to any Δx we choose.
But even when a wave packet of optimum shape is selected, it is
impossible to reduce both Δx and Δp_x to zero simultaneously.¹ Indeed,
regardless of the wave packet chosen,

    \Delta x \, \Delta p_x \ge \frac{\hbar}{2}    (11.17)

This fact was first enunciated in 1927 by Heisenberg, who called it the
principle of Unbestimmtheit. This term has been variously translated
as indeterminacy, indefiniteness, or uncertainty.

The indefiniteness we have just found to exist in the values of 
certain mechanical magnitudes associated with a particle, such as its 
position or momentum, is a fundamental feature of wave mechanics. 
The indefiniteness in position can be minimized by making the wave 
packet very small (Ψ practically zero except within a very small region);
but in that case it can be shown that the packet will spread rapidly 
because a broad range of propagation numbers is required. Thus, a 
small packet means a large indefiniteness in momentum and velocity. 
On the other hand, if we give to Ψ a form like that in Eq. (11.13) over
a large region, in order to fix the velocity and momentum of the particle 
within narrow limits, there is a large indefiniteness in the position. 

¹ Unfortunately we need certain developments in Chap. 12 to define Δx
and Δp_x quantitatively. Until this is done in Sec. 12.6, the
quantitative aspect of Eq. (11.16) must remain vague.

The uncertainty principle has an analog in the field of optics. A
single sinusoidal wave of light of wavelength λ represents a certain
amount of energy that is closely localized in space, but it does not
constitute monochromatic light; for, upon passing through a spectroscope,
it will spread widely in the spectrum (see Fig. 11.7). To have an
approach to monochromatic light, we must have a train of many waves; and
this means a corresponding dispersal of the energy in space. In general,
any wave packet can be expanded in terms of wave trains like (11.13),
just as any patch of light waves can be resolved into monochromatic
trains. This amounts to representing Ψ by a Fourier integral. When
this has been done in a suitable way, the coefficients of the various
wave trains in the expansion constitute a probability amplitude for
momentum; i.e., the square of the absolute value of any coefficient gives
the probability that a suitable observation would reveal the particle as
moving in the direction and with the momentum or velocity that is
associated with the corresponding wave train. Thus, a particle does not
have either a sharply defined position or a sharply defined momentum.

Fig. 11.7 A single sinusoid (a) of wavelength λ_0 analyzed by a
spectroscope has wavelength components over a broad region; a four-wave
train (b) leads to a much higher peak at λ_0 (curve I_b).
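The optical analog can be made concrete by evaluating the Fourier integral of a short wave train numerically. The sketch below is an illustration only: it assumes a truncated sine wave with unit propagation number, uses a midpoint-rule sum in place of the integral, and the names `spectrum` and `half_width` are hypothetical helpers introduced here.

```python
import cmath
import math


def spectrum(n_cycles, k, k0=1.0, samples_per_cycle=200):
    """Magnitude of the Fourier integral of sin(k0 x) taken over
    n_cycles complete wavelengths, evaluated at propagation number k."""
    length = n_cycles * 2.0 * math.pi / k0
    n = n_cycles * samples_per_cycle
    dx = length / n
    total = 0j
    for j in range(n):
        x = (j + 0.5) * dx                      # midpoint rule
        total += math.sin(k0 * x) * cmath.exp(-1j * k * x) * dx
    return abs(total)


def half_width(n_cycles, k0=1.0, step=0.005):
    """Distance above k0 at which the magnitude first falls to half
    of its value at k0."""
    peak = spectrum(n_cycles, k0, k0)
    k = k0
    while spectrum(n_cycles, k, k0) > 0.5 * peak:
        k += step
    return k - k0


for cycles in (1, 4):
    print(f"{cycles} cycle(s): height at k0 = {spectrum(cycles, 1.0):.3f}, "
          f"half width = {half_width(cycles):.3f}")
```

A train of four waves gives a peak four times as high and several times narrower than a single sinusoid, in line with the figure: the longer the train in space, the more nearly monochromatic it is.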

The conclusion that a wave packet cannot represent a particle as 
having at the same time a definite position and a definite momentum 
might seem to be in conflict with the fact that, in practice, both position 
and momentum are capable of measurement; e.g., from two snapshots 
of a rifle bullet, its position and velocity at a given instant can both be 
calculated. Heisenberg pointed out, however, that this is possible only 
because, on the scale of observation used in ordinary physical measure- 
ment, the indeterminacy required by Eq. (11.17) is so minute as to be 
lost in the experimental errors.
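A rough numerical comparison shows how small the indeterminacy is on the ordinary scale. The sketch below applies Δx Δp_x ≥ ħ/2 with Δp_x = m Δv; the bullet mass and the position uncertainties are assumed values chosen only for illustration, and the function name is hypothetical.

```python
HBAR = 1.054571817e-34          # reduced Planck constant in J s
ELECTRON_MASS = 9.1093837e-31   # kg


def min_velocity_uncertainty(mass_kg, delta_x_m):
    """Smallest velocity uncertainty allowed by delta_x * delta_p >= hbar / 2."""
    return HBAR / (2.0 * mass_kg * delta_x_m)


# Assumed example: a 10-g bullet located to within one micrometer.
bullet = min_velocity_uncertainty(0.010, 1.0e-6)

# An electron located to within 1e-11 m, the atomic scale.
electron = min_velocity_uncertainty(ELECTRON_MASS, 1.0e-11)

print(f"bullet:   delta_v >= {bullet:.2e} m/s")
print(f"electron: delta_v >= {electron:.2e} m/s")
```

For the bullet the limit lies many orders of magnitude below any conceivable experimental error, while for the electron it is comparable to the speeds of electrons in atoms.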

It is quite otherwise for an electron or a molecule. Consider, for
example, how an electron might be located with atomic precision. We
might use a microscope; but then we should have to use light of extremely
short wavelength in order to secure sufficient resolving power. To
distinguish positions 10^-11 m apart, for example, we should have to use
γ rays. Under these circumstances, however, the effect of the light on
the electron cannot be neglected. If we are to "see" the electron, at
least one photon must bounce off it and enter the microscope. In
rebounding from the electron this photon will give it a strong Compton
kick. Thus at the instant when we locate the electron, its momentum
undergoes a discontinuous change. Furthermore, there is an indefiniteness
about the magnitude of this change, for it will vary according to
the direction in which the scattered photon leaves the scene of action.
We cannot limit closely the range of possible directions for the scattered
photons that enter the microscope, by stopping down the aperture,
without a serious loss of resolving power. A quantitative analysis of
such observations leads again to Eq. (11.17).

It appears, then, that we cannot at the same time assign to an electron
or other small particle, in terms of actual or possible observations, a
definite position and a definite momentum. Thus, an assertion … …ns or
molecules. These small … particle properties, but they also …; in the
classical sense of the words, they are neither true particles nor true
waves. C. G. Darwin proposed calling them wavicles.

The Heisenberg uncertainty relationship applies to any pair of
variables which are canonically conjugate in the hamiltonian formulation
of mechanics. In addition to Δx Δp_x ≥ ħ/2 we have Δy Δp_y ≥ ħ/2,
Δz Δp_z ≥ ħ/2 for cartesian coordinates; ΔE Δt ≥ ħ/2, where ΔE is the
uncertainty in energy and Δt that in time; Δθ ΔA_θ ≥ ħ/2, where ΔA_θ
is the uncertainty associated with the angular momentum A_θ and Δθ
the indefiniteness in the corresponding angle; and so forth.


11.9 The Schrödinger Wave Equation 


If a particle has wave properties, as de Broglie proposed, it is expected 
that there should be some sort of wave equation which describes the 
behavior of the wave function. The usual basis of the mathematical 
theory of wave motion is furnished by a differential equation. Thus 
an electromagnetic wave moving in the x direction can be described by

    \frac{\partial^2 E_y(x,t)}{\partial x^2} = \frac{1}{c^2} \frac{\partial^2 E_y(x,t)}{\partial t^2}

in which E_y is the y component of the electric intensity.

We cannot follow a logical development from first principles to 
find the differential equations for matter waves, but we can guess a wave 
equation in harmony with the properties which we find the waves to 
have and then test our guess by comparison of experimental data with 
predictions deduced from this wave equation. Let us assume that Eq.
(11.13) is an appropriate representation of a plane wave moving in the
x direction, so that

    \Psi = A e^{i(p_x x - Et)/\hbar}    (11.13)

in which E and p_x for the particle "piloted" by Ψ are related by

    \frac{p_x^2}{2m} + P = E    (11.18)

where P is the potential energy of the particle and p_x²/2m is its
kinetic energy. Whatever the correct differential equation for Ψ may be,
it should be such that any allowable function expressed in the form of
Eq. (11.13) is a solution of it, regardless of the value of E or p_x.
Let us endeavor, therefore, to find by trial a differential equation such
that, when the expression just given for Ψ is substituted in it, E and
p_x cancel out in consequence of the known relationship expressed in
Eq. (11.18).


A few derivatives of Ψ as obtained from Eq. (11.13) are

    \frac{\partial \Psi}{\partial x} = \frac{i}{\hbar} p_x \Psi    (11.19a)

    \frac{\partial^2 \Psi}{\partial x^2} = -\frac{1}{\hbar^2} p_x^2 \Psi    (11.19b)

    \frac{\partial \Psi}{\partial t} = -\frac{i}{\hbar} E \Psi    (11.19c)


If Eq. (11.18) is multiplied by Ψ to give p_x²Ψ/2m + PΨ = EΨ
and use is made of Eqs. (11.19b, c), we have the equation, first obtained
by Schrödinger in 1926,

    -\frac{\hbar^2}{2m} \frac{\partial^2 \Psi}{\partial x^2} + P\Psi = -\frac{\hbar}{i} \frac{\partial \Psi}{\partial t}    (11.20)
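That a plane wave of the form (11.13) satisfies Eq. (11.20) whenever E and p_x obey Eq. (11.18) can be checked numerically. The following sketch is only an illustration: it works in assumed units with ħ = m = 1, replaces the derivatives by central finite differences, and the function names are hypothetical.

```python
import cmath

HBAR = 1.0   # assumed natural units for this sketch
MASS = 1.0


def plane_wave(x, t, p, energy, amplitude=1.0):
    """Psi = A exp[i (p x - E t) / hbar], the form of Eq. (11.13)."""
    return amplitude * cmath.exp(1j * (p * x - energy * t) / HBAR)


def schrodinger_residual(x, t, p, potential, step=1.0e-4):
    """|left side - right side| of Eq. (11.20) for a plane wave whose
    energy is fixed by Eq. (11.18): E = p^2 / 2m + P."""
    energy = p * p / (2.0 * MASS) + potential

    def psi(xx, tt):
        return plane_wave(xx, tt, p, energy)

    # Central-difference approximations to the derivatives.
    d2_dx2 = (psi(x + step, t) - 2.0 * psi(x, t) + psi(x - step, t)) / step ** 2
    d_dt = (psi(x, t + step) - psi(x, t - step)) / (2.0 * step)

    left = -(HBAR ** 2 / (2.0 * MASS)) * d2_dx2 + potential * psi(x, t)
    right = -(HBAR / 1j) * d_dt
    return abs(left - right)


print("residual:", schrodinger_residual(x=0.3, t=0.7, p=1.5, potential=0.4))
```

The residual is limited only by the finite-difference error, whatever values of p_x and P are chosen, which is the property demanded of the wave equation in the argument above.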

Equation (11.20) is Schrödinger's famous wave equation containing
the time. The agreement of results deduced from this equation justifies
the belief that the equation is valid for the matter waves associated with
a particle of mass m as long as relativistic effects can be neglected. The
equation is remarkable among the differential equations of mathematical
physics in that it contains i = √−1. But for this factor it would
resemble the equation for the flow of heat in a solid body, which likewise 
contains the first derivative with respect to time. The motion of matter 
waves combines, in fact, some of the features of heat flow in a solid 
with other features resembling the propagation of mechanical disturb- 
ances, such as sound waves. 

Let us consider Eq. (11.20) for the case in which E, p_x, and P are
all constants satisfying Eq. (11.18) and A is a constant representing
the wave amplitude. If E > 0 and p_x > 0, this solution of the wave
equation represents plane waves of Ψ traveling toward +x; for at a
point moving with velocity E/p_x, the value of Ψ remains constant.
E and p_x represent energy and momentum of the associated particle,
respectively; the frequency ν and the wavelength λ of the waves are
easily seen to be ν = E/h, λ = h/p_x in accordance with conclusions
reached previously. If p_x < 0 but E > 0, both the particle and the
waves are moving toward −x.

If E is negative, as may happen if P is sufficiently negative, the 
phase waves and the particle travel in opposite directions. We can 
always make this so, if we wish, by adjusting the arbitrary additive 
constant that occurs in P, the potential energy. The possibility of 
reversing the direction of motion of matter waves by a mere mathematical
change of this sort is a clear indication that they cannot be ordinary
real waves!

When the potential energy P is a function of x only and the total
energy E is constant, we may write

    \Psi(x,t) = \psi(x) f(t)    (11.21)

and substitute this into Eq. (11.20). Upon dividing through by f(t)
and making use of Eq. (11.19c) we have

    -\frac{\hbar^2}{2m} \frac{d^2 \psi}{dx^2} + P\psi = E\psi    (11.22)

which is the one-dimensional time-independent Schrödinger equation, also
known as the Schrödinger wave equation.


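For motion in three dimensions Ψ is a function of x, y, z, and t;
the Schrödinger equation becomes

    -\frac{\hbar^2}{2m} \left( \frac{\partial^2 \Psi}{\partial x^2} + \frac{\partial^2 \Psi}{\partial y^2} + \frac{\partial^2 \Psi}{\partial z^2} \right) + P\Psi = -\frac{\hbar}{i} \frac{\partial \Psi}{\partial t}    (11.23)

or

    -\frac{\hbar^2}{2m} \nabla^2 \Psi + P\Psi = -\frac{\hbar}{i} \frac{\partial \Psi}{\partial t}    (11.23a)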


It was early noticed that Schrödinger's equation can be obtained
from the classical hamiltonian expression for the energy of the moving
particle by substituting suitable differential operators for energy and
momentum. For example, Eq. (11.19a) can be written

    p_x \Psi = \frac{\hbar}{i} \frac{\partial \Psi}{\partial x}

from which we are led to the successful approach of replacing p_x by the
operator

    \hat{p}_x = \frac{\hbar}{i} \frac{\partial}{\partial x}    (11.24a)

where the "hat" is used to distinguish an operator from the physical
quantity. Similarly, we use analogy and Eq. (11.19c) to make plausible
the differential operators

    \hat{p}_y = \frac{\hbar}{i} \frac{\partial}{\partial y}    (11.24b)

    \hat{p}_z = \frac{\hbar}{i} \frac{\partial}{\partial z}    (11.24c)

    \hat{E} = -\frac{\hbar}{i} \frac{\partial}{\partial t}    (11.24d)

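The total energy of a particle in the hamiltonian form is

    H = \frac{1}{2m} (p_x^2 + p_y^2 + p_z^2) + P = E    (11.25)

where H denotes the hamiltonian and P the potential energy. If we let

    \hat{H} = -\frac{\hbar^2}{2m} \left( \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2} + \frac{\partial^2}{\partial z^2} \right) + P    (11.26)

and we multiply Eq. (11.25) by Ψ, we have, upon replacing H, p_x, E,
etc., by their operators,

    \hat{H}\Psi = \hat{E}\Psi    or    -\frac{\hbar^2}{2m} \nabla^2 \Psi + P\Psi = -\frac{\hbar}{i} \frac{\partial \Psi}{\partial t}

Thus we have recovered the Schrödinger equation by a procedure which
lies close to Schrödinger's original line of thought.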

Even though we have not been able to derive Schrödinger's equation,
its fabulous success in dealing with problems of modern physics
justifies our use of it in nonrelativistic applications, some of which we 
consider in subsequent chapters. 


Problems 


1. The ionization potential of helium is 24.6 V. Find the de Broglie
wavelength of an electron of 24.6 eV kinetic energy. What is the ratio
of this wavelength to the radius of the first Bohr orbit calculated on the
assumption that the effective nuclear charge is given by
13.6 Z_eff² = 24.6?

Ans: 2.48 Å; 6.3


2. Assume that a particle cannot be confined to a spherical volume
of diameter less than the de Broglie wavelength of the particle. Estimate
the minimum kinetic energy a proton confined to a nucleus of
diameter 10^-14 m may have. What kinetic energy would an electron
have to possess if it were confined to this nucleus? (This is one of the
arguments against the idea that free electrons exist in nuclei.)
Ans: 8.2 MeV; 124 MeV
3. The half-life of a certain excited state is about 8 ns. If this is
essentially the uncertainty Δt for photon emission, calculate the
uncertainty in frequency Δν, assuming that ΔE Δt = h. Find Δν/ν if the
photons have λ = 500 mμ.

Ans: 1.25 × 10^8 Hz; 2 × 10^-7
4. The Heisenberg uncertainty principle suggests that a particle of
momentum p cannot be confined by a central force to a circle of radius
r less than ħ/p. Assuming that r = ħ/p, write the total energy
E = K + P as a function of r for a hydrogen electron. Find the value
of r for which E is minimum and show that it is equal to the Bohr radius.
Sketch E, K, and P as functions of r.
5. Imagine that the angular position of an electron initially in the
ground state of hydrogen is determined by an experiment to an accuracy
Δφ = 0.2 rad. If ΔA_φ Δφ ~ ħ, calculate the uncertainty in the angular
momentum A_φ and compare it with the angular momentum postulated
for the first Bohr orbit. Assuming for the moment that the electron
continues in an orbit with the Bohr radius, find the uncertainty in the
kinetic energy K and compare it to the ionization energy. Is the
electron still in the ground state? (Remember the magnitude of a
quantity is at least as great as the uncertainty in the quantity.)

Ans: 5ħ; ΔK > 35 K_ion


6. An electron has a de Broglie wavelength of 0.15 Å. Compute the
phase and group velocities of the de Broglie waves. Find the kinetic
energy of the electron.


7. Compute the kinetic energy of an electron which has a de Broglie
wavelength equal to the wavelength of a Mo Kα photon (0.707 Å).


8. Find the kinetic energy of a neutron which has a wavelength of
3 Å. At what angle will such a neutron undergo first-order Bragg
reflection from a calcite crystal for which the grating space is 3.036 Å?
9. Calculate the de Broglie wavelength of (a) an electron, (b) a muon, 
(c) a proton, and (d) a Hg?% atom if each has a kinetic energy of 25 eV. 
10. Calculate the de Broglie wavelength for an average helium atom in
a furnace at 400°K for which the kinetic energy is 3kT/2.
Ans: 0.66 Å


11. In an experiment the wavelength of a photon is measured to an
accuracy of one part per million; that is, Δλ/λ = 10^-6. What is the
approximate minimum uncertainty Δx in a simultaneous measurement
of the position of the photon in the case of (a) a photon with
λ = 6000 Å and (b) a photon with λ = 5 Å?
12. The radius of a typical atom is approximately 10^-10 m = 1 Å.
To locate the nucleus or an electron within a distance of 5 × 10^-12 m
by use of electromagnetic waves, the wavelength must be at least this
small. Calculate the energy and momentum of a photon with
λ = 5 × 10^-12 m. If Δx is 5 × 10^-12 m for an electron, what is the
corresponding uncertainty in its momentum? This represents one of
Heisenberg's gedanken experiments illustrating the uncertainty principle.
13. A nucleon, either proton or neutron, is confined to a nucleus of
radius 4 × 10^-15 m. If this implies an uncertainty Δx ≈ 8 × 10^-15 m,
what is the corresponding uncertainty in p_x? Since the momentum
must be at least as big as the uncertainty Δp_x, estimate the kinetic
energy and the speed of a nucleon by taking p = Δp_x. (Note that these
are low values since we have overestimated Δx and underestimated p.)
14. According to the theory of Abbe, a microscope can resolve the
images of two objects if the objects are separated by a distance
s = λ/(2n sin α), where n is the index of refraction of the medium between
the object and the microscope objective and α is the half-angle
subtended by the objective. (In practice, n sin α has an upper
limit of … .) If n sin α is 1.5 for the microscope in question and
we wish to resolve objects separated by 0.5 Å, find the minimum energy
required for (a) photons and (b) electrons. What are the corresponding
momenta?
15. A beam of 36-eV electrons is incident normally on a thin
polycrystalline metal film for which the grating space a is 2.5 Å. If the
depth φ of the potential well representing the metal is 4 eV, find the
angular diameter of the smallest ring of the diffraction pattern formed
by electrons passing through the foil.


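16. Derive Eq. (11-82) and show that (a) for K/m_0c² < 2,

    \lambda = \frac{h}{\sqrt{2 m_0 K}} \left( 1 - \frac{K}{4 m_0 c^2} + \frac{3}{32} \frac{K^2}{m_0^2 c^4} + \cdots \right)

and (b) for K/m_0c² > 2,

    \lambda = \frac{hc}{K} \left( 1 - \frac{m_0 c^2}{K} + \frac{3 m_0^2 c^4}{2 K^2} + \cdots \right)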

17. Show that Eq. (11.9) is equivalent to Bragg's law nλ = 2d sin θ.
Hint: Show that d = D sin (φ/2), while sin θ = cos (φ/2).


18. How long does the uncertainty principle allow a π⁰ meson to exist
as a p–p̄ pair?




chapter twelve 


Wave Mechanics I 
Free States 


Modern quantum mechanics has led to the solution of many problems
which were classically intractable. The Schrödinger formulation is
particularly useful for our purposes, and we begin by considering the
wave-mechanical solutions for the simplest one-dimensional examples of
electron motion.


12.1 Electron Beam in a Field-free Space 


Consider a region in which the potential energy P of an electron is
constant, so that there are no forces acting. Let a uniform beam of
electrons be traveling in the x direction, each electron having momentum
p_x and total energy E = p_x²/2m + P. For this beam the Schrödinger
wave equation is

    -\frac{\hbar^2}{2m} \frac{d^2 \psi}{dx^2} + P\psi = E\psi    (11.22)


where ψ is the wave function such that the probability of finding an
electron between x and x + dx is ψ*ψ dx. From Eq. (11.22) we have

    \frac{d^2 \psi}{dx^2} + \frac{2m(E - P)}{\hbar^2} \psi = 0    (12.1)


which is the simple-harmonic-motion equation, since the coefficient of ψ
is a positive constant. The general solution of Eq. (12.1) is

    \psi = C_1 \exp\left[ \frac{i\sqrt{2m(E - P)}}{\hbar} x \right] + C_2 \exp\left[ -\frac{i\sqrt{2m(E - P)}}{\hbar} x \right]    (12.2)


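The complete wave function Ψ(x,t) = ψ(x)f(t) satisfies Eq. (11.19c),
from which

    \frac{df(t)}{dt} = -\frac{iE}{\hbar} f(t)    (12.3a)

and

    f(t) = C e^{-iEt/\hbar}    (12.3b)

Thus we find that

    \Psi(x,t) = A \exp\left\{ \frac{i}{\hbar} \left[ \sqrt{2m(E - P)}\, x - Et \right] \right\} + B \exp\left\{ -\frac{i}{\hbar} \left[ \sqrt{2m(E - P)}\, x + Et \right] \right\}    (12.4)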

is the general solution of Eq. (11.20). However, the second term of
Eq. (12.4) represents a wave moving to the left, while we have postulated
that the beam associated with Ψ is moving in the +x direction.
Therefore B is zero and

    \Psi(x,t) = A \exp\left[ \frac{i(p_x x - Et)}{\hbar} \right] = A e^{i(p_x x - Et)/\hbar}
              = A \cos\frac{p_x x - Et}{\hbar} + iA \sin\frac{p_x x - Et}{\hbar}    (12.5)

[Note that we have retrieved Eq. (11.13).]
The probability of finding an electron between x and x + dx is
given by the one-dimensional form of Eq. (11.15)

    \mathcal{P}\, dx = \Psi^*\Psi \, dx = A^2 \, dx    (12.6)

As expected for a uniform electron beam, the probability 𝒫 is not a
function of x. The constant A is related to the intensity of the beam.
If there are on the average n electrons in a length L, we must have
A² = n/L since

    \int_x^{x+L} \Psi^*\Psi \, dx = \int_x^{x+L} A^2 \, dx = A^2 L = n


12.2 The Step Barrier 


Consider next a uniform beam of electrons of energy E coming from the
left of Fig. 12.1. Let the potential energy P be 0 to the left of the
origin and P_0 to the right. Such a rectangular barrier does not exist
in nature, but it is reasonably approximate to the instantaneous potential
difference between the dees of a cyclotron or to the potential barrier
at the surface of a metal. For a monoergic electron beam the time
dependence of Ψ(x,t) is given by Eq. (12.3b) for all x and t; therefore,
our first problem is to find the spatial wave function ψ(x), which must
satisfy Eq. (11.22).


Region 1

    -\frac{\hbar^2}{2m} \frac{d^2 \psi}{dx^2} + 0 = E\psi    for -\infty < x < 0    (12.7a)

Region 2

    -\frac{\hbar^2}{2m} \frac{d^2 \psi}{dx^2} + P_0 \psi = E\psi    for 0 < x < \infty    (12.7b)

Fig. 12.1 When a beam of free electrons with kinetic energy K = E is
incident on an ideal potential barrier of height P_0 > E, the beam is
reflected. Shown in the rectangles are small sections of the real part of
the incident and reflected waves.

The general solution for region 1 is

    \psi_1 = A \exp\left( \frac{i\sqrt{2mE}}{\hbar} x \right) + B \exp\left( -\frac{i\sqrt{2mE}}{\hbar} x \right) = A e^{ik_1 x} + B e^{-ik_1 x}    (12.8)

where A and B are constants to be determined and k_1 = √(2mE)/ħ.
The term at the right represents particles moving to the left, which arise
from reflection at the barrier. For region 2 (0 < x < ∞) we must
distinguish between two situations, according as K = E − P_0 is negative
or positive. If it is negative, classical physics would forbid the
electrons (or other particles) to enter region 2, while if E − P_0 = K is
positive, classical physics predicts that all the electrons would enter
and pass freely through. Wave mechanics makes different predictions.

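case 1. E − P_0 < 0 If P_0 exceeds E, the kinetic energy K is
negative. Nevertheless, ψ is not zero in region 2 but is given by the
solution to Eq. (12.7b), which is

    \psi_2 = C \exp\left[ -\frac{\sqrt{2m(P_0 - E)}}{\hbar} x \right] + D \exp\left[ \frac{\sqrt{2m(P_0 - E)}}{\hbar} x \right] = C e^{-\alpha x} + D e^{\alpha x}    (12.9)

where C and D are constants to be determined and α = √(2m(P_0 − E))/ħ.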

Next we place two requirements on ψ and its derivatives in order
that the probability ψ*ψ dx be finite and single-valued so that the wave
function represents a specific physical situation:

1. The wave function is finite, continuous, and single-valued.

2. The first derivative (or gradient in several dimensions) of ψ is
finite, continuous, and single-valued at all points where the potential
energy P is finite; if P goes to infinity anywhere, the appropriate
boundary condition is found by starting with a finite P and taking the
limit as P goes to infinity.


From the requirement that ψ remain finite, it follows that D = 0;
if it were not, ψ_2 would become infinite as x increases without limit.
From the condition that ψ is continuous everywhere, we have at x = 0

    A + B = C    (12.10)

while the requirement that dψ/dx be continuous at x = 0 yields

    ik_1 (A - B) = -\alpha C    (12.11)

If we solve Eqs. (12.10) and (12.11) for B and C in terms of A, we find
that the amplitude of the reflected wave B is related to that of the
incident wave A by

    B = \frac{k_1 - i\alpha}{k_1 + i\alpha} A    (12.12)

while

    C = \frac{2k_1}{k_1 + i\alpha} A    (12.13)

The wave entering region 2 is exponentially damped (Fig. 12.1);
although there is a finite probability of finding electrons in the
classically forbidden region, there is no steady transmission of particles
there. In the steady state all incident particles are ultimately
reflected. To see that this is true, it is convenient to write complex
quantities in polar form, making use of the fact that k_1 + iα = ρe^{iδ},
where ρ² = k_1² + α² and tan δ = α/k_1; Eq. (12.12) then becomes

    B = \frac{\rho e^{-i\delta}}{\rho e^{i\delta}} A = e^{-2i\delta} A    (12.12a)

where δ = tan⁻¹ [(P_0 − E)/E]^{1/2}.
Substitution of Eq. (12.12a) into (12.8) yields

    \psi_1 = A \left( e^{ik_1 x} + e^{-2i\delta} e^{-ik_1 x} \right) = 2A e^{-i\delta} \cos (k_1 x + \delta)    (12.13a)

which represents a standing wave in which equal numbers of particles
are going left and right. This is just the classical behavior of
electromagnetic waves at a metal surface.
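The statement that all incident particles are reflected when P_0 > E can be checked numerically from the boundary conditions. The sketch below is only illustrative: it assumes units with ħ = m = 1 and arbitrary sample energies, and the function name `reflected_amplitude_ratio` is hypothetical.

```python
import cmath
import math


def reflected_amplitude_ratio(energy, barrier, hbar=1.0, mass=1.0):
    """Return B/A = (k1 - i alpha) / (k1 + i alpha) for a step with
    barrier height greater than the particle energy."""
    k1 = math.sqrt(2.0 * mass * energy) / hbar
    alpha = math.sqrt(2.0 * mass * (barrier - energy)) / hbar
    return complex(k1, -alpha) / complex(k1, alpha)


energy, barrier = 1.0, 3.0                       # assumed sample values
ratio = reflected_amplitude_ratio(energy, barrier)
delta = math.atan(math.sqrt((barrier - energy) / energy))

print("|B/A|           =", abs(ratio))
print("B/A             =", ratio)
print("exp(-2 i delta) =", cmath.exp(-2j * delta))
```

The ratio B/A has unit magnitude for every energy below the barrier; only its phase, −2δ, depends on how far E lies below P_0.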


case 2. E − P_0 > 0 When the total energy exceeds the potential
energy (Fig. 12.2), the kinetic energy K is positive and region 2
is classically accessible to the electrons. In this case the general
solution of Eq. (12.7b) is

    \psi_2 = G e^{ik_2 x} + H e^{-ik_2 x}    (12.14)

where k_2 = √(2m(E − P_0))/ħ. Here the first term represents a wave
traveling to the right and the second a wave traveling to the left. Since
we have assumed that we have electrons initially incident only from
the left, we set H = 0.

Fig. 12.2 When a beam of free electrons with kinetic energy K = E is
incident on an ideal potential barrier of height P_0 < E, part of the beam
is reflected and part transmitted. Small sections of the real part of the
incident, reflected, and transmitted waves are shown in the rectangles.


Applying the requirements that at x = 0 both ψ and its derivative
are continuous gives

    A + B = G    (12.15a)

    k_1 A - k_1 B = k_2 G    (12.15b)

from which the amplitudes of the reflected and transmitted waves in
terms of that of the incident wave are

    B = \frac{k_1 - k_2}{k_1 + k_2} A    (12.16)

and

    G = \frac{2k_1}{k_1 + k_2} A    (12.17)


The number of reflected electrons per unit length is proportional
to |B|² (Sec. 11.7), while the corresponding number of incident electrons
is proportional to |A|². Therefore the fraction of the incident electrons
reflected, or the reflection coefficient R, is given by

    R = \frac{|B|^2}{|A|^2} = \frac{(k_1 - k_2)^2}{(k_1 + k_2)^2}    (12.18)


Even if P_0 is negative, so that the electron is accelerated into
region 2, there is a reflection coefficient which increases as the size of
the step increases. The situation is directly analogous to that in optics,
where there is a reflected beam for normal incidence at any interface at
which the optical properties change, regardless of whether the light is
speeded up or slowed down as it enters the second medium. [Indeed,
if k_1 = 2π/λ_1 for the light in the first medium and k_2 = 2π/λ_2, Eq.
(12.18) gives the reflection coefficient for light incident normally on the
interface between two transparent media.]

The transmission coefficient T is the fraction of the incident
electrons transmitted across the barrier. To compare beam intensities in
two regions where the particles have different velocities, we must note
that the beam intensity is the number of particles crossing unit area per
unit time, and this involves the product of the particle velocity and the
number of particles per unit volume. In the nonrelativistic case the
particle velocity is given by √(2K/m). Therefore in this case

    T = \sqrt{\frac{E - P_0}{E}} \, \frac{|G|^2}{|A|^2} = \frac{k_2}{k_1} \frac{4k_1^2}{(k_1 + k_2)^2} = \frac{4 k_1 k_2}{(k_1 + k_2)^2}    (12.19)

Of course, conservation of particles requires that T + R = 1.
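The conservation statement T + R = 1 can be verified for any step for which E exceeds P_0, including a negative step. The sketch below is illustrative only: it assumes units with ħ = m = 1, and the function name `step_coefficients` and the sample energies are hypothetical.

```python
import math


def step_coefficients(energy, step_height, hbar=1.0, mass=1.0):
    """Reflection and transmission coefficients, Eqs. (12.18) and (12.19),
    for a potential step with energy greater than step_height."""
    k1 = math.sqrt(2.0 * mass * energy) / hbar
    k2 = math.sqrt(2.0 * mass * (energy - step_height)) / hbar
    reflection = (k1 - k2) ** 2 / (k1 + k2) ** 2
    transmission = 4.0 * k1 * k2 / (k1 + k2) ** 2
    return reflection, transmission


# A step up, a very small step, and a step down (negative height).
for height in (0.5, 0.01, -3.0):
    r, t = step_coefficients(energy=1.0, step_height=height)
    print(f"P0 = {height:5.2f}   R = {r:.4f}   T = {t:.4f}   R + T = {r + t:.4f}")
```

Note that the downward step still reflects part of the beam, as the text points out, and that R shrinks toward zero as the step height goes to zero.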


12.3 Barrier Penetration 


The possibility in quantum physics that a particle may penetrate into a 
region where its total energy Æ is less than its potential energy P leads 
to the tunneling of particles through thin barriers which are classically 
impenetrable. Suppose that a beam of electrons is incident from the 
left on the rectangular barrier of high Po and width w of Fig. 12.3. 


0 iy © T 


Fig. 12.3 When a beam of free electrons with kinetic energy K = E is inci- 
dent on a thin ideal barrier of height P, > E, a beam is transmitted through 
the barrier. Small sections of the real part of the incident, reflected, and 
transmitted beams are shown in the rectangles. 
Since $P = 0$ in regions 1 and 3, the Schrödinger wave equation for both
regions is Eq. (12.7a), while for region 2 it is (12.7b). As we found in
the preceding section, the solutions for a region of constant potential
energy are sinusoidal when $E > P$ (kinetic energy positive) and
exponentials for $E < P$ ($K$ negative). The general solutions are:


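Region 1: $-\infty < x < 0$

\[ \psi_1 = A \exp\left(\frac{i\sqrt{2mE}}{\hbar}\,x\right) + B \exp\left(-\frac{i\sqrt{2mE}}{\hbar}\,x\right) \tag{12.20a} \]

Region 2: $0 < x < w$

\[ \psi_2 = C \exp\left[\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] + D \exp\left[-\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] \tag{12.20b} \]

Region 3: $w < x < \infty$

\[ \psi_3 = G \exp\left(\frac{i\sqrt{2mE}}{\hbar}\,x\right) + H \exp\left(-\frac{i\sqrt{2mE}}{\hbar}\,x\right) \tag{12.20c} \]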


We immediately set $H = 0$ since no particles are coming from the
right. However, $B$ and $C$ are not zero, since we have reflections at both
surfaces. If we let $k_1 = \sqrt{2mE}/\hbar$ and $\alpha = \sqrt{2m(P_0 - E)}/\hbar$, applying
the conditions that $\psi$ and its derivatives are continuous at $x = 0$ and
$x = w$ leads to the four equations


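\[ A + B = C + D \tag{12.21a} \]
\[ ik_1 A - ik_1 B = \alpha C - \alpha D \tag{12.21b} \]
\[ C e^{\alpha w} + D e^{-\alpha w} = G e^{ik_1 w} \tag{12.21c} \]
\[ \alpha C e^{\alpha w} - \alpha D e^{-\alpha w} = ik_1 G e^{ik_1 w} \tag{12.21d} \]

Solution of Eqs. (12.21) for $A$ in terms of $G$ yields

\[ A = G e^{ik_1 w}\left[\cosh \alpha w + \frac{i}{2}\left(\frac{\alpha}{k_1} - \frac{k_1}{\alpha}\right)\sinh \alpha w\right] \tag{12.22} \]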


The transmission coefficient is $T = |G|^2/|A|^2$, or

\[ T = \left[1 + \frac{1}{4}\left(\frac{\alpha}{k_1} + \frac{k_1}{\alpha}\right)^2 \sinh^2 \alpha w\right]^{-1} \]
Since the hyperbolic sine increases rapidly with its argument, the
transmission coefficient T decreases correspondingly with w. The sensi- 
tive dependence of the transmission coefficient on the height of the 
barrier is also of importance. 
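
The dependence of $T$ on barrier width can be made concrete with a small numerical sketch. It is only an illustration under stated assumptions: natural units $\hbar = m = 1$ are chosen for convenience, and the function names are hypothetical. The second function evaluates $A/G$ from Eq. (12.22), so that $T = |G|^2/|A|^2$ can be compared with the closed form.

```python
import cmath
import math

def barrier_transmission(E, P0, w, m=1.0, hbar=1.0):
    """Closed-form T for a rectangular barrier with 0 < E < P0."""
    if not 0.0 < E < P0:
        raise ValueError("this sketch assumes 0 < E < P0")
    k1 = math.sqrt(2.0 * m * E) / hbar
    alpha = math.sqrt(2.0 * m * (P0 - E)) / hbar
    s = math.sinh(alpha * w)
    return 1.0 / (1.0 + 0.25 * (alpha / k1 + k1 / alpha) ** 2 * s * s)

def amplitude_ratio(E, P0, w, m=1.0, hbar=1.0):
    """A/G as given by Eq. (12.22)."""
    k1 = math.sqrt(2.0 * m * E) / hbar
    alpha = math.sqrt(2.0 * m * (P0 - E)) / hbar
    bracket = math.cosh(alpha * w) + 0.5j * (alpha / k1 - k1 / alpha) * math.sinh(alpha * w)
    return cmath.exp(1j * k1 * w) * bracket

for width in (0.5, 1.0, 2.0, 4.0):
    print(width, barrier_transmission(0.5, 1.0, width))
```

Doubling the width in the loop above lowers $T$ by a factor that grows roughly like $e^{2\alpha w}$ once $\alpha w$ exceeds unity, which is the rapid fall-off described in the text.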


12.4 The “Square” Well: Free States 


Let us replace the barrier of Fig. 12.3a by a well (Fig. 12.4). It proves
convenient to place the zero of potential energy at the bottom of the
well; $P = P_0$ for $-\infty < x < 0$ (region 1) and for $w < x < \infty$ (region 3),
and $P = 0$ for $0 < x < w$ (region 2). Again we begin with the
Schrödinger wave equation

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + P\psi = E\psi \tag{11.22} \]


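for which the general solutions in the three regions are:

Region 1

\[ \psi_1 = A \exp\left[\frac{i\sqrt{2m(E - P_0)}}{\hbar}\,x\right] + B \exp\left[-\frac{i\sqrt{2m(E - P_0)}}{\hbar}\,x\right] = A e^{ik_1 x} + B e^{-ik_1 x} \]

Region 2

\[ \psi_2 = C \exp\left(\frac{i\sqrt{2mE}}{\hbar}\,x\right) + D \exp\left(-\frac{i\sqrt{2mE}}{\hbar}\,x\right) = C e^{ik_2 x} + D e^{-ik_2 x} \]

Region 3

\[ \psi_3 = G \exp\left[\frac{i\sqrt{2m(E - P_0)}}{\hbar}\,x\right] + H \exp\left[-\frac{i\sqrt{2m(E - P_0)}}{\hbar}\,x\right] = G e^{ik_1 x} + H e^{-ik_1 x} \]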


If a beam of particles is incident from the left and none from the
right, $H = 0$. The requirement that $\psi$ and its derivatives be
continuous gives for the boundaries $x = 0$ and $x = w$

\[ A + B = C + D \tag{12.23a} \]
\[ ik_1 A - ik_1 B = ik_2 C - ik_2 D \tag{12.23b} \]
\[ C e^{ik_2 w} + D e^{-ik_2 w} = G e^{ik_1 w} \tag{12.23c} \]
\[ ik_2 C e^{ik_2 w} - ik_2 D e^{-ik_2 w} = ik_1 G e^{ik_1 w} \tag{12.23d} \]

Fig. 12.4 A square well of depth $P_0$ and width $w$.


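From these four equations one finds that

\[ A = G e^{ik_1 w}\left[\cos k_2 w - \frac{i}{2}\left(\frac{k_1}{k_2} + \frac{k_2}{k_1}\right)\sin k_2 w\right] \tag{12.24} \]

The transmission coefficient $T$ can be found from

\[ \frac{1}{T} = \frac{|A|^2}{|G|^2} = 1 + \frac{1}{4}\left(\frac{k_1}{k_2} - \frac{k_2}{k_1}\right)^2 \sin^2 k_2 w = 1 + \frac{1}{4}\,\frac{P_0^2}{E(E - P_0)}\,\sin^2\frac{\sqrt{2mE}\,w}{\hbar} \tag{12.25} \]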


Equation (12.25) shows the interesting property that the transmission
coefficient rises and falls as the energy $E$ is increased, reaching unity
when the argument of the sine is an integer times $\pi$.

This result permits us to make a qualitative explanation of the 
Ramsauer effect, discovered in 1920 and inexplicable classically. Rams- 
auer found that when he passed a beam of monoergic electrons through 
certain noble gases such as argon, krypton, and xenon, there was almost 
no scattering for electron energies of 1 to 2 eV, while it was great for 
both lower and higher energies. Of course, an argon atom is only 
roughly approximated by a square potential well, but there is an attrac- 
tive potential as the incident electron enters the atom because the outer 
electrons no longer screen the nucleus. Argon atoms are almost trans- 
parent to 2-V electrons, because for that E the transmission coefficient 
given by the analog of Eq. (12.25) approaches unity. 

Some insight into the reason for the variation of $T$ may be gained
by recalling that both the wave function $\psi$ and its derivative $d\psi/dx$ must
be continuous at both $x = 0$ and $x = w$. As $E$ is varied, the
wavelengths are changed, both inside and outside the well. Only if the width
of the well has precisely the correct relationship to the wavelength can
the wave in region 3 have an amplitude equal to that of the incident
wave in region 1.
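
The rise and fall of $T$ with energy can be seen by evaluating Eq. (12.25) directly. The following Python sketch is illustrative only; it assumes natural units $\hbar = m = 1$, measures energies from the bottom of the well as in the text, and uses hypothetical function names. The helper `resonance_energy` returns the energy at which $k_2 w = n\pi$.

```python
import math

def well_transmission(E, P0, w, m=1.0, hbar=1.0):
    """T from Eq. (12.25) for free states, E > P0 > 0."""
    if E <= P0:
        raise ValueError("free states require E > P0")
    k2 = math.sqrt(2.0 * m * E) / hbar          # wave number inside the well
    s = math.sin(k2 * w)
    return 1.0 / (1.0 + 0.25 * P0 ** 2 / (E * (E - P0)) * s * s)

def resonance_energy(n, w, m=1.0, hbar=1.0):
    """Energy for which the argument of the sine equals n*pi."""
    return (n * math.pi * hbar / w) ** 2 / (2.0 * m)

P0, w = 10.0, 1.0
for E in (12.0, 15.0, resonance_energy(2, w), 25.0, 30.0):
    print(round(E, 3), well_transmission(E, P0, w))
```

In this example the well is perfectly transparent at the $n = 2$ resonance and partially reflecting on either side of it.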
12.5 Wave Packets and the Momentum Representation


Thus far we have been considering uniform beams of particles such as
electrons; we now turn our attention to a single electron, which may be
regarded as a more or less localized entity. Let us assume that at time
$t = 0$ a single noninteracting electron is located near $x = x_0$ with the
probability $\mathcal{P}\,dx$ of finding it between $x$ and $x + dx$ being given by a
known function such as that of Fig. 12.5. The curve shown is associated
with the particular wave function

\[ \Psi(x,0) = A\,e^{-(x - x_0)^2/2a^2}\,e^{ip_0 x/\hbar} \tag{12.26} \]

where $A = a^{-1/2}\pi^{-1/4}$ if $\Psi$ is normalized so that $\int_{-\infty}^{\infty} \Psi^*\Psi\,dx = 1$. For this
case $a$ is the half-width of the curve at the points where $\mathcal{P}(x,0)$ has $1/e$ of
its maximum value. In the discussion which follows, we shall develop
the theory in terms of a general $\Psi$ and then use Eq. (12.26) as a particular
example for carrying out calculations.

In general, if a particle is associated with a normalized wave
function $\Psi(x,t)$, we can specify the average value of $x$ for the particle by giving
each possible value of $x$ a weight associated with the probability that
the particle is between $x$ and $x + dx$ and integrating over all possible
values of $x$. The value so obtained is called the expectation value for
$x$, which we shall represent by $\langle x \rangle$. At any instant $t$

\[ \langle x \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,x\,\Psi(x,t)\,dx \tag{12.27} \]

Fig. 12.5 The function $\mathcal{P}(x) = \Psi^*\Psi = A^2 e^{-(x - x_0)^2/a^2}$ for a gaussian wave packet,
where $\mathcal{P}(x)$ represents the probability of finding the particle
between $x$ and $x + dx$ at time $t = 0$. The finer curve shows
the real part of $\Psi(x,0)$ for the wave function of Eq. (12.26).
For the particular $\Psi$ of Eq. (12.26), at $t = 0$:

\[ \langle x \rangle = \frac{1}{a\sqrt{\pi}} \int_{-\infty}^{\infty} x\,e^{-(x - x_0)^2/a^2}\,dx = x_0 \tag{12.27a} \]


Any function, such as (12.26), appropriate as a wave function for
a single noninteracting (i.e., potential energy $P = 0$) particle can be
represented as an infinite sum of sinusoidal wave functions through a
Fourier integral. Clearly for any value of $p$ the simple wave

\[ \Psi_p(x,t) = (2\pi\hbar)^{-1/2}\,\phi(p)\,e^{i(px - Et)/\hbar} \tag{12.28} \]

satisfies the Schrödinger equation (11.20) in any region where $P = 0$.
Here $\phi(p)$ is any function of $p$ not containing $x$ or $t$, and $E$ is a function
of $p$, since $E = p^2/2m$. Since the sum of any number of such solutions
is also a solution, we can multiply (12.28) by $dp$ and integrate to obtain
a general solution

\[ \Psi(x,t) = \int_{-\infty}^{\infty} (2\pi\hbar)^{-1/2}\,\phi(p)\,e^{i(px - Et)/\hbar}\,dp \tag{12.29} \]

where $(2\pi\hbar)^{-1/2}$ is a normalizing constant. Then

\[ \Psi(x,0) = \int_{-\infty}^{\infty} (2\pi\hbar)^{-1/2}\,\phi(p)\,e^{ipx/\hbar}\,dp \tag{12.29a} \]

and by the Fourier integral theorem

\[ \phi(p) = (2\pi\hbar)^{-1/2} \int_{-\infty}^{\infty} \Psi(x,0)\,e^{-ipx/\hbar}\,dx \tag{12.30} \]

Note that although $\Psi$ is a function of $x$ and $t$, $\phi(p)$ is not a function of
time since the various momenta in the wave function for a free particle
do not change in time.


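Now we return to the particular $\Psi$ of Eq. (12.26) and write

\[ \phi(p) = \frac{(2\pi\hbar)^{-1/2}}{\sqrt{a\sqrt{\pi}}} \int_{-\infty}^{\infty} e^{-(x - x_0)^2/2a^2}\,e^{ip_0 x/\hbar}\,e^{-ipx/\hbar}\,dx = \sqrt{\frac{a}{\hbar\sqrt{\pi}}}\;e^{-(p - p_0)^2 a^2/2\hbar^2}\,e^{-i(p - p_0)x_0/\hbar} \tag{12.30a} \]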
A knowledge of either $\Psi(x,0)$ or of $\phi(p)$ permits us to determine the other,
so that either $\Psi$ or $\phi(p)$ specifies the state of the particle. Since $\Psi(x,0)$
can be any normalized function, every possible wave function for a
noninteracting particle can be written in the form of Eq. (12.29) by
application of Eq. (12.30). The factor $(2\pi\hbar)^{-1/2}$ in Eq. (12.30) was
introduced so that the momentum wave function $\phi(p)$ is normalized:

\[ \int_{-\infty}^{\infty} \phi^*(p)\,\phi(p)\,dp = 1 \tag{12.31} \]

The interpretation of $\phi^*(p)\phi(p)\,dp$ is analogous to our
interpretation of $\psi^*\psi\,dx$; viz., $\phi^*(p)\phi(p)\,dp$ is the probability that the momentum
of the particle lies between $p$ and $p + dp$.

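From Eqs. (12.30a) and (12.31) we see that the probability
distribution in momentum is of the same form as that in space for $t = 0$,
except that the half-width at $1/e$ of the maximum is $\hbar/a$. The
expectation value for $p$ can be found in either of two ways: (1) in the momentum
representation

\[ \langle p \rangle = \int_{-\infty}^{\infty} \phi^*(p)\,p\,\phi(p)\,dp = \frac{a}{\hbar\sqrt{\pi}} \int_{-\infty}^{\infty} p\,e^{-(p - p_0)^2 a^2/\hbar^2}\,dp = p_0 \tag{12.32} \]

and (2) in the coordinate representation

\[ \langle p \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,p\,\Psi(x,t)\,dx \]

To express $p$ as a function of $x$, use is made of Eq. (11.24), from which

\[ p\Psi = \frac{\hbar}{i}\,\frac{\partial \Psi}{\partial x} \]

This yields

\[ \langle p \rangle = \frac{\hbar}{i} \int_{-\infty}^{\infty} \Psi^*(x,t)\,\frac{\partial \Psi(x,t)}{\partial x}\,dx = p_0 \tag{12.32a} \]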


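We now turn to the question of how the motion of the electron,
described at $t = 0$ by Eq. (12.26), proceeds in time. By substituting
Eq. (12.30a) in (12.29) and making use of the fact that $p^2/2m = E$
we have

\[ \Psi(x,t) = \frac{1}{\sqrt{2\pi\hbar}}\sqrt{\frac{a}{\hbar\sqrt{\pi}}} \int_{-\infty}^{\infty} e^{-(p - p_0)^2 a^2/2\hbar^2}\,e^{-i(p - p_0)x_0/\hbar}\,e^{i(px - p^2 t/2m)/\hbar}\,dp \]

\[ = \frac{1}{\sqrt{2\pi\hbar}}\sqrt{\frac{a}{\hbar\sqrt{\pi}}}\;e^{ip_0 x_0/\hbar} \int_{-\infty}^{\infty} e^{-(p - p_0)^2 a^2/2\hbar^2}\,e^{ip(x - x_0)/\hbar}\,e^{-ip^2 t/2m\hbar}\,dp \]

\[ = \frac{e^{i(p_0 x - p_0^2 t/2m)/\hbar}}{\left[a\sqrt{\pi}\,(1 + i\hbar t/ma^2)\right]^{1/2}} \exp\left[-\frac{(x - x_0 - p_0 t/m)^2\,(1 - i\hbar t/ma^2)}{2a^2\,(1 + \hbar^2 t^2/m^2 a^4)}\right] \tag{12.33} \]

from which the probability distribution is

\[ \mathcal{P}(x,t) = \Psi^*\Psi = \frac{1}{a\sqrt{\pi}}\left(1 + \frac{\hbar^2 t^2}{m^2 a^4}\right)^{-1/2} \exp\left[-\frac{(x - x_0 - p_0 t/m)^2}{a^2\,(1 + \hbar^2 t^2/m^2 a^4)}\right] \tag{12.34} \]

The probability distribution remains gaussian as it moves with a velocity
$p_0/m$, but it broadens as time goes on, corresponding to an increasing
uncertainty of position.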
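
The motion and broadening of the packet can be checked by integrating the probability distribution of Eq. (12.34) numerically. The sketch below is an illustration under stated assumptions: natural units $\hbar = m = 1$, the arbitrary parameter choices $a = 1$, $x_0 = 0$, $p_0 = 1$, a simple midpoint rule, and hypothetical function names.

```python
import math

def packet_density(x, t, a=1.0, x0=0.0, p0=1.0, m=1.0, hbar=1.0):
    """Probability density of Eq. (12.34) for the gaussian packet."""
    spread = 1.0 + (hbar * t / (m * a * a)) ** 2
    centre = x0 + p0 * t / m
    return math.exp(-((x - centre) ** 2) / (a * a * spread)) / (a * math.sqrt(math.pi * spread))

def packet_moments(t, lo=-60.0, hi=60.0, n=24000):
    """Midpoint-rule estimates of the norm, <x>, and the rms width at time t."""
    dx = (hi - lo) / n
    xs = [lo + (i + 0.5) * dx for i in range(n)]
    ws = [packet_density(x, t) * dx for x in xs]
    norm = sum(ws)
    mean = sum(x * w for x, w in zip(xs, ws)) / norm
    var = sum((x - mean) ** 2 * w for x, w in zip(xs, ws)) / norm
    return norm, mean, math.sqrt(var)

for t in (0.0, 1.0, 3.0):
    print(t, packet_moments(t))
```

With these parameters the centre moves as $x_0 + p_0 t/m$ while the rms width grows from $a/\sqrt{2}$ as $(a/\sqrt{2})\sqrt{1 + \hbar^2 t^2/m^2 a^4}$.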

In 1927 Ehrenfest proved that in quantum mechanics a particle
moving in a potential field $P(x)$ obeys the classical laws of motion

\[ \frac{d\langle p \rangle}{dt} = -\left\langle \frac{\partial P}{\partial x} \right\rangle \qquad\text{and}\qquad \langle p \rangle = m\,\frac{d\langle x \rangle}{dt} \]

in the limit that the uncertainties in momentum and position may be ignored.
Thus, in the macroscopic limit wave mechanics yields the same
predictions as newtonian physics. Proof of Ehrenfest's theorem is left to the
problems.

A wave function such as that of Eq. (12.33), used to describe the
motion of a particle for which the position and momentum cannot be
known exactly, is called a wave packet. It is formed by superimposing
an infinite number of sinusoidal waves to yield a resultant which is zero
everywhere except for a restricted region in which the particle is to be
found.


12.6 The Heisenberg Uncertainty Principle 


When the Heisenberg principle of indeterminacy was introduced in Sec.
11.8, no clear definition was given of $\Delta x$ and $\Delta p_x$, the uncertainties in
coordinate and conjugate momentum. Heisenberg originally discovered
the principle from a mathematical formulation, and we now treat it
more quantitatively. It is possible to define $\Delta x$ and $\Delta p_x$ in various
ways; perhaps the most useful definition arises from the following
considerations. Imagine a series of measurements of $x$ to be made on many
identical systems [such as one described by the wave packet of Eq.
(12.26)] with the result that a number of values of $x$ grouped around
$x_0$ are obtained. We define $\Delta x$ by the relation $\Delta x = [\overline{(x - x_0)^2}]^{1/2}$,
which is known as the standard deviation. More generally, in terms of
expectation values,

\[ (\Delta x)^2 = \langle (x - \langle x \rangle)^2 \rangle = \langle x^2 \rangle - \langle 2x\langle x \rangle \rangle + \langle x \rangle^2 = \langle x^2 \rangle - 2\langle x \rangle^2 + \langle x \rangle^2 = \langle x^2 \rangle - \langle x \rangle^2 \tag{12.35a} \]
\[ (\Delta p)^2 = \langle (p - \langle p \rangle)^2 \rangle = \langle p^2 \rangle - \langle p \rangle^2 \tag{12.35b} \]


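Thus $\Delta x$ and $\Delta p$ are the root-mean-square deviations from the mean.
For $\Psi(x,0)$ of Eq. (12.26), we have after using Eq. (12.27a),

\[ (\Delta x)^2 = \langle (x - x_0)^2 \rangle = \frac{1}{a\sqrt{\pi}} \int_{-\infty}^{\infty} (x - x_0)^2\,e^{-(x - x_0)^2/a^2}\,dx = \frac{a^2}{2} \tag{12.36a} \]
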
where we have made use of integral I, of Appendix 4A. Similarly 
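\[ (\Delta p)^2 = \langle (p - p_0)^2 \rangle = \frac{a}{\hbar\sqrt{\pi}} \int_{-\infty}^{\infty} (p - p_0)^2\,e^{-(p - p_0)^2 a^2/\hbar^2}\,dp = \frac{1}{2}\left(\frac{\hbar}{a}\right)^2 \tag{12.36b} \]

Therefore $\Delta p\,\Delta x = \hbar/2$.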

It can be shown that no wave packet can be constructed which
has a smaller value of $\Delta p\,\Delta x$ [as defined in Eqs. (12.35)] than the wave
packet of Eq. (12.26). If we put $x - x_0 = \Delta x = a/\sqrt{2}$ into Eq. (12.26),
we find that $\mathcal{P}(x_0 + \Delta x, 0) = 0.6065\,\mathcal{P}(x_0, 0)$ and that the probability of
the particle's being found between $x_0 - \Delta x$ and $x_0 + \Delta x$ at $t = 0$ is 0.68.
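
The numbers quoted for the gaussian packet are easy to reproduce. The following Python sketch is illustrative only: it assumes natural units $\hbar = 1$, takes $x_0 = 0$, and uses hypothetical function names. It evaluates Eq. (12.36a) by a midpoint rule and obtains the ratio 0.6065 and the probability 0.68 from the exponential and error functions.

```python
import math

def gaussian_uncertainties(a, hbar=1.0):
    """(Delta x, Delta p) for the packet of Eq. (12.26), from Eqs. (12.36)."""
    return a / math.sqrt(2.0), hbar / (a * math.sqrt(2.0))

def variance_x_numeric(a, n=20000, half_range=12.0):
    """Midpoint-rule estimate of the integral in Eq. (12.36a), taking x0 = 0."""
    dx = 2.0 * half_range * a / n
    total = 0.0
    for i in range(n):
        x = -half_range * a + (i + 0.5) * dx
        total += x * x * math.exp(-((x / a) ** 2)) * dx
    return total / (a * math.sqrt(math.pi))

# Density at x0 + Delta x relative to its maximum, and the probability
# of finding the particle within one Delta x of x0.
density_ratio = math.exp(-0.5)
probability_inside = math.erf(1.0 / math.sqrt(2.0))

dx_, dp_ = gaussian_uncertainties(2.0)
print(dx_ * dp_, variance_x_numeric(2.0), density_ratio, probability_inside)
```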


12.7 Probability Stream Density 


A stream of protons or electrons constitutes a flow of charged particles, 
and hence an electric current. How can such a flow be quantitatively 
described in terms of wave functions? 
To find the quantum-mechanical formula for expressing particle
flow we begin with classical relations. Let $\rho(x,y,z,t)$ be the charge
density and consider the charge in a very small volume element $\Delta x\,\Delta y\,\Delta z$
(Fig. 12.6), first for the case that all velocities are along the $x$ axis.
The net charge per unit time entering $\Delta x\,\Delta y\,\Delta z$ at the left is $\rho v_x$
evaluated at $x = a$ times $\Delta y\,\Delta z$, while that leaving at the right is

\[ (\rho v_x)_{a+\Delta x}\,\Delta y\,\Delta z = \left[\rho v_x + \frac{\partial(\rho v_x)}{\partial x}\,\Delta x\right]\Delta y\,\Delta z \]

The charge in $\Delta x\,\Delta y\,\Delta z$ is, of course, $\rho\,\Delta x\,\Delta y\,\Delta z$. If more charge
leaves per second than arrives, $\rho$ decreases in time, and conservation
of charge requires that

\[ (\rho v_x)_{a+\Delta x}\,\Delta y\,\Delta z - (\rho v_x)_a\,\Delta y\,\Delta z = \frac{\partial(\rho v_x)}{\partial x}\,\Delta x\,\Delta y\,\Delta z = -\frac{\partial \rho}{\partial t}\,\Delta x\,\Delta y\,\Delta z \tag{12.37} \]

from which

\[ \frac{\partial(\rho v_x)}{\partial x} = -\frac{\partial \rho}{\partial t} \tag{12.38a} \]

in one dimension. If we extend the reasoning to three dimensions,
we have

\[ \nabla \cdot (\rho\mathbf{v}) = -\frac{\partial \rho}{\partial t} \tag{12.38b} \]

which is the equation of continuity, applicable to any flow process in
which $\rho$ represents the density of something that is conserved.


Fig. 12.6 Charge entering and leaving a small cubical
volume element $\Delta x\,\Delta y\,\Delta z$ for a one-dimensional flow.
To find the quantum-mechanical analogs of Eqs. (12.37) and
(12.38) we identify $\rho$ with $q\Psi^*\Psi$, where $q$ is the charge borne by a
particle and $\Psi^*\Psi\,dx\,dy\,dz$ is the probability at time $t$ of finding a
particle in the volume element $dx\,dy\,dz$. (Note that $\iiint \Psi^*\Psi\,dx\,dy\,dz$
over all space is normalized to give the total number of particles
enclosed.) We assume that there is a quantity $\mathbf{S}$ which represents
the probability stream density of the particles in such a way that $q\mathbf{S}$
plays the role of $\rho\mathbf{v}$ in the equation of continuity. We seek $\mathbf{S}$ in
terms of $\Psi$ by writing the equation of continuity as

\[ \nabla \cdot q\mathbf{S} = -\frac{\partial (q\Psi^*\Psi)}{\partial t} \]

or

\[ \nabla \cdot \mathbf{S} = -\frac{\partial}{\partial t}(\Psi^*\Psi) \tag{12.39} \]

With the aid of Eq. (11.23a), we obtain

\[ \nabla \cdot \mathbf{S} = \frac{i\hbar}{2m}\left(\Psi\,\nabla^2\Psi^* - \Psi^*\,\nabla^2\Psi\right) \tag{12.40} \]

\[ \mathbf{S} = \frac{\hbar}{2im}\left(\Psi^*\,\nabla\Psi - \Psi\,\nabla\Psi^*\right) \tag{12.41} \]


The quantity $\mathbf{S}$ is a vector probability current density for the
flow of particles; if each particle bears charge $q$, the corresponding
probability current density is $q\mathbf{S}$. If $A$ denotes any surface area
enclosing a volume $U$, the integral of the normal component of $\mathbf{S}$
over the surface is equal to the time rate of decrease of probability
of finding a particle within the volume:

\[ \oint_A S_n\,dA = -\frac{\partial}{\partial t}\int_U \Psi^*\Psi\,dU \]

Thus the left-hand integral
represents the rate at which probability is streaming outward across
the closed surface $A$, while the right-hand integral represents the
rate at which the probability is decreasing within the volume.

When $\Psi$ has the form of Eq. (11.13), then $\Psi^* = A^* e^{i(Et - px)/\hbar}$
and

\[ S = \frac{p\,|A|^2}{m} \]

where $S$ is the magnitude of $\mathbf{S}$. Thus the significance of the beam
represented by Eq. (11.13) can be based upon the vector $\mathbf{S}$ by
assuming the number of particles crossing unit area per second is $n = p|A|^2/m$,
where $p/m$ is, of course, the speed of the particles.
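
The result $S = p|A|^2/m$ can be confirmed by evaluating Eq. (12.41) in one dimension with a finite-difference derivative. This Python sketch is illustrative only: it assumes natural units $\hbar = m = 1$, a plane wave of the form $A e^{i(px - Et)/\hbar}$ with $E = p^2/2m$, and hypothetical function names.

```python
import cmath

def plane_wave(x, t, A, p, m=1.0, hbar=1.0):
    """Free-particle wave A*exp[i(px - Et)/hbar] with E = p^2/2m."""
    E = p * p / (2.0 * m)
    return A * cmath.exp(1j * (p * x - E * t) / hbar)

def stream_density(psi, x, t, m=1.0, hbar=1.0, h=1e-5):
    """One-dimensional form of Eq. (12.41), with a central-difference derivative."""
    dpsi = (psi(x + h, t) - psi(x - h, t)) / (2.0 * h)
    value = psi(x, t)
    s = (hbar / (2j * m)) * (value.conjugate() * dpsi - value * dpsi.conjugate())
    return s.real   # the bracket is purely imaginary, so S is real

A, p = 0.5 + 0.5j, 2.0
wave = lambda x, t: plane_wave(x, t, A, p)
print(stream_density(wave, 0.3, 1.2), p * abs(A) ** 2)
```

The two printed numbers agree to within the finite-difference error, independent of the point $(x, t)$ chosen.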


Problems 


1. Show that the sum of the reflection and transmission coefficients 
is equal to unity for the step barrier of Sec. 12.2. 


2. Show that the reflection coefficient $R$ for the step barrier of Sec.
12.2 is given by

\[ R = \left(\frac{\sqrt{1 - P_0/E} - 1}{\sqrt{1 - P_0/E} + 1}\right)^2 \]

and thus that $\sqrt{1 - P_0/E}$ plays the role of relative index of refraction
in optical reflection at normal incidence. Prove that so long as $E$ is a
constant greater than $P_0$, the greater $|P_0|$ is for the step barrier, the
greater the reflection coefficient, whether $P_0$ is positive or negative.

3. Solve Eqs. (12.21) for A in terms of G and then derive Eq. (12.23). 
4. Find the reflection coefficient |B/A|? for the barrier-penetration 
situation of Sec. 12.3 (Fig. 12.3) and show that the sum of the reflection 
and transmission coefficients is equal to 1. 

5. Find the reflection and transmission coefficients for the barrier of
Fig. 12.3 for the case where $E > P_0$. Show that the transmission
coefficient goes to unity for certain values of the energy. Explain how
this can happen when there is reflection from both steps of the barrier.

6. Find the reflection coefficient $R$ for the square well of Fig. 12.4.
Show that $R + T = 1$. Find the values of $E$ for which reflection is
maximum if $P_0$ and $w$ are fixed and calculate the largest value of $R$.


7. A stream of electrons with energy E > P; (Fig. 12.7) is incident 
from $x = -\infty$ on the potential barrier shown.


Fig. 12.7 
(a) Write the general solutions of the Schrödinger wave equation
in regions 1, 2, and 3.

(b) If the amplitude of the incident wave in region 1 is $A$, find
sufficient equations to evaluate all the other constants introduced in
part (a) in terms of $A$ and other given quantities. (It is not necessary
to solve these equations.)

8. Repeat Prob. 7 for the case where Py < E < Pz. 


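9. Show that for any dynamical function $G(x,p_x)$,

\[ \frac{d\langle G \rangle}{dt} = \frac{i}{\hbar} \int_{-\infty}^{\infty} \Psi^*\,(\hat{H}\hat{G} - \hat{G}\hat{H})\,\Psi\,dx \]

Hint: Write $\langle G \rangle$, differentiate, and use

\[ \hat{H}\Psi = \hat{E}\Psi = -\frac{\hbar}{i}\,\frac{\partial \Psi}{\partial t} \]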


10. Ehrenfest’s theorem states that for a particle for which the uncer- 
tainty in position and momentum can be neglected, the quantum- 
mechanical prediction of the motion agrees with the classical prediction. 
Show that this theorem is valid for a one-dimensional motion for which 
P = P(x) by showing (a) that 


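\[ \frac{d\langle x \rangle}{dt} = \frac{\langle p_x \rangle}{m} \]

and (b) that

\[ \frac{d\langle p_x \rangle}{dt} = \left\langle -\frac{dP}{dx} \right\rangle \]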


chapter thirteen 


Wave Mechanics II 
Bound States 


When a particle is restricted to moving in a limited region, quantum
mechanics yields the prediction that only certain discrete values of the energy
are possible. In this chapter the allowed quantum states for the harmonic
oscillator and for a particle in a box are studied, while in Chap. 14 the
problem of an electron bound to a nucleus is treated.


13.1 Stationary or Quantum States 


Consider an electron or other particle bound to some region by a force
field such that the particle cannot escape classically; i.e., its total energy
$E$ is too small for the particle to reach infinity. In the classically
allowed region $E$ is greater than the potential energy $P$, assumed to be a
function of position only; where $P > E$, the particle has negative kinetic
energy $K$. To treat such cases, Schrödinger suggested seeking solutions
of the wave equation which represent standing waves. In such waves
the phase of the vibration is everywhere the same, whereas in running
waves there exists, at any moment, a progression of phase along the
wave train. In a mathematical expression for representing standing
waves, therefore, the time must occur in a separate factor.

As we saw in Sec. 11.9, when P is not a function of the time, we 
may write 


\[ \Psi(x,t) = \psi(x)\,e^{-iEt/\hbar} \tag{13.1} \]

where $\psi$ satisfies the Schrödinger wave equation

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + P(x)\,\psi(x) = E\,\psi(x) \tag{11.22} \]

Not every mathematically correct solution of the Schrödinger wave
equation is physically acceptable. Intimately tied to various physical 
conditions required to give the solutions reasonable meaning are certain 
mathematical conditions which we shall introduce as the need arises. 


13.2 General Solutions for the Square Well 


Let a single particle of mass $m$ be bound in the square potential well
of Fig. 13.1 for which $P = 0$ for $0 < x < L$ and $P = P_0$ for $x < 0$ and
$x > L$. The energy $E$ of the particle is less than $P_0$ if classically the
particle is confined to the well. This is mathematically a particularly
simple case of a bound particle for which a complete solution can readily
be found. The solution proves useful for treating such topics as gases
and electrons in metals, as we shall see. Qualitative features of
bound-particle problems in general are well illustrated by this example.

Fig. 13.1 A square well of depth $P_0$ (vertical axis: energy). Also
shown are the two lowest energy levels and the
associated wave functions $\psi_1$ and $\psi_2$.

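For regions 1 and 3 the Schrödinger wave equation is

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + P_0\psi = E\psi \qquad\text{or}\qquad \frac{d^2\psi}{dx^2} = \frac{2m}{\hbar^2}(P_0 - E)\,\psi \tag{13.2} \]

Since $P_0 > E$, the general solution is

Region 1

\[ \psi_1 = A \exp\left[\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] + B \exp\left[-\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] = A e^{\alpha x} + B e^{-\alpha x} \tag{13.3} \]

Region 3

\[ \psi_3 = G \exp\left[\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] + H \exp\left[-\frac{\sqrt{2m(P_0 - E)}}{\hbar}\,x\right] = G e^{\alpha x} + H e^{-\alpha x} \tag{13.4} \]

where $A$, $B$, $G$, and $H$ are constants to be evaluated and we have put
$\sqrt{2m(P_0 - E)}/\hbar = \alpha$. Note that in regions where $P$ is a constant
greater than the total energy, the solutions of the Schrödinger wave
equation are exponential functions.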


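For region 2 the wave equation is

\[ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + 0 = E\psi \qquad\text{or}\qquad \frac{d^2\psi}{dx^2} = -\frac{2mE}{\hbar^2}\,\psi \tag{13.5} \]

This is the simple-harmonic-motion differential equation, for which the
general solution is

\[ \psi_2 = C' \exp\left(\frac{i\sqrt{2mE}}{\hbar}\,x\right) + D' \exp\left(-\frac{i\sqrt{2mE}}{\hbar}\,x\right) = C \sin\frac{\sqrt{2mE}}{\hbar}\,x + D \cos\frac{\sqrt{2mE}}{\hbar}\,x \]

\[ = C' e^{ikx} + D' e^{-ikx} = C \sin kx + D \cos kx \tag{13.6} \]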
where $C'$, $D'$, $C$, and $D$ are constants and $k = \sqrt{2mE}/\hbar$. Observe that
for regions of constant $P$ less than $E$, the general solutions of the wave
equation are oscillating functions which may be expressed equally well
in terms of imaginary exponentials or by trigonometric functions.

We have postulated (Sec. 11.7) that the probability of finding our
particle between $x$ and $x + dx$ at any time $t$ is given by $\Psi^*(x,t)\Psi(x,t)\,dx$.
If we sum this probability over all possible values of $x$, we have for
this problem the normalization condition $\int_{-\infty}^{\infty} \Psi^*\Psi\,dx = 1$, since there
is only one particle and it is somewhere. This fact enables us to show
that $B$ and $G$ of Eqs. (13.3) and (13.4) must be zero. Since $Be^{-\alpha x}$ in
region 1 is an exponentially increasing function as $x$ goes from 0 to $-\infty$,
the normalization condition can hold only if $B = 0$; a similar argument
shows that $G = 0$. Thus in both regions 1 and 3 the only allowed wave
functions are exponentially decreasing as we leave the potential well:
$\psi_1 = Ae^{\alpha x}$ and $\psi_3 = He^{-\alpha x}$.

If $\Psi^*\Psi\,dx = \psi^*\psi\,dx$ is to represent the probability of finding the
particle between $x$ and $x + dx$, the reasonable expectation that this
probability be finite, single-valued, and continuous can be met by
imposing on $\psi$ the following condition:

1. $\psi$ is finite, single-valued, and continuous.

Further, we shall find that if the momentum of the particle is to
be finite and continuous, the first derivatives of $\psi$ must also be finite
and continuous. It turns out that the momentum does change
discontinuously at an infinite potential barrier, a not uncommon situation
in which an unrealizable physical condition forces an unnatural
mathematical requirement. We therefore add the following condition:

2. $d\psi/dx$ is continuous (except when $P$ goes to infinity).

When we apply conditions 1 and 2 at the boundaries $x = 0$ and
$x = L$ for the particle in the square well, we obtain the equations

\[ A = D \tag{13.7a} \]
\[ \alpha A = kC \tag{13.7b} \]
\[ C \sin kL + D \cos kL = H e^{-\alpha L} \tag{13.7c} \]
\[ kC \cos kL - kD \sin kL = -\alpha H e^{-\alpha L} \tag{13.7d} \]

These equations, together with the normalization condition, permit the
determination of all constants. It is simplest to consider first the
artificial but highly important case of an infinite well, for which $P_0$ is equal
to infinity.
13.3 The Infinite Square Well


When $P_0$ goes to infinity, $\alpha$ becomes infinite. If Eqs. (13.7) are to be
satisfied, $kC$ must be finite, and so $A$ must be zero. Since $A = D$,
$D = 0$. Since $He^{-\alpha L}$ is zero, $C \sin kL = 0$. [Equation (13.7d) is not
applicable.]

If $C$ were zero, $\psi$ would be zero everywhere, which is incompatible
with the normalization condition, Eq. (11.15a). Therefore, we
conclude that $\sin kL = 0$, or $kL = n\pi$, where $n$ is an integer. Since
$k = \sqrt{2mE}/\hbar$, acceptable solutions of the Schrödinger wave equation exist
only for certain discrete values of the energy satisfying the relation

\[ E_n = \frac{n^2 h^2}{8mL^2} \tag{13.8} \]


An important prediction of quantum mechanics, quite at variance 
with classical physics, is that the particle may have for its energy only 
certain specific values. These are known as eigenvalues (eigen means
proper or characteristic); only the eigenvalues permit acceptable
solutions of the Schrödinger equation. For the infinite square well there
is an eigenvalue given by Eq. (13.8) for every integer $n$, known as the
quantum number.

Of special interest is the fact that the energy of the particle cannot
be zero, since $n = 0$ gives $\psi = 0$ everywhere, hence no particle. The
lowest possible energy, or zero-point energy, in this case corresponds to
$n = 1$. This is directly connected to the Heisenberg uncertainty principle,
since if the position of the particle is certain within $\Delta x = L/2$, there
must be an uncertainty in the momentum not less than that given by
Eq. (11.17). In the first eigenstate, $E_1 = h^2/8mL^2 = p_1^2/2m$, from
which $p_1$, the momentum in the first eigenstate, is $\pm h/2L$. Since the
momentum can be in either direction in the well, the uncertainty $\Delta p_x$ is
essentially $h/2L$. Therefore, $\Delta x\,\Delta p_x = h/4$, a result entirely compatible
with the uncertainty principle.
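
To give Eq. (13.8) a sense of scale, the sketch below evaluates the first few levels for an electron in a well 1 nm wide. The well width is an arbitrary example, not taken from the text; the physical constants are the SI defined or recommended values, and the function name is hypothetical.

```python
H_PLANCK = 6.62607015e-34       # Planck constant h, J s
M_ELECTRON = 9.1093837015e-31   # electron mass, kg
EV = 1.602176634e-19            # joules per electron volt

def well_energy(n, L, m=M_ELECTRON):
    """E_n = n^2 h^2 / (8 m L^2) of Eq. (13.8); n must be a positive integer."""
    if n < 1:
        raise ValueError("n = 0 gives psi = 0 everywhere; use n >= 1")
    return n * n * H_PLANCK ** 2 / (8.0 * m * L * L)

L = 1.0e-9  # assumed well width of 1 nm
for n in (1, 2, 3):
    print(n, well_energy(n, L) / EV, "eV")

# Zero-point estimate from the text: (L/2) * (h/2L) = h/4
print((L / 2.0) * (H_PLANCK / (2.0 * L)) / H_PLANCK)
```

For this width the zero-point energy comes out at a few tenths of an electron volt, and the levels scale as $n^2$.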


Corresponding to each eigenvalue (and quantum number $n$) there
is an eigenfunction given for $0 < x < L$ by

\[ \psi_n = C_n \sin k_n x = C_n \sin\frac{n\pi x}{L} \tag{13.9} \]

and for $-\infty < x \le 0$ and $x \ge L$ by $\psi_n = 0$.
From the normalization condition (11.15a) we can obtain $C_n$, since

\[ \int_{-\infty}^{\infty} \psi^*\psi\,dx = \int_0^L C_n^2 \sin^2\frac{n\pi x}{L}\,dx = \frac{C_n^2 L}{2} = 1 \]

whence $C_n = \sqrt{2/L}$.
Therefore the normalized eigenfunctions are

\[ \psi_n = \sqrt{\frac{2}{L}}\,\sin\frac{n\pi x}{L} \qquad\text{for } 0 < x < L \tag{13.9a} \]


The eigenfunctions correspond to those of a vibrating string of length L 
and fixed ends. A few of them are shown in Fig. 13.2, along with the 
corresponding eigenvalues. A glance at the first eigenfunction shows
that the probability $|\psi|^2\,dx$ of finding the particle in a small element
of length $dx$ is much greater at the center of the well than at the edges.
This again is in sharp contrast to the classical case, for which the particle 
would move between barriers with constant speed and would be equally 
likely to be found near the walls or near the center. It is a surprising 
prediction of quantum mechanics that the particle should “avoid” the 
walls. 

As one goes to higher quantum number $n$, $\psi$ oscillates more rapidly
and thus the probability of the particle's being found in any length $dx$
becomes more uniform across the well in accord with the correspondence
principle. Even so, $\psi$ remains zero at the walls. It should be noted
that the number of zeros of $\psi$ other than those at the boundary is given
by $n - 1$.

Fig. 13.2 The first three energy levels and
the corresponding wave functions for an infinite
square well.
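
The contrast with the classical uniform distribution, and the node count $n - 1$, can both be checked from the normalized eigenfunctions of Eq. (13.9a). The Python sketch below is an illustration with hypothetical function names; the well width is set to $L = 1$ for convenience, and the probability is obtained from the elementary antiderivative of $(2/L)\sin^2(n\pi x/L)$.

```python
import math

def probability_between(n, x1, x2, L=1.0):
    """Probability of finding the particle in x1 < x < x2 for the state n."""
    def antiderivative(x):
        return x / L - math.sin(2.0 * n * math.pi * x / L) / (2.0 * n * math.pi)
    return antiderivative(x2) - antiderivative(x1)

def interior_nodes(n, L=1.0, samples=10000):
    """Count sign changes of sin(n*pi*x/L) at sample points inside the well."""
    count = 0
    previous = math.sin(n * math.pi * (0.5 / samples))
    for i in range(1, samples):
        current = math.sin(n * math.pi * (i + 0.5) / samples)
        if previous * current < 0.0:
            count += 1
        previous = current
    return count

# Middle third of the well: the classical value is 1/3 for every energy.
for n in (1, 2, 5, 31):
    print(n, probability_between(n, 1.0 / 3.0, 2.0 / 3.0), interior_nodes(n))
```

For $n = 1$ the middle third of the well holds well over half of the probability, while for large $n$ the value approaches the classical one third.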


13.4 The Finite Square Well 


We have argued in Sec. 13.2 that in regions 1 and 3 the wave function
is given by $\psi_1 = Ae^{\alpha x}$ and $\psi_3 = He^{-\alpha x}$, declining exponentials as we go
away from the well. Inside the well $\psi_2$ is represented by oscillating
trigonometric functions. If $\psi$ and its derivative are continuous at the
boundaries, the trigonometric function must be decreasing in magnitude
as the boundary is approached (Fig. 13.1). When $P_0$ is large compared
to $E$, Eqs. (13.7) require that $A$, $D$, and $H$ be small but not zero. The
resulting eigenfunctions bear close resemblance to those for an infinite 
well, except that there is a small exponential tail outside the well, showing 
some penetration of the particle into the classically forbidden regions of 
negative kinetic energy. 

The wavelengths of the eigenfunctions for the finite well are
somewhat greater than those for the infinite well. Longer wavelength
corresponds to lower momentum and to lower energy. Thus Eq. (13.8)
predicts somewhat too great an energy for each eigenvalue of the finite
well.

As the difference $P_0 - E$ decreases, penetration into the classically
forbidden regions increases; that is, $A$ and $H$ of Eqs. (13.7) increase,
and $\alpha$ decreases. Further, the number of bound states decreases, since
as $n$ increases, the energy of the bound state rises, and for a
square well with $P_0 - E$ finite, the number of bound states is finite.
Once $E$ exceeds $P_0$, the particle is free, and any energy is an eigenvalue.

13.5 Expectation Values 


In problems such as ones involving an electron in a square well (or bound 
to a proton to form a hydrogen atom), 
definite values for some quantities, 
but not for others, e.g. 


\[ \langle x \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,x\,\Psi(x,t)\,dx \]

where $\Psi$ is a normalized wave function. In a similar way the
expectation value of any function $f(x)$ is given by

\[ \langle f(x) \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,f(x)\,\Psi(x,t)\,dx \tag{13.10} \]

Thus, if the potential energy is a function $P(x)$, the expectation value
for $P$ is

\[ \langle P \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,P(x)\,\Psi(x,t)\,dx \tag{13.10a} \]

As we saw in Sec. 12.5, to obtain the expectation value of the
momentum we must express $p_x$ as a function of $x$, but the uncertainty
principle specifically denies that we can know $p_x$ and $x$ exactly and
simultaneously. To determine $\langle p_x \rangle$, we replace $f(x)$ by the
operator $\hat{p}_x$ [Eq. (11.24a)] to obtain

\[ \langle p_x \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\,\hat{p}_x\,\Psi(x,t)\,dx = \int_{-\infty}^{\infty} \Psi^*(x,t)\,\frac{\hbar}{i}\,\frac{\partial \Psi(x,t)}{\partial x}\,dx \tag{13.11} \]


13.6 Differential Operators 


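In three dimensions the operator for the momentum $\mathbf{p}$ is the vector

\[ \hat{\mathbf{p}} = \frac{\hbar}{i}\,\nabla \tag{13.12a} \]

and

\[ \hat{p}^2 = -\hbar^2\,\nabla^2 \tag{13.12b} \]

In the nonrelativistic case the kinetic energy $K$ of a particle is
$p^2/2m$, and the operator for $K$ is

\[ \hat{K} = -\frac{\hbar^2}{2m}\,\nabla^2 \tag{13.13} \]

For a one-dimensional problem

\[ \langle K \rangle = \int_{-\infty}^{\infty} \Psi^*(x,t)\left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial x^2}\right)\Psi(x,t)\,dx \tag{13.14} \]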
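which can be readily integrated by parts to obtain

\[ \langle K \rangle = -\frac{\hbar^2}{2m}\left(\left.\Psi^*\frac{\partial \Psi}{\partial x}\right|_{-\infty}^{\infty} - \int_{-\infty}^{\infty} \frac{\partial \Psi}{\partial x}\,\frac{\partial \Psi^*}{\partial x}\,dx\right) \]

The normalization condition $\int_{-\infty}^{\infty} \Psi^*\Psi\,dx = 1$ requires that $\Psi^*$ be zero
at $x = \pm\infty$, and therefore

\[ \langle K \rangle = \frac{\hbar^2}{2m} \int_{-\infty}^{\infty} \frac{\partial \Psi^*}{\partial x}\,\frac{\partial \Psi}{\partial x}\,dx \tag{13.15} \]

Thus the expectation value of the kinetic energy is proportional to the
integral of $|\partial\Psi/\partial x|^2$.

The operator for angular momentum is of great interest in atomic
theory. The classical expression for the angular momentum $A_z$ of a
particle about the $z$ axis is

\[ A_z = (\mathbf{r} \times \mathbf{p})_z = x p_y - y p_x \]

Replacement of $p_y$ and $p_x$ by their operators gives as the corresponding
operator in wave mechanics

\[ \hat{A}_z = \frac{\hbar}{i}\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) \tag{13.16} \]

If polar coordinates are introduced with the $z$ axis as axis, then

\[ x = r \sin\theta \cos\phi \qquad y = r \sin\theta \sin\phi \qquad z = r \cos\theta \]

\[ \frac{\partial}{\partial \phi} = \frac{\partial x}{\partial \phi}\frac{\partial}{\partial x} + \frac{\partial y}{\partial \phi}\frac{\partial}{\partial y} + \frac{\partial z}{\partial \phi}\frac{\partial}{\partial z} = x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x} \]

Thus the operator for the $z$ component of angular momentum can also be
written

\[ \hat{A}_z = \frac{\hbar}{i}\,\frac{\partial}{\partial \phi} \tag{13.16a} \]

Analogous expressions are found for the $x$ and $y$ components. The
operator for the square of the resul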
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In wave mechanics physical magnitudes in general are represented 
by appropriate operators. The ultimate test of the validity of the opera- 
tor assumed to represent a given magnitude will lie in a demonstration 
that the values of this magnitude as predicted by wave mechanics are 
in agreement with observation. The accepted condition that any phys- 
ical magnitude Q has a definite value or is quantized is that, if Ô denotes 
the corresponding wave-mechanical operator and Y the wave function 
for the system to which Q belongs, then 


Ov = Q.¥ (13.18) 


where Q, is a real number, the value of Q. Solutions of this equation 
satisfying appropriate boundary conditions Tepresent stationary or quan- 
tum states for the magnitude Q; and the sct of allowed values of Q, 
includes the only correct values that can be obtained in an experimental 
measurement of Q. 

The requirement that the result of a measurement be a real number 
admits only real solutions for expectation values and eigenvalues. This 
limits the type of function which can serve as operators for physical 
quantities to those which satisfy the equation (Q) = (Q)*; that in turn 
imposes the condition that J¥*OW dz dy dz = J¥(QW)* dx dy dz. An 
operator which satisfies the latter equation is said to be hermilian; 
operators for physical observables in quantum mechanics must be 
hermitian to guarantee that the expectation value of the observable 
will be real. 

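Two operators $\hat{Q}$ and $\hat{F}$ are said to commute when
$\hat{Q}\hat{F} - \hat{F}\hat{Q} = 0$. Hermitian operators do not in
general commute; for example,

$$(\hat{x}\hat{p}_x - \hat{p}_x\hat{x})\psi = \frac{\hbar}{i}\left[x\frac{\partial\psi}{\partial x} - \frac{\partial(x\psi)}{\partial x}\right] = -\frac{\hbar}{i}\psi$$

The operator $\hat{C} = \hat{Q}\hat{F} - \hat{F}\hat{Q}$ is the
commutator operator of $\hat{Q}$ and $\hat{F}$. The equation above tells
us that $\hat{C}(x,p_x) = -\hbar/i$. On the other hand
$\hat{C}(y,p_x) = 0$, so that y and $p_x$ do indeed commute. The
commutator operator $\hat{C}(Q,F)$ is often written $[Q,F]$.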
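
The two commutators quoted above can be checked numerically. The following is a minimal sketch rather than anything from the text: it assumes units in which ħ = 1, an arbitrary smooth test function, and a central-difference approximation to the derivative. The names `p_x`, `commutator_x_px`, and `commutator_y_px` are illustrative only.

```python
import cmath

HBAR = 1.0   # assumption: natural units with hbar = 1
STEP = 1e-4  # assumed finite-difference step


def psi(x, y):
    """Arbitrary smooth test function (hypothetical, chosen only for this check)."""
    return cmath.exp(-(x * x + y * y) / 2 + 0.7j * x)


def p_x(f, x, y):
    """Apply p_x = (hbar/i) d/dx to f using a central difference."""
    return (HBAR / 1j) * (f(x + STEP, y) - f(x - STEP, y)) / (2 * STEP)


def commutator_x_px(f, x, y):
    """Evaluate (x p_x - p_x x) f at the point (x, y)."""
    return x * p_x(f, x, y) - p_x(lambda u, v: u * f(u, v), x, y)


def commutator_y_px(f, x, y):
    """Evaluate (y p_x - p_x y) f at the point (x, y)."""
    return y * p_x(f, x, y) - p_x(lambda u, v: v * f(u, v), x, y)


x0, y0 = 0.3, -0.5
c_xp = commutator_x_px(psi, x0, y0)
c_yp = commutator_y_px(psi, x0, y0)
expected = -(HBAR / 1j) * psi(x0, y0)  # C(x, p_x) psi = -(hbar/i) psi

print(abs(c_xp - expected))  # small, limited by the finite-difference error
print(abs(c_yp))             # essentially zero: y and p_x commute
```

Since $-\hbar/i = i\hbar$, the first result is the familiar statement $[x, p_x] = i\hbar$.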


13.7 The Rectangular Box 


As a first example of a three-dimensional problem in wave mechanics,
consider a rectangular box (Fig. 13.3) with sides of lengths $L_x$,
$L_y$, $L_z$. Let the potential energy P inside the box be zero and that
outside be infinite. Then outside the box $\psi = 0$, while inside

$$-\frac{\hbar^2}{2m}\nabla^2\psi + 0 = E\psi \tag{13.19a}$$

or

$$\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2} = -\frac{2mE}{\hbar^2}\psi \tag{13.19b}$$

Fig. 13.3 A rectangular box with sides $L_x$, $L_y$, $L_z$.

If we let $\sqrt{2mE}/\hbar = k$, assume that $\psi(x,y,z)$ is separable
so that $\psi(x,y,z) = X(x)Y(y)Z(z)$, and divide Eq. (13.19b) by $\psi$,
we obtain

$$\frac{1}{X}\frac{d^2X}{dx^2} + \frac{1}{Y}\frac{d^2Y}{dy^2} + \frac{1}{Z}\frac{d^2Z}{dz^2} = -k^2 \tag{13.20}$$


Here we have the sum of a function of x alone, a function of y alone,
and a function of z alone equal to a constant. This can be true only if
each term on the left is separately constant, and so we write

$$\frac{1}{X}\frac{d^2X}{dx^2} = -k_x^2 \tag{13.21a}$$

$$\frac{1}{Y}\frac{d^2Y}{dy^2} = -k_y^2 \tag{13.21b}$$

$$\frac{1}{Z}\frac{d^2Z}{dz^2} = -k_z^2 \tag{13.21c}$$

where the negative sign is chosen for the constants for a reason
explained below. Of course, $k_x^2 + k_y^2 + k_z^2 = k^2$ to satisfy
Eq. (13.20). A general solution for (13.21a) is

$$X = A_x\sin k_x x + B_x\cos k_x x \tag{13.22}$$


where $A_x$ and $B_x$ are constants. At x = 0 continuity of $\psi$
requires that X(0) be 0, and so $B_x$ must be zero. Also $X(L_x) = 0$,
so that

$$A_x\sin k_x L_x = 0 \tag{13.23}$$

from which it follows that $k_x L_x = n_x\pi$, where $n_x$ is an
integer. Thus we have

$$X = A_x\sin\frac{n_x\pi x}{L_x} \tag{13.24a}$$

We can now see why the separation constant was written $-k_x^2$. If the
constant had been chosen positive, the solution for (13.21a) would have
been exponential functions, and it would have been impossible to satisfy
the conditions that X(0) = 0 and $X(L_x) = 0$. Similar developments
for Y and Z give

$$Y = A_y\sin\frac{n_y\pi y}{L_y} \tag{13.24b}$$

$$Z = A_z\sin\frac{n_z\pi z}{L_z} \tag{13.24c}$$

and

$$\psi = A\sin\frac{n_x\pi x}{L_x}\sin\frac{n_y\pi y}{L_y}\sin\frac{n_z\pi z}{L_z} \tag{13.25}$$

where $A = \sqrt{8/L_xL_yL_z}$ if the normalization is for a single
particle.
The eigenvalues for the energy may be found from

$$k^2 = k_x^2 + k_y^2 + k_z^2 = \pi^2\left(\frac{n_x^2}{L_x^2} + \frac{n_y^2}{L_y^2} + \frac{n_z^2}{L_z^2}\right) = \frac{2mE}{\hbar^2}$$

or

$$E = \frac{h^2}{8m}\left(\frac{n_x^2}{L_x^2} + \frac{n_y^2}{L_y^2} + \frac{n_z^2}{L_z^2}\right) \tag{13.26}$$

The lowest possible energy, or ground-state energy, corresponds to
$n_x = 1$, $n_y = 1$, $n_z = 1$, and it depends on the dimensions of the
box. For an electron in a box with sides of the order of 1 Å, the
smallest energy is of the order of electron volts.

For a cubical box $L_x = L_y = L_z = L$, and the energy eigenvalues
are

$$E = \frac{h^2}{8mL^2}\left(n_x^2 + n_y^2 + n_z^2\right) \tag{13.26a}$$

The ground-state energy $E_{111}$ is $3h^2/8mL^2$. The next lowest
energy corresponds to $n_x = 2$, $n_y = 1$, $n_z = 1$, or $n_x = 1$,
$n_y = 2$, $n_z = 1$, or $n_x = 1$, $n_y = 1$, $n_z = 2$. Thus there are
three eigenfunctions, Eq. (13.25), which give rise to the same energy,
and we say the first excited state is triply degenerate. In general, two
or more eigenfunctions are degenerate when they lead to the same
eigenvalue for the energy. A ninefold degeneracy occurs for the energy
$38h^2/8mL^2$, which can be associated with $n_xn_yn_z$ given by (611),
(161), (116), (532), (523), (352), (325), (253), (235). The degeneracy
can be removed for these low-lying states by destroying the cubical
symmetry of the box; if $L_x$, $L_y$, and $L_z$ are slightly different,
the energies for these states no longer coincide.
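
The degeneracies quoted for the cubical box can be tallied by brute force. The sketch below is illustrative only: it groups the triples $(n_x, n_y, n_z)$ by $n_x^2 + n_y^2 + n_z^2$, which is the energy of Eq. (13.26a) in units of $h^2/8mL^2$. The function name `cubic_box_levels` and the cutoff `n_max` are assumptions made for the example.

```python
from collections import defaultdict


def cubic_box_levels(n_max):
    """Group quantum-number triples by energy in units of h^2 / (8 m L^2).

    The grouping is complete for every level below (n_max + 1)**2 + 2.
    """
    levels = defaultdict(list)
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            for nz in range(1, n_max + 1):
                levels[nx * nx + ny * ny + nz * nz].append((nx, ny, nz))
    return levels


levels = cubic_box_levels(7)

# Five lowest levels: energy (in units of h^2/8mL^2) and degeneracy.
for energy in sorted(levels)[:5]:
    print(energy, len(levels[energy]), levels[energy])

# The level singled out in the text.
print(38, len(levels[38]), sorted(levels[38], reverse=True))
```

The ground level (energy 3) is nondegenerate, the first excited level (energy 6) is threefold, and the level at 38 collects the nine triples listed above.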


13.8 The Harmonic Oscillator 


A problem of great importance is that of the simple harmonic oscillator, 
not only because it is one of the relatively few problems which can be 
solved exactly, but also because the methods used have broad application. 
There are many oscillating systems in nature which can be treated (at 
least approximately) in terms of simple harmonic oscillations. 

Consider a particle of mass m bound to the origin by a force
$F = -\beta r$, where $\beta$ is a constant. For the one-dimensional
problem, we choose the potential energy P = 0 at the origin and then the
potential-energy function $P(x) = \tfrac{1}{2}\beta x^2$ (Fig. 13.4).
The Schrödinger wave equation
$\hat{H}\psi = (\hat{p}^2/2m + P)\psi = E\psi$ for this problem becomes

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \frac{1}{2}\beta x^2\psi = E\psi \tag{13.27}$$

The solution of Eq. (13.27) is carried out in Appendix 13A, where it
is shown that Eq. (13.27) can be satisfied only for discrete energies

$$E_n = \left(n + \tfrac{1}{2}\right)h\nu \tag{13.28}$$

where n = 0, 1, 2, . . . and $\nu$ is the classical frequency of the
oscillator given by $\sqrt{\beta/m}/2\pi$. As is usual in quantum
mechanics, bound states for the particle exist only for discrete values
of the energy. The ground (or lowest) energy state of the harmonic
oscillator has n = 0 and $E_0 = \tfrac{1}{2}h\nu$. This energy, the
zero-point energy, has no direct classical analog. The requirement of
finite zero-point energy for a harmonic oscillator is of importance in
thermodynamics and is an inevitable consequence of the Heisenberg
uncertainty principle.

Fig. 13.4 The potential energy of a simple harmonic oscillator as a
function of displacement (heavy curve) and the first four energy levels
together with the associated wave functions.

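Corresponding to each eigenvalue $E_n$ is the normalized eigenfunction

$$\psi_n(x) = \left(\frac{\alpha}{\sqrt{\pi}\,2^n n!}\right)^{1/2} e^{-\alpha^2x^2/2} H_n(\alpha x) \tag{13.29}$$

where $\alpha = (m\beta/\hbar^2)^{1/4}$ and $H_n(\alpha x)$ represents
the nth Hermite polynomial (Appendix 13A). The first five Hermite
polynomials are

$$H_0(\alpha x) = 1 \qquad H_1(\alpha x) = 2\alpha x \qquad H_2(\alpha x) = 4\alpha^2x^2 - 2$$
$$H_3(\alpha x) = 8\alpha^3x^3 - 12\alpha x \qquad H_4(\alpha x) = 16\alpha^4x^4 - 48\alpha^2x^2 + 12 \tag{13.30}$$

and the first two eigenfunctions become

$$\psi_0(x) = \left(\frac{\alpha}{\sqrt{\pi}}\right)^{1/2} e^{-\alpha^2x^2/2} \tag{13.29a}$$

$$\psi_1(x) = \left(\frac{\alpha}{2\sqrt{\pi}}\right)^{1/2} 2\alpha x\, e^{-\alpha^2x^2/2} \tag{13.29b}$$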
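
The polynomials of Eq. (13.30) can be generated mechanically. The sketch below is illustrative and uses the standard recurrence $H_{k+1}(y) = 2yH_k(y) - 2kH_{k-1}(y)$ with $y = \alpha x$; this recurrence is a swapped-in convenience equivalent to the identity of Prob. 19(a), not the method of Appendix 13A. The function name `hermite_coefficients` is hypothetical.

```python
def hermite_coefficients(n):
    """Coefficients of H_n(y), lowest power first.

    Built from H_0 = 1, H_1 = 2y and H_{k+1} = 2y H_k - 2k H_{k-1}.
    """
    h_prev, h = [1], [0, 2]
    if n == 0:
        return h_prev
    for k in range(1, n):
        nxt = [0] + [2 * c for c in h]      # 2y * H_k shifts every power up by one
        for i, c in enumerate(h_prev):
            nxt[i] -= 2 * k * c             # subtract 2k * H_{k-1}
        h_prev, h = h, nxt
    return h


for n in range(5):
    print(n, hermite_coefficients(n))
```

For n = 4 the list is `[12, 0, -48, 0, 16]`, that is, $16y^4 - 48y^2 + 12$, in agreement with Eq. (13.30).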
The first four eigenfunctions are sketched in Fig. 13.4. Two features
deserve special mention. First is the penetration of the particle
into the classically forbidden region. Second is the fact that,
particularly for the lower states, the probability of finding the
particle in any element dx of the allowed range is radically different
from the classical predictions. Classically it would be most likely to
find the particle near the end of its motion, where its speed is
minimum, and least likely at the equilibrium position. Quantum mechanics
however predicts that for the ground state the particle is most likely
to be found near the equilibrium position. As one goes to higher values
of n the probability distribution gradually becomes closer to the
classical one, as is shown in Fig. 13.5.

From Fig. 13.4 as well as from Eqs. (13.29) and (13.30) it is clear
that $\psi$ is an even function [$\psi(x) = \psi(-x)$] when n is even
and an odd function [$\psi(x) = -\psi(-x)$] when n is odd. Such symmetry
is typical of standing wave eigenfunctions arising from a symmetrical
potential energy function; the $\psi$ functions have alternate even and
odd symmetry relative to a reflection of the coordinate at the "force"
center. The functions are said to exhibit even or odd parity according
as they are even or odd. If in Secs. 13.3 and 13.4 the origin had been
located at the center of the potential well, the resulting
eigenfunctions would also have been alternately of even and odd parity.

Fig. 13.5 The probability density function $|\psi_{10}|^2$ for the
n = 10 state of a harmonic oscillator. The vertical lines show the
classical amplitude for the energy $(10 + \tfrac{1}{2})h\nu$, and the
dashed curve shows the classical probability distribution function for
finding the particle between x and x + dx. (Modified from L. Pauling
and E. B. Wilson, Jr., "Introduction to Quantum Mechanics." Copyright
1935. McGraw-Hill Book Company. Used by permission.)


13.9 Properties of Eigenfunctions 


Let $\psi_n(x)$ and $\psi_j(x)$ be any two normalized nondegenerate
eigenfunctions of the Schrödinger equation for a bound particle with
corresponding eigenvalues $E_n$ and $E_j$. The eigenfunctions have the
important property that

$$\int_{-\infty}^{\infty}\psi_j^*\psi_n\,dx = 0 \qquad \text{if } j \neq n \tag{13.31}$$

and are therefore said to be orthogonal. The orthogonality relation and
the normalization condition are often combined into the relation

$$\int_{-\infty}^{\infty}\psi_j^*\psi_n\,dx = \delta_{jn}$$

where $\delta_{jn}$, known as the Kronecker delta, is 0 for $j \neq n$
and 1 for j = n. Proof of the orthogonality relation appears in
Appendix 13B.

Each of the wave functions $\Psi_n(x,t) = \psi_n(x)e^{-iE_nt/\hbar}$ is
a particular solution of the time-dependent Schrödinger equation, and
any linear combination of them is also a solution. Thus

$$\Psi(x,t) = a_0\psi_0e^{-iE_0t/\hbar} + a_1\psi_1e^{-iE_1t/\hbar} + \cdots = \sum_n a_n\psi_ne^{-iE_nt/\hbar} \tag{13.32}$$

is a general solution; here the a's are constants which may be complex.
An important property of the allowed solutions $\Psi(x,y,z,t)$ of the
Schrödinger equation [and indeed of the more general Eq. (13.18)] is
that they form a complete set of functions in terms of which any given
wave function can be expanded in the analog of a Fourier series; thus

$$\Psi = \sum_n a_n\Psi_n \tag{13.32a}$$


In some cases a continuum of possible values of E occurs, and then the
sum must be supplemented or even replaced entirely by an integral
analogous to a Fourier integral. If all functions are properly
normalized, $|a_n|^2$ equals the probability of obtaining the value
$E_n$ as the result of a measurement of E performed upon a system
initially in the condition represented by $\Psi$. It may also be
remarked that if the entire setup and measurement are repeated many
times, the average of the observed values of E will be very close to the
expectation of E, defined as

$$\langle E\rangle = \int\Psi^*\hat{E}\Psi\,dq = \int\Psi^*\hat{H}\Psi\,dq$$

where dq includes all the coordinates involved in $\Psi$. This is easily
verified by substituting the series for $\Psi$ and recalling that
$\int\Psi_j^*\Psi_n\,dq = \delta_{jn}$.
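
As a concrete illustration of the expansion coefficients, the sketch below expands a trial state in the eigenfunctions of the infinite square well of width L and checks that the probabilities $|a_n|^2$ sum to unity. Everything specific here is an assumption made for the example: the trial function $\sqrt{30/L^5}\,x(L-x)$, the trapezoidal integrator, the cutoff at n = 39, and energies measured in units of the ground-state energy $E_1$, so that $E_n = n^2$.

```python
import math

L = 1.0  # well width in arbitrary units (assumption)


def phi(n, x):
    """Normalized infinite-well eigenfunction on 0 < x < L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def trial(x):
    """Hypothetical normalized trial state, not taken from the text."""
    return math.sqrt(30.0 / L**5) * x * (L - x)


def integrate(f, a, b, steps=2000):
    """Composite trapezoidal rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b)) + sum(f(a + i * h) for i in range(1, steps))
    return total * h


def coefficient(n):
    """Expansion coefficient a_n = integral of phi_n * trial."""
    return integrate(lambda x: phi(n, x) * trial(x), 0.0, L)


probabilities = [coefficient(n) ** 2 for n in range(1, 40)]
total_probability = sum(probabilities)
# Average energy in units of E_1, using E_n = n^2 for the infinite well.
mean_energy = sum(p * n * n for n, p in enumerate(probabilities, start=1))

print(probabilities[0], total_probability, mean_energy)
```

For this trial state nearly all of the probability sits in n = 1, and the weighted sum of the $E_n$ reproduces the expectation value of the energy, as the paragraph above asserts.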

In one dimension the probability of finding a particle in a given
element of length dx is $\Psi^*\Psi\,dx$. When the particle is in a
definite quantum state j, $\Psi_j^*\Psi_j = \psi_j^*\psi_j$ = const, so
that the probability distribution does not change with time. However,
when more than one $a_n$ of Eq. (13.32) is nonzero, the particle is in a
mixed state represented by the wave packet of (13.32). In this case the
probability density $|\Psi|^2$ varies with time, and the particle can be
regarded as moving around. The a's have the significance that $|a_n|^2$
is the probability of finding the particle in state n, provided some
experimental means exists of selecting particles in that quantum state.
[In certain problems, however, continuous ranges of allowed values of E
also occur; in such cases, in Eq. (13.32), the sum over the states with
discrete E's is supplemented by an integral analogous to a Fourier
integral.] That $|\Psi|^2$ does indeed vary in time can be seen by
carrying out the indicated evaluation

$$\Psi^*(x,t)\Psi(x,t) = \sum_n a_n^*a_n\psi_n^*(x)\psi_n(x) + \sum_{j\neq n} a_j^*a_n\psi_j^*\psi_n e^{i(E_j - E_n)t/\hbar} \tag{13.33}$$

which is a function of the time.


13.10 Transitions between States 


In classical electromagnetic theory an accelerated charge radiates
energy at a rate which is proportional to the square of its
acceleration. In quantum mechanics the uncertainty principle tells us
that the position and velocity of the charge cannot both be known with
unlimited precision. In going from classical to quantum theory one
obtains valid predictions if one replaces the classically well-defined
position coordinate x by its expectation value $\langle x\rangle$, and
we shall use this semiclassical approach because of its relative
simplicity.

Consider an electron in a stationary eigenstate such as we have
discussed for a harmonic oscillator or for a square well. In either
case, for level n, $\Psi_n(x,t) = \psi_n(x)e^{-iE_nt/\hbar}$, and the
expectation value for x is

$$\langle x\rangle = \int_{-\infty}^{\infty}\Psi_n^*x\Psi_n\,dx = \int_{-\infty}^{\infty}\psi_n^*x\psi_n\,dx \tag{13.34}$$

The expectation value of x is not a function of time. Therefore
$d^2\langle x\rangle/dt^2$ is zero, and no power is radiated. An
electron in a stationary state does not radiate. In Bohr's
half-classical, half-quantum theory he postulated that there was no
radiation from an electron moving in a Bohr orbit, a clear violation of
electromagnetic theory. In a full quantum-mechanical treatment it is the
concept of sharp orbits which is set aside rather than electrodynamics.

But if a particle in a stationary state does not radiate, how do
radiative transitions arise? To answer this we recall that the particle
is not alone in the universe; sooner or later some photon (or atom or
other particle) will approach and interact, thereby perturbing the
energy so that the particle is no longer in a stationary state. Assume
that the particle was initially in state i and that at t = 0 the
perturbing interaction was introduced. For t > 0 the wave function for
the particle is given by Eq. (13.32) in terms of the possible wave
functions for the unperturbed system. The values of $a_n$ are functions
of time; for t < 0, $a_i = 1$ and all others were zero. The uncertainty
principle does not permit us to know the energy exactly and hence the
values of the $a_n$'s as functions of time, but while the particle is in
the mixed state, the expectation value of x, as given by Eq. (13.34),
becomes

$$\langle x\rangle = \int_{-\infty}^{\infty}\left(\sum_n a_n^*a_n\psi_n^*x\psi_n + \sum_{j\neq n} a_j^*a_n\psi_j^*x\psi_n e^{i(E_j - E_n)t/\hbar}\right)dx \tag{13.35}$$

Clearly, the first sum in the integral is not a function of time, but
the second one is.

If we assume that the a's are all very near zero except $a_i \approx 1$
for the initial state and $a_f \approx 1$ for the final state, the only
term that gives rise to a time-dependent variation of $\langle x\rangle$
is $\left(a_i^*a_f\int\psi_i^*x\psi_f\,dx\right)e^{i(E_i - E_f)t/\hbar}$,
which we shall designate $\langle x\rangle_{if}$. The product of the
charge q on the particle and $\langle x\rangle$ is the x component of
the oscillating electric dipole moment for which the frequency is given
by the Einstein condition $\nu = (E_i - E_f)/h$. From

$$q\langle x\rangle_{if} = q\left(\int_{-\infty}^{\infty}\psi_i^*x\psi_f\,dx\right)e^{i(E_i - E_f)t/\hbar} \tag{13.36}$$

we observe that only if $\int_{-\infty}^{\infty}\psi_i^*x\psi_f\,dx$ is
nonzero can there be a radiative transition between states i and f
associated with the x component of the dipole moment. (Of course, the
electric dipole moment is a vector, the y and z components of which can
be calculated in a manner quite analogous to that shown above.) If all
three components are zero, electric-dipole transitions between states i
and f do not occur.

For example, in the case of the one-dimensional harmonic oscillator,
dipole radiation is emitted only when n decreases by 1 and absorbed only
when n increases by 1. Thus the harmonic oscillator obeys the selection
rule $\Delta n = \pm 1$, and the only frequency emitted or absorbed is
the classical oscillator frequency $\nu = \sqrt{\beta/m}/2\pi$.
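
The selection rule can be seen directly by evaluating the integrals $\int\psi_i^*x\psi_f\,dx$ numerically. The sketch below is illustrative only: it assumes $\alpha = 1$ (lengths measured in units of $1/\alpha$), builds the eigenfunctions of Eq. (13.29) from the Hermite recurrence, and uses a trapezoidal rule on a finite interval. The names `hermite`, `psi`, and `x_element` are hypothetical.

```python
import math


def hermite(n, y):
    """H_n(y) from the recurrence H_{k+1} = 2y H_k - 2k H_{k-1}."""
    h_prev, h = 1.0, 2.0 * y
    if n == 0:
        return h_prev
    for k in range(1, n):
        h_prev, h = h, 2.0 * y * h - 2.0 * k * h_prev
    return h


def psi(n, y):
    """Normalized oscillator eigenfunction of Eq. (13.29) with alpha = 1."""
    norm = 1.0 / math.sqrt(math.sqrt(math.pi) * 2**n * math.factorial(n))
    return norm * math.exp(-y * y / 2) * hermite(n, y)


def x_element(i, f, y_max=10.0, steps=4000):
    """Trapezoidal estimate of the integral of psi_i * y * psi_f over all y."""
    dy = 2 * y_max / steps
    total = 0.0
    for s in range(steps + 1):
        y = -y_max + s * dy
        weight = 0.5 if s in (0, steps) else 1.0
        total += weight * psi(i, y) * y * psi(f, y)
    return total * dy


# Table of matrix elements for the four lowest states.
for i in range(4):
    print([round(x_element(i, f), 6) for f in range(4)])
```

Only the entries with $f = i \pm 1$ differ appreciably from zero, which is the rule $\Delta n = \pm 1$; the nonzero values agree with the expressions quoted in Prob. 23 when $\alpha = 1$.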

The oscillating electric dipole is the simplest classical system which 
radiates electromagnetic waves. We shall find that the most prominent 
lines of atomic spectra are electric-dipole lines, although weaker lines 
arising from magnetic dipoles, electric quadrupoles, and higher multipoles 
are also observed. 


13.11 Perturbation Theory 


In most cases the wave equation cannot be solved exactly in terms of 
familiar mathematical functions. Resort must then be had either to 
numerical integration or to approximate methods. The best-known 
method of approximation, and one of wide usefulness, is that known as 
perturbation theory. In this method certain minor terms are at first
omitted from the wave equation, the simplified equation thus obtained is 
solved, and corrections to the energy and to the wave functions are then 
calculated to represent the effect of the terms that were omitted. The 
name arose from the analogous procedure of the astronomers, who first 
imagine each planet to move in an elliptical orbit about the sun and then 
calculate the perturbations of these motions caused by the attractions of 
the other planets. 

Let us suppose that the exact nondegenerate eigenfunctions $\psi_n$ and
eigenvalues $E_n$ are known for some problem, e.g., the simple harmonic
oscillator, so that

$$-\frac{\hbar^2}{2m}\frac{d^2\psi_n}{dx^2} + P(x)\psi_n = E_n\psi_n \tag{13.37}$$

Suppose that now the potential energy is changed very slightly to

$$P'(x) = P(x) + f(x)$$

Then the wave equation takes the form

$$-\frac{\hbar^2}{2m}\frac{d^2\psi_n'}{dx^2} + [P(x) + f(x)]\psi_n' = E_n'\psi_n' \tag{13.38}$$

Since the potential energy has been changed very little, we expect that
the new eigenfunctions $\psi_n'$ and the new eigenvalues $E_n'$ will
differ only slightly from $\psi_n$ and $E_n$. Since the $\psi_n$'s
represent a complete set in terms of which any reasonably smooth
function can be expanded, we express the amount by which $\psi_n'$
differs from $\psi_n$ by the series $\sum_j b_j\psi_j$, so that

$$\psi_n' = \psi_n + \sum_j b_j\psi_j \tag{13.39}$$

where the $b_j$'s are all small if $\psi_n'$ is almost equal to
$\psi_n$. Substituting Eq. (13.39) in Eq. (13.38) and using Eq. (13.37)
repeatedly, we obtain

$$E_n\psi_n + \sum_j b_jE_j\psi_j + f(x)\psi_n + f(x)\sum_j b_j\psi_j = E_n'\psi_n + E_n'\sum_j b_j\psi_j \tag{13.40}$$

Here, however, the term $f(x)\sum_j b_j\psi_j$ is of the second order
of small quantities, since f(x) and $b_j$ are both of the first order.
For a first correction we drop this term.

Next we multiply the resulting equation by $\psi_n^*\,dx$ and integrate
over the full range of x, remembering that
$\int\psi_n^*\psi_j\,dx = \delta_{nj}$. This gives

$$(1 + b_n)E_n + \int_{-\infty}^{\infty}\psi_n^*f(x)\psi_n\,dx = (1 + b_n)E_n'$$

$$E_n' - E_n = (1 + b_n)^{-1}\int_{-\infty}^{\infty}\psi_n^*f(x)\psi_n\,dx$$

Here the integral is a small quantity of the first order; hence $b_n$
produces an effect of second order and for consistency must be dropped.
We thus find for the first-order correction to the energy

$$E_n' - E_n = \int_{-\infty}^{\infty}\psi_n^*f(x)\psi_n\,dx \tag{13.41}$$

The first-order perturbation energy $E_n' - E_n$ is just the
perturbation function averaged over the corresponding unperturbed state.

To find the $b_k$'s, we return to Eq. (13.40), with the second-order
term omitted as before, multiply it through by $\psi_k^*\,dx$ with
$k \neq n$, and integrate as before, obtaining

$$(E_k - E_n')b_k + \int_{-\infty}^{\infty}\psi_k^*f(x)\psi_n\,dx = 0 \tag{13.42}$$

Here we may replace $E_n'$ by $E_n$, since the error thereby
introduced, $(E_n' - E_n)b_k$, is of second order. Thus for a
first-order correction

$$b_k = \frac{1}{E_n - E_k}\int_{-\infty}^{\infty}\psi_k^*f(x)\psi_n\,dx \tag{13.42a}$$

The coefficient $b_n$ remains arbitrary and may be adjusted so as to
normalize $\psi_n'$.

The integral $\int\psi_k^*f(x)\psi_n\,dx$ is called a matrix element of
f(x) with respect to the functions $\psi_k$ and $\psi_n$. Its possible
values could be written in the form of a matrix with its rows numbered
by k and its columns by n. Components for n = k are called diagonal
components, i.e., located on the leading diagonal of the matrix. Thus
the first-order correction to $E_n$ is given by the diagonal matrix
component of f(x) with respect to $\psi_n$. This is an important result
obtained from perturbation theory.
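
Equations (13.41) and (13.42a) translate directly into a short numerical recipe. The sketch below is illustrative only and rests on several assumptions not in the text: the unperturbed system is the infinite square well on 0 < x < L, the perturbation is a hypothetical linear ramp $f(x) = \epsilon x/L$, energies are measured in units of the unperturbed ground-state energy (so $E_n = n^2$), and the integrals are done with a trapezoidal rule.

```python
import math

L = 1.0      # well width in arbitrary units (assumption)
EPS = 0.01   # strength of the hypothetical perturbation, in units of E_1


def psi(n, x):
    """Unperturbed infinite-well eigenfunction on 0 < x < L."""
    return math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)


def f(x):
    """Hypothetical perturbation: a gentle linear ramp across the well."""
    return EPS * x / L


def matrix_element(k, n, steps=2000):
    """Trapezoidal estimate of the integral of psi_k * f * psi_n over the well."""
    h = L / steps
    total = 0.0
    for s in range(steps + 1):
        x = s * h
        weight = 0.5 if s in (0, steps) else 1.0
        total += weight * psi(k, x) * f(x) * psi(n, x)
    return total * h


def energy(n):
    """Unperturbed energy in units of E_1."""
    return float(n * n)


def first_order_shift(n):
    """Diagonal matrix element, Eq. (13.41)."""
    return matrix_element(n, n)


def mixing_coefficient(k, n):
    """First-order expansion coefficient b_k of Eq. (13.42a); requires k != n."""
    return matrix_element(k, n) / (energy(n) - energy(k))


for n in (1, 2, 3):
    print(n, first_order_shift(n))
print(mixing_coefficient(2, 1), mixing_coefficient(3, 1))
```

For this ramp every level is raised by the same amount, $\epsilon/2$, because the average of x over any unperturbed state is L/2; the off-diagonal elements then fix how much of each neighboring state is mixed in.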


13.12 Emission and Absorption of Radiation 


One of the most important properties of atoms and molecules is their 
ability to emit or absorb radiant energy. In classical mechanics, emission 
of radiation results from the electromagnetic field emitted by accelerated 
electric charges, whereas absorption results from work done on the charges 
by forces exerted on them by the electric vector of an incident field. In 
wave mechanics, on the other hand, both emission and absorption are 
associated with transitions of atoms or molecules between quantum states 
of different energies. 


a. Transition Probabilities and Mean Life  Let the quantum states for
an atom all be numbered off in a single series. Then, when an atom is in
state n, there is a certain probability that during an interval of time
dt it will jump spontaneously into another state j, with the emission of
a photon of frequency $(E_n - E_j)/h$. This probability is denoted by
$A_{nj}\,dt$. Out of a large number N of atoms in state n, $NA_{nj}$
jump per second into state j.

Consider the history of $N_0$ atoms that start in state n. If N of them
are still in that state after a time t, then during time dt a number
$-dN$ will leave by spontaneous radiative transitions, where

$$dN = -N\gamma\,dt \qquad \gamma = \sum_j A_{nj}$$

Here $\sum_j$ denotes a sum over all states into which a spontaneous
jump can occur out of state n, i.e., over all states having lower
energy. In a particular case, some $A_{nj}$'s may, of course, be zero.
Integrating (see Fig. 13.6), $N = N_0e^{-\gamma t}$.

Fig. 13.6 The exponential decrease by radiative transitions.

There is no absolute limit to the length of time that an individual
atom may remain in a given quantum state. The average time $\tau_n$
spent by the atoms in state n is called the mean life of an atom in that
state. To find it, consider $N_0$ atoms that have just entered this
state. Then, during each dt, $N\gamma\,dt$ of them leave the state after
spending a time t in it; and, inserting the value of N from the last
equation, we can write

$$\tau_n = \frac{1}{N_0}\int_0^{\infty} t\,(N_0e^{-\gamma t}\gamma\,dt) = \gamma\int_0^{\infty} te^{-\gamma t}\,dt = \frac{1}{\gamma}$$

Hence

$$\tau_n = \frac{1}{\gamma} = \left[\sum_j A_{nj}\right]^{-1} \tag{13.43}$$

For atomic states involved in visible or ultraviolet emission, $\tau_n$
is commonly of the order of $10^{-8}$ s; in the x-ray region it is much
smaller.
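
A short numerical check of Eq. (13.43) may help fix ideas. The sketch below is illustrative only: the two transition probabilities are invented values chosen so that the mean life comes out near $10^{-8}$ s, and the integral defining the mean life is evaluated with a trapezoidal rule cut off at 40 mean lives. The names are hypothetical.

```python
import math

# Hypothetical spontaneous-transition probabilities A_nj (per second) out of state n.
A_NJ = [6.0e7, 4.0e7]

gamma = sum(A_NJ)   # total decay constant, the sum over j of A_nj
tau = 1.0 / gamma   # mean life from Eq. (13.43)


def surviving_fraction(t):
    """Fraction N/N_0 of atoms still in state n after time t."""
    return math.exp(-gamma * t)


def mean_life_numeric(t_max_in_tau=40.0, steps=40000):
    """Trapezoidal estimate of the integral of t * gamma * exp(-gamma t) dt."""
    t_max = t_max_in_tau * tau
    dt = t_max / steps
    total = 0.0
    for s in range(steps + 1):
        t = s * dt
        weight = 0.5 if s in (0, steps) else 1.0
        total += weight * t * gamma * surviving_fraction(t)
    return total * dt


print(tau, mean_life_numeric(), surviving_fraction(tau))
```

After one mean life the surviving fraction has fallen to 1/e, and the numerically averaged residence time agrees with $1/\gamma$.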


b. The Einstein Coefficients  In 1916 Einstein considered the emission
and absorption of radiation by quantized systems such as atoms and
molecules. Assume that there are many identical systems with energy
states $E_n$ and $E_j$ which emit and absorb radiation of frequency

$$\nu_{nj} = \frac{E_n - E_j}{h}$$

(note that here $E_n > E_j$). They are immersed in a bath of blackbody
radiation such that $U_\nu\,d\nu$ represents the energy density of
radiation with frequencies between $\nu$ and $\nu + d\nu$. Einstein
assumed that the probability a system in state j will absorb a photon is
proportional to $U_\nu(\nu_{nj})$; thus the probability is
$B_{jn}U_\nu(\nu_{nj})$, where $B_{jn}$ is Einstein's coefficient of
absorption. Einstein's great insight led him to write the probability
that a system in state n will emit a photon $h\nu_{nj}$ as the sum of
the transition probability $A_{nj}$ discussed above and an induced
transition probability proportional to $U_\nu(\nu_{nj})$. Thus the
probability of emission is

$$A_{nj} + B_{nj}U_\nu(\nu_{nj})$$

where $A_{nj}$ and $B_{nj}$ are respectively the Einstein coefficients
of spontaneous and induced emission.

In equilibrium the number of systems $N_n$ in state n and the number
$N_j$ in state j remain constant. The number of transitions per unit
time from n to j is equal to the number from j to n, whence

$$N_n[A_{nj} + B_{nj}U_\nu(\nu_{nj})] = N_jB_{jn}U_\nu(\nu_{nj})$$

However, by Eq. (4.20), we have

$$N_j = N_ne^{(E_n - E_j)/kT} = N_ne^{h\nu_{nj}/kT}$$

and therefore

$$U_\nu(\nu_{nj}) = \frac{A_{nj}}{B_{jn}e^{h\nu_{nj}/kT} - B_{nj}} \tag{13.44}$$

Comparison of Eq. (13.44) with Eq. (5.21) for blackbody radiation
density reveals that the three Einstein coefficients must be related by
the equations

$$B_{jn} = B_{nj} \tag{13.45a}$$

$$A_{nj} = \frac{8\pi h\nu_{nj}^3}{c^3}B_{nj} \tag{13.45b}$$
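
The balance expressed by Eqs. (13.44) and (13.45) can be checked with numbers. The sketch below is illustrative only. It assumes that Eq. (5.21), which is not reproduced in this section, is the usual Planck energy density per unit frequency in SI units; the value B = 1 is an arbitrary normalization, and the function names are hypothetical.

```python
import math

H_PLANCK = 6.62607015e-34  # J s
K_BOLTZ = 1.380649e-23     # J/K
C_LIGHT = 2.99792458e8     # m/s


def a_over_b(nu):
    """Ratio A_nj / B_nj from Eq. (13.45b)."""
    return 8 * math.pi * H_PLANCK * nu**3 / C_LIGHT**3


def planck_energy_density(nu, temp):
    """Blackbody energy density per unit frequency (assumed form of Eq. 5.21)."""
    return a_over_b(nu) / math.expm1(H_PLANCK * nu / (K_BOLTZ * temp))


def spontaneous_to_induced(nu, temp):
    """Ratio A_nj / (B_nj U_nu) of spontaneous to induced emission."""
    return a_over_b(nu) / planck_energy_density(nu, temp)


def transition_rates(nu, temp, b=1.0):
    """Return (downward, upward) rates per system in the upper state."""
    n_upper = 1.0
    n_lower = math.exp(H_PLANCK * nu / (K_BOLTZ * temp))  # Boltzmann ratio N_j / N_n
    u = planck_energy_density(nu, temp)
    a = a_over_b(nu) * b
    return n_upper * (a + b * u), n_lower * b * u


for nu in (5.0e14, 1.0e10):  # a visible and a microwave frequency
    down, up = transition_rates(nu, 300.0)
    print(nu, spontaneous_to_induced(nu, 300.0), down, up)
```

At room temperature spontaneous emission dominates overwhelmingly in the visible, while in the microwave region induced emission is the larger term; in both cases the upward and downward rates balance.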


The fact that photons of frequency $\nu_{nj}$ not only raise systems in
state j to state n, but also induce systems in state n to make radiative
transitions to state j, is widely exploited in modern technology. The great
intensities and coherence available in laser and maser beams are achieved 
by increasing the population of a suitable energy state greatly over 
its thermal-equilibrium value and stimulating transitions to a lower level. 


Appendix 13A - The Harmonic Oscillator 


The Schrödinger equation for the harmonic oscillator, Eq. (13.27), may
be written

$$\frac{d^2\psi}{dx^2} = \frac{2m}{\hbar^2}\left(\tfrac{1}{2}\beta x^2 - E\right)\psi = \left(\frac{m\beta}{\hbar^2}x^2 - \frac{2mE}{\hbar^2}\right)\psi \tag{13A.1}$$

We now make the substitutions

$$y = \left(\frac{m\beta}{\hbar^2}\right)^{1/4}x = \alpha x \tag{13A.2a}$$

$$\epsilon = \frac{2E}{\hbar}\sqrt{\frac{m}{\beta}} \tag{13A.2b}$$

to obtain the dimensionless equation

$$\frac{d^2\psi}{dy^2} = (y^2 - \epsilon)\psi \tag{13A.3}$$

For a sufficiently large value of y, $y^2 \gg \epsilon$, and we may
write Eq. (13A.3) in the asymptotic form $d^2\psi/dy^2 = y^2\psi$, which
is satisfied by $\psi = e^{\pm y^2/2}$. To satisfy the normalization
condition $\psi$ must vanish as y goes to infinity; therefore we reject
the positive sign and seek an exact solution of Eq. (13A.3) of the form
$\psi(y) = Ce^{-y^2/2}H(y)$, where H(y) is a function to be determined
and C is a normalization constant. Inserting this into Eq. (13A.3)
reveals that H(y) must satisfy

$$\frac{d^2H}{dy^2} - 2y\frac{dH}{dy} + (\epsilon - 1)H = 0 \tag{13A.4}$$

One way of determining H(y) is to assume that it can be written
as a power series in y:

$$H(y) = a_0 + a_1y + a_2y^2 + \cdots = \sum_{n=0}^{\infty} a_ny^n \tag{13A.5}$$

where the a's are constant coefficients yet to be found. Substitution of
Eq. (13A.5) into Eq. (13A.4) yields

$$\sum_{n=0}^{\infty} y^n[(n + 1)(n + 2)a_{n+2} - 2na_n + (\epsilon - 1)a_n] = 0 \tag{13A.6}$$

which can be true for every y only if each coefficient in the expansion
is identically zero, or

$$a_{n+2} = \frac{2n + 1 - \epsilon}{(n + 1)(n + 2)}\,a_n \tag{13A.7}$$


This recursion formula permits one to write H(y) in terms of two
coefficients $a_0$ and $a_1$. However, the normalization condition
$\int_{-\infty}^{\infty}\psi^*\psi\,dy = 1$ can be satisfied only if
H(y) terminates at some value of n (Prob. 12). This happens if two
conditions are satisfied:

1. For some specific n, $a_{n+2} = 0$, which occurs if

$$2n + 1 = \epsilon = \frac{2E}{\hbar}\sqrt{\frac{m}{\beta}}$$

2. If the n of condition 1 is odd, $a_0$ must be zero; if this n is
even, $a_1$ must be zero.

Condition 1 restricts the allowed values of the energy of the
oscillator to those given by
$E_n = (n + \tfrac{1}{2})\hbar\sqrt{\beta/m}$. The classical frequency
of the harmonic oscillator is $\nu = \sqrt{\beta/m}/2\pi$, and so the
allowed energies are

$$E_n = \left(n + \tfrac{1}{2}\right)h\nu \tag{13A.8}$$

Condition 2 limits the eigenfunctions $\psi_n$ to be either even or odd
functions of y. The functions $H_n(y)$, known as the Hermite
polynomials, are even or odd as n is even or odd. They can be obtained
from the generating function

$$S(y,s) = e^{y^2 - (s - y)^2} = \sum_{n=0}^{\infty}\frac{H_n(y)}{n!}s^n \tag{13A.9}$$

or alternatively from

$$H_n(y) = (-1)^ne^{y^2}\frac{d^n(e^{-y^2})}{dy^n} \tag{13A.10}$$
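
The way the recursion formula (13A.7) terminates only for $\epsilon = 2n + 1$ can be seen by simply running it. The sketch below is illustrative: it uses exact rational arithmetic, starts the even series with $a_0 = 1$ or the odd series with $a_1 = 1$ (the other starting coefficient set to zero, as condition 2 requires), and carries a fixed, arbitrarily chosen number of terms. The function name `series_coefficients` is hypothetical.

```python
from fractions import Fraction


def series_coefficients(eps, parity, n_terms=12):
    """Coefficients a_0 .. a_{n_terms-1} generated by Eq. (13A.7).

    parity = 0 starts the even series with a_0 = 1;
    parity = 1 starts the odd series with a_1 = 1.
    """
    a = [Fraction(0)] * n_terms
    a[parity] = Fraction(1)
    for n in range(parity, n_terms - 2, 2):
        a[n + 2] = (2 * n + 1 - Fraction(eps)) / ((n + 1) * (n + 2)) * a[n]
    return a


# eps = 9 = 2*4 + 1: the even series stops at y^4.
print(series_coefficients(9, 0))
# eps = 7 = 2*3 + 1: the odd series stops at y^3.
print(series_coefficients(7, 1))
# eps = 8 is not of the form 2n + 1: the even series never terminates.
print(series_coefficients(8, 0))
```

With $\epsilon = 9$ the series is $1 - 4y^2 + \tfrac{4}{3}y^4$, which is $H_4(y)/12$; the overall factor is absorbed into the normalization constant C.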
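Appendix 13B - Proof of the Orthogonality Relation


Let $\psi_n(x)$ and $\psi_j(x)$ be normalized eigenfunctions of the
Schrödinger wave equation so that

$$-\frac{\hbar^2}{2m}\frac{d^2\psi_n}{dx^2} + P\psi_n = E_n\psi_n \tag{13B.1a}$$

$$-\frac{\hbar^2}{2m}\frac{d^2\psi_j}{dx^2} + P\psi_j = E_j\psi_j \tag{13B.1b}$$

Since everything in both equations is real except possibly the
eigenfunctions, the complex conjugate of $\psi_j$ obeys the equation

$$-\frac{\hbar^2}{2m}\frac{d^2\psi_j^*}{dx^2} + P\psi_j^* = E_j\psi_j^* \tag{13B.2}$$

If we multiply (13B.1a) by $\psi_j^*$ and (13B.2) by $\psi_n$ and
subtract, we have

$$-\frac{\hbar^2}{2m}\left(\psi_j^*\frac{d^2\psi_n}{dx^2} - \psi_n\frac{d^2\psi_j^*}{dx^2}\right) + P(\psi_j^*\psi_n - \psi_n\psi_j^*) = (E_n - E_j)\psi_j^*\psi_n \tag{13B.3}$$

If we integrate both sides from $x = -\infty$ to $x = \infty$, we find

$$\int_{-\infty}^{\infty}\left(\psi_j^*\frac{d^2\psi_n}{dx^2} - \psi_n\frac{d^2\psi_j^*}{dx^2}\right)dx = \int_{-\infty}^{\infty}\frac{d}{dx}\left(\psi_j^*\frac{d\psi_n}{dx} - \psi_n\frac{d\psi_j^*}{dx}\right)dx = -\frac{2m}{\hbar^2}(E_n - E_j)\int_{-\infty}^{\infty}\psi_j^*\psi_n\,dx$$

or

$$\left[\psi_j^*\frac{d\psi_n}{dx} - \psi_n\frac{d\psi_j^*}{dx}\right]_{-\infty}^{\infty} = \frac{2m}{\hbar^2}(E_j - E_n)\int_{-\infty}^{\infty}\psi_j^*\psi_n\,dx$$

The normalization condition requires that $\psi_n$ and $\psi_j^*$ go to
zero at $x = \pm\infty$, and therefore
$\int_{-\infty}^{\infty}\psi_j^*\psi_n\,dx = 0$ if $E_j \neq E_n$. If
$\psi_j$ and $\psi_n$ are degenerate so that $E_j = E_n$, the situation
becomes more complicated; nevertheless it is always possible to make
degenerate eigenfunctions orthogonal.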


Problems 


1. Solve the problem of the infinitely deep square well with the origin
at the center of the well and P = 0 for $-L/2 < x < L/2$. Show that
the solutions are alternately of even and odd parity. Prove that the
eigenvalues of the energy are the same as those given by Eq. (13.8).
2. Find the commutator of the operators (a) $x\,d/dx$ and $x^2\,d/dx$,
(b) energy $\hat{E}$ and time t.
3. (a) Find the five lowest kinetic energies which are allowed for an
electron in a cubical box 1 cm on a side.
(b) Repeat for a cubical box 3 Å on a side. (Note that this is
roughly the average volume occupied by an atom in a solid.)

Partial Ans: (a) $E_{111} = 1.13 \times 10^{-14}$ eV;
$E_{222} = 4.53 \times 10^{-14}$ eV; (b) $E_{111} = 12.6$ eV
4. Find the eigenvalues and eigenfunctions for the operator $d^2/dx^2$
for the interval 0 < x < L if the functions $u_i(x)$ on which it
operates satisfy the boundary conditions du/dx = 0 at x = 0 and u = 0 at
x = L.


5. Show that $u(x) = Ae^{-bx}$ is an eigenfunction of the operator


decid elo 
C | 
and determine the corresponding eigenvalue. 
Ans: b? 
6. (a) Show that if the hamiltonian operator can be written as the sum
of terms, each of which depends on a different set of coordinates so
that
$\hat{H} = \hat{H}_a(x_1,y_1,z_1) + \hat{H}_b(x_2,y_2,z_2) + \hat{H}_c(r,\theta,\phi) + \cdots$,
the eigenfunctions can be written as a product of terms

$$\psi = \psi_a(x_1,y_1,z_1)\,\psi_b(x_2,y_2,z_2)\,\psi_c(r,\theta,\phi)\cdots$$

(b) Show further that the energy corresponding to $\psi$ is
$E_a + E_b + E_c + \cdots$, where $E_a$ is the energy eigenvalue
associated with $\psi_a$, etc.


7. A particle of mass m is bound in a square well of width W and height
$P_0$. The eigenfunction for the ground state is given by

$$\psi(x) = B\cos\frac{4\pi x}{5W}$$

for $-W/2 < x < W/2$ and by $\psi = Ae^{-\alpha|x|}$ outside the well.

(a) Taking m and W as given, find $\alpha$, E, and $P_0$. Compare the
value of E with that of the infinite square well.

(b) Find A in terms of B and the given quantities.

(c) Set up the integral from which B can be calculated.

Ans: (a) $7.72/W$, $2h^2/25mW^2$, $21h^2/25mW^2$; (b) 14.7B
8. A particle of mass m moves with potential energy $P(x) = \infty$ for
x < 0 and $\tfrac{1}{2}\beta x^2$ for x > 0. Find the eigenfunctions and
the allowed energies.

9. Find the expectation values of the kinetic and potential energies 
of a harmonic oscillator in its ground state. 


Ans: hv/4, hv/4 


10. A three-dimensional harmonic oscillator moves in the potential
$P(x,y,z) = \tfrac{1}{2}m\omega^2(x^2 + y^2 + z^2)$. Separate the
Schrödinger equation and show that the allowed energies are given by
$E = (n + \tfrac{3}{2})\hbar\omega$.

11. Is $u(x) = xe^{-x^2/2}$ an eigenfunction of $d^2/dx^2 - x^2$? If it
is, find the corresponding eigenvalue.


12. Show that unless H(y) terminates for some value of n,

$$\psi = e^{-y^2/2}\sum_{n=0}^{\infty} a_ny^n$$

goes to infinity for $y \to \infty$. Hint: Show that H(y) behaves like
$e^{y^2}$ for large y.
13. Show that when $\Psi(x,t) = \sum a_n\Psi_n(x,t)$,
$\langle E\rangle = \sum a_n^*a_nE_n$ and thus that $a_n^*a_n$ is the
probability of obtaining the value $E_n$ as the result of a single
measurement of the total energy E and hence is the probability of
finding the particle in the eigenstate n.
14. A particle is confined to move in the x and y directions under the
influence of the potentials $P(x,y) = P_0(x) + P_1(y)$, where
$P_0(x) = \beta x^2/2$ and $P_1(y) = 0$ for $-a < y < a$ and
$P_1(y) = \infty$ for $y < -a$ and $y > a$.
(a) Find the eigenfunctions by solving the Schrödinger wave
equation.
(b) Find the allowed energies for the particle.
15. It is desired to calculate the expectation value of the product
of position x and momentum $p_x$. Show that neither


nor 


is acceptable since both lead to imaginary values. Show that 


nie oa 
leads to a real value. Does $\langle xp\rangle = \langle x\rangle\langle p\rangle$?
Hint: Integrate by parts.
16. An electron is bound in an infinite square well such that P = 0 for
$-L/2 < x < L/2$ and infinity elsewhere. If at some instant the particle
is in the state represented by $\psi(x) = \sqrt{2/L}$ for $|x| < L/2$
and 0 for $|x| > L/2$, calculate the probability that a measurement of
the energy will give the result (a) $h^2/8mL^2$ and (b) $h^2/2mL^2$.

17. A particle is in the potential field $P(x) = \tfrac{1}{2}\beta x^2$
of a classical simple harmonic oscillator. Calculate the probability of
finding the particle outside the classical limits of its motion if it is
in the ground state.

Ans: 0.16
18. A series of measurements on a one-dimensional system leaves the
system with the wave function $\psi(x) = A\sin(\pi x/L)$ for
$-L < x < L$ and zero elsewhere.

(a) Find the normalization constant A.

(b) Compute the probability that a measurement of the momentum
$p_x$ for the system yields the value $p_n$, where $p_n$ is an allowed
momentum of the system. Plot the probability against $p_nL/h$ and show
that $\Delta p_x\,\Delta x \approx h$.

(c) Calculate the expectation value $\langle p\rangle$ and compare with
the graph of part (b).

19. Show that (a) $yH_n(y) = nH_{n-1}(y) + \tfrac{1}{2}H_{n+1}(y)$,
(b) $dH_n(y)/dy = 2nH_{n-1}(y)$, and
(c) $dH_n(y)/dy = 2yH_n(y) - H_{n+1}(y)$.

20. An electron moves in the potential well $P(x) = -\delta$ for
$-a < x < 0$ and $P(x) = \delta$ for $0 < x < a$ (Fig. 13.7). Use
first-order perturbation theory to compute the first four energy levels.
Set up the expression for the first-order expansion coefficients for the
lowest energy state.


Fig. 13.7 (P(x) plotted against x, with levels $-\delta$ and $\delta$ between $-a$ and $a$)

Fig. 13.8 (P plotted against x, with level $P_1$ and the points 0 and L marked)


21. A particle is in the potential field shown in Fig. 13.8.

(a) Solve the Schrödinger wave equation and find the eigenfunctions
for the bound states.

(b) Prove that the eigenenergies of the bound states are solutions of
the equation $-K = k\cot kL$, where $k = \sqrt{2mE}/\hbar$ and
$K = \sqrt{2m(P_1 - E)}/\hbar$.

(c) What condition is placed on $P_1$ by the requirement that there
be only one bound state for the system?
22. Prove that the Hermite functions $\psi_n = N_ne^{-y^2/2}H_n(y)$ are
orthogonal and show that the normalization constant which makes
$\int_{-\infty}^{\infty}\psi_n^2(x)\,dx = 1$ is
$N_n = (\alpha/\sqrt{\pi}\,2^nn!)^{1/2}$.

23. Prove that $\langle x\rangle_{nm}$ for the harmonic oscillator is
zero unless $m = n \pm 1$ and thus that transitions occur only between
adjacent energy levels. Show that
$\langle x\rangle_{n,n+1} = \sqrt{(n + 1)/2\alpha^2}$ while
$\langle x\rangle_{n,n-1} = \sqrt{n/2\alpha^2}$.

24. Show that for a harmonic oscillator in its nth eigenstate
$\Delta x\,\Delta p_x = (n + \tfrac{1}{2})\hbar$, where $\Delta x$ and
$\Delta p_x$ are defined in Sec. 12.6.


chapter fourteen 


Wave Mechanics III 
The Hydrogen Atom 


An understanding of the structure of the hydrogen atom is of primary
importance, not so much because hydrogen is the simplest atom and was the
first to be treated, but because the treatment of more complex atoms is typically
based on hydrogenic wave functions. The Balmer formula yields the wavelengths
of the visible lines of hydrogen, and the Bohr model provides a more
complete prediction of energy states and spectral lines, but neither forms a
usable foundation for understanding the spectra, structure, and interactions
of other atoms.


14.1 The Schrödinger Equation for a One-electron Atom 


The hydrogen atom is a system consisting of an electron and a proton
bound by electrostatic attraction. Similarly an ionized helium atom and
a doubly ionized lithium atom are composed of a positive nucleus and a
single electron. If $r$ is the distance between particles, the electrostatic
potential energy $P$ for the system is

$$ P = -\frac{Ze^2}{4\pi\epsilon_0 r} \tag{14.1} $$

where $Ze$ = nuclear charge
$-e$ = electronic charge
$\epsilon_0$ = permittivity of free space
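Let $M$ and $m$ be respectively the masses of the nucleus located at
$(x_1,y_1,z_1)$ and the electron at $(x_2,y_2,z_2)$. The Schrödinger equation for
this two-particle system is

$$ -\frac{\hbar^2}{2M}\left(\frac{\partial^2\psi_T}{\partial x_1^2} + \frac{\partial^2\psi_T}{\partial y_1^2} + \frac{\partial^2\psi_T}{\partial z_1^2}\right) - \frac{\hbar^2}{2m}\left(\frac{\partial^2\psi_T}{\partial x_2^2} + \frac{\partial^2\psi_T}{\partial y_2^2} + \frac{\partial^2\psi_T}{\partial z_2^2}\right) + P\psi_T = E_T\psi_T $$

where $\psi_T$ is a function of $x_1, y_1, z_1, x_2, y_2, z_2$; the subscript $T$ of $\psi_T$ and $E_T$
indicates that they refer to the complete system of nucleus
and electron. We now replace the six variables by the three coordinates
$(x_c,y_c,z_c)$ of the center of mass of the system and the three coordinates of
the electron relative to the proton by making the substitutions

$$ x_c = \frac{Mx_1 + mx_2}{M + m} \qquad y_c = \frac{My_1 + my_2}{M + m} \qquad z_c = \frac{Mz_1 + mz_2}{M + m} $$

$r \sin\theta \cos\phi = x_2 - x_1$, $r \sin\theta \sin\phi = y_2 - y_1$, and $r \cos\theta = z_2 - z_1$.
The Schrödinger equation then becomes

$$ -\frac{\hbar^2}{2(M+m)}\left(\frac{\partial^2\psi_T}{\partial x_c^2} + \frac{\partial^2\psi_T}{\partial y_c^2} + \frac{\partial^2\psi_T}{\partial z_c^2}\right) - \frac{\hbar^2(M+m)}{2Mm}\left[\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\psi_T}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial\psi_T}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\psi_T}{\partial\phi^2}\right] - \frac{Ze^2}{4\pi\epsilon_0 r}\psi_T = E_T\psi_T $$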


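This equation can be separated by expressing $\psi_T(x_c,y_c,z_c,r,\theta,\phi)$ as the
product of a function $\psi_c$ of $x_c, y_c, z_c$ alone and a function $\psi$ of $r, \theta, \phi$ only.
Upon substituting $\psi_T(x_c,y_c,z_c,r,\theta,\phi) = \psi_c(x_c,y_c,z_c)\,\psi(r,\theta,\phi)$ in the Schrödinger
equation and dividing through by $\psi_c\psi$, it is found that the resulting
equation is the sum of two parts, one of which depends only on $x_c, y_c, z_c$
and the other only on $r, \theta, \phi$. Therefore each part must be equal to a
constant, and we obtain the two equations

$$ -\frac{\hbar^2}{2(M+m)}\left(\frac{\partial^2\psi_c}{\partial x_c^2} + \frac{\partial^2\psi_c}{\partial y_c^2} + \frac{\partial^2\psi_c}{\partial z_c^2}\right) = E_{tr}\psi_c \tag{14.2} $$

$$ -\frac{\hbar^2(M+m)}{2Mm}\left[\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\psi}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial\psi}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\psi}{\partial\phi^2}\right] - \frac{Ze^2}{4\pi\epsilon_0 r}\psi = E\psi \tag{14.3} $$

where $E = E_T - E_{tr}$.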

Equation (14.2) represents the motion of the center of mass, which
behaves like a free particle of mass $M + m$ with translational kinetic
energy $E_{tr}$. For our purposes the translational energy of the atom is not
of interest; $E_{tr}$ can take on any positive value (or zero).

Equation (14.3) is the wave equation of a particle of mass
$Mm/(M + m)$ under the influence of the potential function of Eq. (14.1).
The quantity

$$ m_r = \frac{Mm}{M + m} = \frac{m}{1 + m/M} \tag{14.4} $$

is just the reduced mass of the electron (see Sec. 9.4). Indeed, in the
limit as $M$ goes to infinity, $m_r$ approaches $m$ and Eq. (14.3) becomes the
wave equation of an electron bound to a fixed origin with the potential
energy of Eq. (14.1). It should be observed further that Eq. (14.3)
may be written

$$ -\frac{\hbar^2}{2m_r}\nabla^2\psi + P\psi = E\psi \tag{14.3a} $$

since the quantity in brackets in Eq. (14.3) is $\nabla^2$ in spherical polar
coordinates.
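As a quick numerical illustration of Eq. (14.4), the sketch below evaluates the reduced mass for hydrogen. The mass values are typed in by hand and are assumptions of this example, not quantities quoted in the text; the function name `reduced_mass` is likewise only illustrative.

```python
# Reduced mass m_r = M m / (M + m), Eq. (14.4).
# Masses in kg; hand-entered approximate values (assumed for this sketch).
ELECTRON_MASS = 9.1093837e-31
PROTON_MASS = 1.67262192e-27


def reduced_mass(nucleus_mass, electron_mass=ELECTRON_MASS):
    """Return M m / (M + m), the effective mass appearing in Eq. (14.3)."""
    return nucleus_mass * electron_mass / (nucleus_mass + electron_mass)


m_r_hydrogen = reduced_mass(PROTON_MASS)
ratio = m_r_hydrogen / ELECTRON_MASS  # slightly less than 1

print(f"m_r / m for hydrogen = {ratio:.6f}")
```

The ratio differs from unity by about one part in 1,836, which is why treating the nucleus as fixed is a good first approximation.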


14.2 Separation of Variables 


The advantage of writing the Schrödinger equation for the one-electron
atom in spherical coordinates is that in this form it can be readily
separated into three equations, each involving a single coordinate. To do
this we assume that

$$ \psi(r,\theta,\phi) = R(r)\,\Theta(\theta)\,\Phi(\phi) $$

If we substitute this equation into Eq. (14.3) and multiply by
$-2m_r r^2 \sin^2\theta/(\hbar^2 R\Theta\Phi)$
we obtain

$$ \frac{\sin^2\theta}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{\sin\theta}{\Theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + \frac{1}{\Phi}\frac{d^2\Phi}{d\phi^2} + \frac{2m_r r^2\sin^2\theta}{\hbar^2}\left(E + \frac{Ze^2}{4\pi\epsilon_0 r}\right) = 0 \tag{14.5} $$


The third term of Eq. (14.5) is a function of $\phi$ alone, while none of
the other terms involves $\phi$ at all. If Eq. (14.5) is to hold for all values of
$\phi$, this third term must be a constant, which we shall designate as $-m_l^2$,
and $\Phi$ must obey the equation

$$ \frac{d^2\Phi}{d\phi^2} = -m_l^2\,\Phi \tag{14.6} $$


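If we substitute $-m_l^2$ for the third term in Eq. (14.5), divide by
$\sin^2\theta$, and rearrange, we arrive at the result that

$$ \frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{2m_r r^2}{\hbar^2}\left(\frac{Ze^2}{4\pi\epsilon_0 r} + E\right) = \frac{m_l^2}{\sin^2\theta} - \frac{1}{\Theta\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) \tag{14.6a} $$

Here the left side is a function of $r$ alone, while the right depends solely on
$\theta$. Therefore, both sides must be equal to some constant which is
written as $l(l + 1)$ for reasons which will appear later. If we equate both
sides to $l(l + 1)$ and rearrange, we find that

$$ \frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + \left[l(l+1) - \frac{m_l^2}{\sin^2\theta}\right]\Theta = 0 \tag{14.7} $$

and

$$ \frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \left[\frac{2m_r}{\hbar^2}\left(\frac{Ze^2}{4\pi\epsilon_0 r} + E\right) - \frac{l(l+1)}{r^2}\right]R = 0 \tag{14.8} $$

In Eqs. (14.6) to (14.8) we have three differential equations, each of a
single variable, from which $R$, $\Theta$, and $\Phi$ can be found. Once they are
known, we can immediately write $\psi$, which is $R(r)\Theta(\theta)\Phi(\phi)$.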


14.3 The Wave Functions and Energy Levels 


A solution of Eq. (14.6) may be written immediately

$$ \Phi(\phi) = Ce^{im_l\phi} $$

where $C$ is a constant of integration. The condition that the wave
function be single-valued (Sec. 12.2) requires that $m_l$ be 0 or a positive or
negative integer, since increasing $\phi$ by $2\pi$ brings one back to the same
point in space, so that $\Phi(\phi_1) = \Phi(\phi_1 + 2\pi)$. If we let $m_l$ take on the
values 0 and all positive and negative integers, and if we normalize $\Phi(\phi)$
so that $\int_0^{2\pi} \Phi^*\Phi \, d\phi = 1$, we have finally

$$ \Phi_{m_l}(\phi) = \frac{1}{\sqrt{2\pi}}\,e^{im_l\phi} \tag{14.9} $$

where $m_l$ is known as the magnetic quantum number.
To solve Eq. (14.7) it is customary to let $\mu = \cos\theta$. Upon making
this substitution, Eq. (14.7) becomes

$$ \frac{d}{d\mu}\left[(1 - \mu^2)\frac{d\Theta}{d\mu}\right] + \left[l(l+1) - \frac{m_l^2}{1 - \mu^2}\right]\Theta = 0 \tag{14.7a} $$

for which finite, well-behaved solutions are found only if $l$ is an integer
equal to or greater than $|m_l|$. The integer $l$ is called the orbital (or
azimuthal) quantum number. The solutions of Eq. (14.7a) are the associated
Legendre polynomials; we write $\Theta_{lm_l}(\theta) = N_{lm_l}P_l^{m_l}(\cos\theta)$, where $N_{lm_l}$ is
the appropriate normalization constant and $P_l^{m_l}$ is the associated Legendre
polynomial for the chosen $l$ and $m_l$. Values of $P_l^{m_l}(\cos\theta)$ for a few values
of $l$ and $m_l$ are listed below.


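$P_0^0 = 1$                                               $P_1^1 = \sin\theta$
$P_1^0 = \cos\theta$                                      $P_2^1 = 3\sin\theta\cos\theta$
$P_2^0 = \tfrac{1}{2}(3\cos^2\theta - 1)$                 $P_2^2 = 3\sin^2\theta$
$P_3^0 = \tfrac{1}{2}(5\cos^3\theta - 3\cos\theta)$       $P_3^1 = \tfrac{3}{2}\sin\theta\,(5\cos^2\theta - 1)$
$P_3^2 = 15\sin^2\theta\cos\theta$                        $P_3^3 = 15\sin^3\theta$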
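The tabulated functions can be checked against the polar equation, Eq. (14.7), written as $\Theta'' + \cot\theta\,\Theta' + [l(l+1) - m_l^2/\sin^2\theta]\Theta = 0$. The following is only a sketch: the derivatives are approximated by central finite differences, and the names `TABLE` and `residual` and the step size `h` are choices made for this illustration.

```python
import math

# Associated Legendre functions from the table, keyed by (l, m_l).
TABLE = {
    (0, 0): lambda t: 1.0,
    (1, 0): lambda t: math.cos(t),
    (1, 1): lambda t: math.sin(t),
    (2, 0): lambda t: 0.5 * (3 * math.cos(t) ** 2 - 1),
    (2, 1): lambda t: 3 * math.sin(t) * math.cos(t),
    (2, 2): lambda t: 3 * math.sin(t) ** 2,
    (3, 0): lambda t: 0.5 * (5 * math.cos(t) ** 3 - 3 * math.cos(t)),
    (3, 1): lambda t: 1.5 * math.sin(t) * (5 * math.cos(t) ** 2 - 1),
    (3, 2): lambda t: 15 * math.sin(t) ** 2 * math.cos(t),
    (3, 3): lambda t: 15 * math.sin(t) ** 3,
}


def residual(l, m, theta, h=1e-4):
    """Left side of Eq. (14.7) with derivatives taken by central differences."""
    f = TABLE[(l, m)]
    first = (f(theta + h) - f(theta - h)) / (2 * h)
    second = (f(theta + h) - 2 * f(theta) + f(theta - h)) / h ** 2
    bracket = l * (l + 1) - m * m / math.sin(theta) ** 2
    return second + first / math.tan(theta) + bracket * f(theta)


# Largest residual over a few sample angles away from the poles.
worst = max(
    abs(residual(l, m, theta))
    for (l, m) in TABLE
    for theta in (0.4, 1.0, 1.7, 2.5)
)
print(f"largest residual = {worst:.2e}")
```

A residual that is small compared with the size of the individual terms indicates that each entry satisfies the equation to within the finite-difference error.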


Next we consider the solution of Eq. (14.8) with attention focused
on bound states ($E$ negative). In Appendix 14A it is shown that bound-state
wave functions are possible only for energies given by

$$ E_n = -\frac{m_r e^4 Z^2}{32\pi^2\epsilon_0^2\hbar^2 n^2} = -\frac{m_r e^4 Z^2}{8\epsilon_0^2 h^2 n^2} = -\frac{13.6\,Z^2}{n^2}\ \text{eV} \qquad n = 1, 2, 3, \ldots \tag{14.10} $$

which are just the values previously found by Bohr. In addition, any
positive value of $E$ is allowed.
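The bound-state energies of Eq. (14.10) are easy to evaluate numerically. The sketch below assumes hand-entered SI values for the constants (they are not taken from the text) and uses an illustrative function name, `bohr_energy_ev`.

```python
# Bound-state energies E_n = -m_r e^4 Z^2 / (8 eps0^2 h^2 n^2), Eq. (14.10).
# SI constants entered by hand (assumed values for this sketch).
E_CHARGE = 1.602176634e-19     # C
EPS0 = 8.8541878128e-12        # F/m
H_PLANCK = 6.62607015e-34      # J s
M_E = 9.1093837015e-31         # kg
M_P = 1.67262192369e-27        # kg


def bohr_energy_ev(n, Z=1, nucleus_mass=M_P):
    """Energy of level n in eV for a one-electron atom of nuclear charge Z."""
    m_r = nucleus_mass * M_E / (nucleus_mass + M_E)   # reduced mass, Eq. (14.4)
    joules = -m_r * E_CHARGE ** 4 * Z ** 2 / (8 * EPS0 ** 2 * H_PLANCK ** 2 * n ** 2)
    return joules / E_CHARGE


for n in (1, 2, 3):
    print(n, round(bohr_energy_ev(n), 3))
```

For hydrogen the ground-state value comes out close to -13.6 eV, and the levels scale as $1/n^2$ and as $Z^2$.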
The radial wave function $R(r)$ for a given value of the principal (or
total) quantum number $n$ and the orbital quantum number $l$ is the product
of (1) a normalization constant, (2) $e^{-\alpha r/2}$, where $\alpha = 2Z/na_B$ ($a_B$ is
the Bohr radius), and (3) a polynomial in $\alpha r$. The complete wave function
$\psi(r,\theta,\phi)$ is accordingly given by $\psi_{nlm_l}(r,\theta,\phi) = N_{nlm_l}e^{-\alpha r/2}$ (polynomial
in $\alpha r$) (polynomial in $\cos\theta$) $e^{im_l\phi}$, where $N_{nlm_l}$ is the normalization constant
such that $\iiint \psi^*\psi \, dV = 1$ ($dV$ is a volume element).

As examples, the following relations give the wave functions for the
states of lowest energy in terms of the Bohr radius $a_B$ ($= 4\pi\epsilon_0\hbar^2/m_r e^2$):


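$n = 1$, $l = 0$, $m_l = 0$:
$$ \psi_{100} = \frac{1}{\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2} e^{-Zr/a_B} $$

$n = 2$, $l = 0$, $m_l = 0$:
$$ \psi_{200} = \frac{1}{2\sqrt{2\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\left(1 - \frac{Zr}{2a_B}\right) e^{-Zr/2a_B} $$

$n = 2$, $l = 1$, $m_l = 0$:
$$ \psi_{210} = \frac{1}{2\sqrt{2\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\frac{Zr}{2a_B}\, e^{-Zr/2a_B}\cos\theta $$

$n = 2$, $l = 1$, $m_l = \pm 1$:
$$ \psi_{21\pm1} = \frac{1}{4\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\frac{Zr}{2a_B}\, e^{-Zr/2a_B}\sin\theta\, e^{\pm i\phi} $$

$n = 3$, $l = 0$, $m_l = 0$:
$$ \psi_{300} = \frac{1}{9\sqrt{3\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\left[3 - \frac{2Zr}{a_B} + 2\left(\frac{Zr}{3a_B}\right)^2\right] e^{-Zr/3a_B} $$

$n = 3$, $l = 1$, $m_l = 0$:
$$ \psi_{310} = \frac{\sqrt{2}}{9\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\frac{Zr}{3a_B}\left(2 - \frac{Zr}{3a_B}\right) e^{-Zr/3a_B}\cos\theta $$

$n = 3$, $l = 1$, $m_l = \pm 1$:
$$ \psi_{31\pm1} = \frac{1}{9\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\frac{Zr}{3a_B}\left(2 - \frac{Zr}{3a_B}\right) e^{-Zr/3a_B}\sin\theta\, e^{\pm i\phi} $$

$n = 3$, $l = 2$, $m_l = 0$:
$$ \psi_{320} = \frac{1}{9\sqrt{6\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\left(\frac{Zr}{3a_B}\right)^2 e^{-Zr/3a_B}\,(3\cos^2\theta - 1) $$

$n = 3$, $l = 2$, $m_l = \pm 1$:
$$ \psi_{32\pm1} = \frac{1}{9\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\left(\frac{Zr}{3a_B}\right)^2 e^{-Zr/3a_B}\sin\theta\cos\theta\, e^{\pm i\phi} $$

$n = 3$, $l = 2$, $m_l = \pm 2$:
$$ \psi_{32\pm2} = \frac{1}{18\sqrt{\pi}}\left(\frac{Z}{a_B}\right)^{3/2}\left(\frac{Zr}{3a_B}\right)^2 e^{-Zr/3a_B}\sin^2\theta\, e^{\pm 2i\phi} \tag{14.11} $$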
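The normalization condition $\iiint \psi^*\psi \, dV = 1$ can be tested numerically for the low-lying hydrogenic states. The sketch below takes $Z = 1$ and measures $r$ in Bohr radii; the amplitudes are the standard hydrogenic forms, and the grid sizes, the cutoff radius, and the names `STATES` and `norm` are assumptions of this illustration. A simple midpoint rule is used, so the results are approximate.

```python
import math

SQRT_PI = math.sqrt(math.pi)

# Moduli of the wave functions for Z = 1 with r in Bohr radii (a_B = 1).
# The factor exp(i m_l phi) has unit modulus and is omitted.
STATES = {
    "100": lambda r, t: math.exp(-r) / SQRT_PI,
    "200": lambda r, t: (1 - r / 2) * math.exp(-r / 2) / (2 * math.sqrt(2) * SQRT_PI),
    "210": lambda r, t: (r / 2) * math.exp(-r / 2) * math.cos(t) / (2 * math.sqrt(2) * SQRT_PI),
    "211": lambda r, t: (r / 2) * math.exp(-r / 2) * math.sin(t) / (4 * SQRT_PI),
    "300": lambda r, t: (3 - 2 * r + 2 * (r / 3) ** 2) * math.exp(-r / 3) / (9 * math.sqrt(3) * SQRT_PI),
    "310": lambda r, t: math.sqrt(2) * (r / 3) * (2 - r / 3) * math.exp(-r / 3) * math.cos(t) / (9 * SQRT_PI),
    "320": lambda r, t: (r / 3) ** 2 * math.exp(-r / 3) * (3 * math.cos(t) ** 2 - 1) / (9 * math.sqrt(6) * SQRT_PI),
    "322": lambda r, t: (r / 3) ** 2 * math.exp(-r / 3) * math.sin(t) ** 2 / (18 * SQRT_PI),
}


def norm(name, r_max=60.0, n_r=1500, n_theta=90):
    """Midpoint-rule estimate of the integral of |psi|^2 over all space."""
    amp = STATES[name]
    dr = r_max / n_r
    dt = math.pi / n_theta
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        for j in range(n_theta):
            t = (j + 0.5) * dt
            total += amp(r, t) ** 2 * r * r * math.sin(t)
    # The integrand does not depend on phi, so that integral gives 2*pi.
    return 2 * math.pi * total * dr * dt


norms = {name: norm(name) for name in STATES}
for name, value in norms.items():
    print(name, round(value, 4))
```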


14.4 Probability Density and Charge-cloud Density 


The probability density corresponding to one of the wave functions
$\psi_{nlm_l}(r,\theta,\phi)$ is given by $\psi^*\psi$. The quantity $|\psi|^2 \, dV$ represents the
probability that if the electron could be located experimentally at any instant,
it would be found in the volume element $dV$. It is not possible, however,
to follow the electron in an orbital motion around the nucleus by means
of a succession of such observations as the astronomers follow the planets
around the sun; for one observation with a $\gamma$-ray microscope of sufficient
resolving power would suffice, because of the Compton effect, to eject the
electron from the atom.

There is no suggestion here of orbital motion, but often it is
convenient to imagine the electronic charge to be distributed in space as a sort
of charge cloud with a density

$$ \eta = e\psi^*\psi $$

Many effects of the atom on its surroundings are approximately the same
as if the atom actually contained a distribution of charge of density $\eta$.
Rather than following a fixed orbit, an electron is said to occupy an atomic
orbital determined by $\psi_{nlm_l}$.

Since $\phi$ appears in $\psi$ as $e^{im_l\phi}$, it is clear that $\psi^*\psi$ is not a function of $\phi$.
Consequently, when the electron is in one of its quantum states, the
probability density (or charge-cloud density) has cylindrical symmetry about
the chosen polar axis and $|\psi|^2 = R^2\Theta^2$. Polar graphs of the function
$\Theta_{lm_l}^2$ (Fig. 14.1) reveal that the variation with $\theta$, although symmetrical
relative to the plane $\theta = \pi/2$, is large unless $l = 0$. The latter leads to a
spherical distribution. The angular variation of charge density is of
great interest when one approaches the problem of binding between
atoms in terms of atomic orbitals.

For discussing the average radial variation in charge density let
$\mathcal{P}_r \, dr$ denote the total probability (and $\eta_r \, dr$ the numerical amount of
charge) contained between two spheres of radii $r$ and $r + dr$ drawn about
the nucleus as a center. The volume between the two spheres is proportional
to $r^2$. Hence, if $R_{nl}$ is separately normalized so that

$$ \int_0^\infty \mathcal{P}_r \, dr = \int_0^\infty r^2 R_{nl}^2 \, dr = 1 $$

then

$$ \mathcal{P}_r = r^2 R_{nl}^2 \qquad \text{and} \qquad \eta_r = e\mathcal{P}_r = er^2 R_{nl}^2 $$

Fig. 14.1 The probability-density function $|\Theta_{lm_l}|^2$ as a function of $\theta$ for $l$ = 0, 1,
2, 3, 4 (panels: s electrons, $l = 0$, $m_l = 0$; p electrons, $l = 1$, $m_l = 0, 1$; d electrons,
$l = 2$, $m_l = 0, 1, 2$; f electrons, $l = 3$, $m_l = 0, 1, 2, 3$; g electrons, $l = 4$,
$m_l = 0, 1, 2, 3, 4$). The length of a straight line from the origin to any point on a given curve
is proportional to the probability that the electron is in the direction of that line.
(The figures are not drawn to the same scale.)

Fig. 14.2 The radial function $rR_{nl}$ for three quantum states of hydrogen
as a function of $r$ in Bohr radii.

Fig. 14.3 The radial probability density $\mathcal{P}(r) = r^2R_{nl}^2$ for six quantum
states of hydrogen as a function of $r$ in Bohr radii. (The curve for the 2p
orbital is drawn twice for comparison.)
In Fig. 14.2 values of $rR_{nl}$ are plotted for three states for hydrogen,
and in Fig. 14.3 are shown the corresponding curves for $\mathcal{P}_r$ or $\eta_r$, which is
proportional to $r^2R_{nl}^2$. The areas under the latter curves would all be
equal if they were plotted to the same vertical scale. The curves are
labeled in the notation of the spectroscopists, the letters s, p, d, f, g, ...
being used to indicate $l$ = 0, 1, 2, 3, 4, ..., preceded by a number giving
the value of $n$; thus values of $l$ are designated by

$l$ =     0   1   2   3   4   5
letter    s   p   d   f   g   h

The reason for the choice of letters is stated in Sec. 17.2. All the radial
factors $R_{nl}$ show $n - l$ numerical maxima, with intervening points of zero
value if $n - l > 1$.

Although nothing in the wave functions suggests the former Bohr
orbits, a certain correspondence between them can be traced. In states
with $l = n - 1$, the maximum value of the radial charge density $\eta_r$ occurs
at a value of $r$ equal to the radius of the corresponding Bohr circle. In
these states, as is evident from Fig. 14.3, the density forms a single broad
hump; it is most widely distributed in the s states ($l = 0$). It is evident
from the formulas that for s states $\psi$ does not vanish at the nucleus
($r = 0$); this fact tends to give special properties to these states.

As $n$ increases, the curves for $\mathcal{P}_r$ spread out more and more, just as
the Bohr orbits did; the atom swells, so to speak. If $Z$ increases, on the
other hand, the atom shrinks in inverse ratio to $Z$.
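The correspondence with the Bohr circles for states with $l = n - 1$ can be sketched numerically. For these states the radial factor has the standard form $R \propto r^{n-1}e^{-Zr/na_B}$ (a known result assumed here, not derived in the text above), so the radial density is proportional to $r^{2n}e^{-2Zr/na_B}$. The code below locates its maximum on a grid, with $r$ in Bohr radii; the function names and the grid step are illustrative choices.

```python
import math


def radial_density(n, r, Z=1.0):
    """Unnormalized r^2 R^2 for the state with l = n - 1 (r in Bohr radii)."""
    return r ** (2 * n) * math.exp(-2.0 * Z * r / n)


def most_probable_radius(n, Z=1.0, step=1e-3, r_max=40.0):
    """Grid search for the radius at which the radial density is largest."""
    best_r, best_value = 0.0, 0.0
    r = step
    while r < r_max:
        value = radial_density(n, r, Z)
        if value > best_value:
            best_r, best_value = r, value
        r += step
    return best_r


for n in (1, 2, 3):
    print(n, round(most_probable_radius(n), 2))  # compare with n**2
```

The maxima fall at $n^2 a_B/Z$, the Bohr radii, and doubling $Z$ halves them, in line with the statement that the atom shrinks in inverse ratio to $Z$.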


14.5 Orbital Angular Momentum 


When the atom is in a definite quantum state, the probability density
$\Psi^*\Psi$ or $\psi^*\psi$ is everywhere constant in time. Thus it might be said in one
sense that the electron is not in motion. Certain phenomena suggest,
however, that, provided $l > 0$, the electron should be regarded as possessing
angular momentum about the nucleus; this momentum is associated
with magnetic effects as is that of the classical motion in an orbit.
Accordingly, for convenience, we shall speak of the electron as "moving" in the
field of the nucleus, and we sometimes refer to this motion as orbital. The
value of the angular momentum furnishes a useful means of distinguishing
between states that have the same energy.

When a rigid body is rotating about a fixed point, like a top about
its point, there will be a certain axis through the fixed point about which
the angular momentum of the body is a maximum. About any other
axis, inclined at an angle $\theta$ to the line of maximum angular momentum,
the angular momentum is equal to the maximum value multiplied by $\cos\theta$.
The angular momentum can be represented by a vector drawn in the
direction of the axis about which it is a maximum; the length of this vector
can be made to represent the magnitude of the maximum angular momentum,
and its direction is taken in the direction in which a right-handed
screw would advance along the axis while turning about it in the same
direction as the rotating body. The angular momentum about any
other axis drawn through the fixed point is then represented by the
component of this vector in the direction of that axis.

A similar treatment can be given to the angular momentum of a
moving particle. In classical theory, the electron can be regarded as
moving at every instant in a certain plane drawn through the nucleus;
the vector representing its angular momentum about the nucleus is
commonly drawn perpendicular to this plane (see Fig. 14.4). Let the orbital
angular momentum be represented by $\mathbf{A}_l$.

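Equation (13.17) gives the operator for the square of the angular
momentum in cartesian coordinates. If one makes the substitutions
$x = r\sin\theta\cos\phi$, $y = r\sin\theta\sin\phi$, and $z = r\cos\theta$, one obtains for
this operator

$$ \mathbf{A}_l \cdot \mathbf{A}_l = -\hbar^2\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right] \tag{14.12} $$

If this operator is applied to the hydrogen wave function

$$ \psi_{nlm_l} = R_{nl}\,\Theta_{lm_l}\,e^{im_l\phi} $$

we obtain

$$ \mathbf{A}_l \cdot \mathbf{A}_l\,\psi_{nlm_l} = \hbar^2\left[-\frac{1}{\Theta\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{d\Theta}{d\theta}\right) + \frac{m_l^2}{\sin^2\theta}\right]R\,\Theta\,e^{im_l\phi} $$

The quantity in the brackets is, however, just the right side of Eq. (14.6a),
which has the value $l(l + 1)$. Thus $\mathbf{A}_l \cdot \mathbf{A}_l\,\psi = \hbar^2 l(l + 1)\psi$, so that the
eigenvalues of $A_l^2 = \mathbf{A}_l \cdot \mathbf{A}_l$ are given by Eq. (13.18); $A_l^2 = l(l + 1)\hbar^2$,
and the magnitude of the orbital angular momentum $A_l$ has the possible
values

$$ A_l = \sqrt{l(l+1)}\,\hbar \qquad l = 0, 1, 2, \ldots \tag{14.13} $$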


According to quantum mechanics (and experiment!) the hydrogen elec- 
tron in its ground state or any other state with l = 0 has no orbital 
angular momentum! 

If the operator for angular momentum $(A_l)_z$ about the $z$ axis as
written in Eq. (13.16a) is applied to $\psi_{nlm_l}$, the result is

$$ (A_l)_z\,\psi_{nlm_l} = \frac{\hbar}{i}\frac{\partial}{\partial\phi}\psi_{nlm_l} = m_l\hbar\,\psi_{nlm_l} $$

Thus the wave function $\psi_{nlm_l}$ represents a state of the electron in the
nuclear field in which its angular momentum about the axis of polar
coordinates has the definite value

$$ (A_l)_z = m_l\hbar \tag{14.14} $$

The angular momentum $(A_l)_z$ about the polar axis ranges in integral
steps from a maximum of $l$ units, each of magnitude $\hbar$, in one direction
through zero to $l$ units in the opposite direction. This is illustrated in
Fig. 14.5 for the case in which $l = 2$, and the orbital angular momentum
is $\sqrt{6}\,\hbar$. The vector representing $\mathbf{A}_l$ is shown in five alternate positions
corresponding to $m_l = -2, -1, 0, 1, 2$. The orbital angular momentum
$\mathbf{A}_l$ is allowed to take on only such orientations relative to a field defining
the $z$ axis that the $z$ component of $\mathbf{A}_l$ is an integer $m_l$ times $\hbar$. The
relationships involved here are sometimes referred to as space quantization.
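Space quantization can be made concrete by listing, for a given $l \geq 1$, each allowed $m_l$ together with the angle between $\mathbf{A}_l$ and the $z$ axis, found from $\cos\theta = m_l/\sqrt{l(l+1)}$. This is a small sketch; the function name `orientations` is an illustrative choice.

```python
import math


def orientations(l):
    """Return (m_l, angle in degrees between A_l and the z axis) for l >= 1."""
    magnitude = math.sqrt(l * (l + 1))  # |A_l| in units of hbar, Eq. (14.13)
    return [
        (m, math.degrees(math.acos(m / magnitude)))
        for m in range(l, -l - 1, -1)
    ]


for m, angle in orientations(2):
    print(f"m_l = {m:+d}: {angle:6.2f} degrees")
```

Because $\sqrt{l(l+1)}$ always exceeds $l$, the vector never lies exactly along the axis; for $l = 2$ the smallest angle is about 35 degrees.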

If the operator representing a perpendicular component of the
vector angular momentum $\mathbf{A}_l$ is applied to $\psi_{nlm_l}$, the result is not just
$\psi_{nlm_l}$ multiplied by a constant. It is a general consequence of the
indeterminacy principle that, whenever one component of a vector angular
momentum has a definite value, other components do not. Measurement
of a perpendicular component for an atom in the state represented by
$\psi_{nlm_l}$ would yield one of several alternative multiples of $\hbar$ (provided
$l > 0$), with an average expectation of zero. This is an important novel
feature of wave mechanics; for in classical mechanics $\mathbf{A}_l$ would have a
fixed direction and all its components would have definite values (so
long as external forces did not act on the system). The classical vector
$\mathbf{A}_l$ might, however, have any orientation about the axis while its axial
component had a given value $m_l\hbar$. Thus a quantum state is the analog
not of one classical motion but of a class of motions. An absence of
fixed orientation and of fixed values for perpendicular components would
occur if the vector $\mathbf{A}_l$ were revolving about the axis at a fixed angle. It
sometimes assists the memory to imagine such a "precession" to be
occurring. It should not be forgotten, however, that in wave mechanics
the indefinitenesses in question hold even in the absence of all external
forces.

Fig. 14.5 For $l = 2$ the orbital-angular-momentum vector of magnitude
$\sqrt{6}\,\hbar$ must be oriented such that its $z$ component is $m_l\hbar$, where
$m_l$ = 2, 1, 0, -1, or -2.


14.6 Relativistic Effects and Electron Spin 


The theory described so far has been a nonrelativistic one suited to
cases of small energy only. Like newtonian mechanics, it requires
modification to bring it into harmony with the principles of relativity.
There are two principal correction terms, one associated with introducing
the relativistic expression for kinetic energy and another arising from
electron spin.

The first correction can be calculated approximately from a
perturbation approach, using the Schrödinger equation. If the momentum
$p$ is much less than $mc$, the relativistic kinetic energy $K$ is given by

$$ K = (p^2c^2 + m^2c^4)^{1/2} - mc^2 = \frac{p^2}{2m} - \frac{p^4}{8m^3c^2} + \cdots \tag{14.15} $$

When this value for $K$ is substituted in the hamiltonian of Sec. 13.6,
the term $-p^4/8m^3c^2$ is a perturbation term. When the operator $\hbar\nabla/i$
is substituted for $p$ and the three-dimensional analog of Eq. (13.41) is
evaluated, it is found that the relativistic correction $\Delta E_r$ must be added
to the unperturbed energy $E_n$, where

$$ \Delta E_r = E - E_n = -\frac{Rhc\,\alpha^2 Z^4}{n^3}\left(\frac{1}{l + \tfrac{1}{2}} - \frac{3}{4n}\right) \tag{14.16} $$

where $R$ is the Rydberg constant and $\alpha$ the fine-structure constant.
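The quality of the two-term expansion of the relativistic kinetic energy, $K \approx p^2/2m - p^4/8m^3c^2$, can be checked numerically. The sketch below works in units with $m = c = 1$ (an assumption of this example) and compares the exact expression with the truncated series; the function names are illustrative.

```python
import math


def kinetic_exact(p, m=1.0, c=1.0):
    """Relativistic kinetic energy sqrt(p^2 c^2 + m^2 c^4) - m c^2."""
    return math.sqrt(p * p * c * c + m * m * c ** 4) - m * c * c


def kinetic_series(p, m=1.0, c=1.0):
    """First two terms of the expansion for p much less than m c."""
    return p * p / (2 * m) - p ** 4 / (8 * m ** 3 * c * c)


for p in (0.01, 0.1):
    exact = kinetic_exact(p)
    approx = kinetic_series(p)
    print(f"p = {p}: exact = {exact:.12e}, series = {approx:.12e}")
```

For $p = 0.01\,mc$, roughly the scale of the electron momentum in hydrogen, the $p^4$ term is smaller than the leading term by a factor $p^2/4m^2c^2$, about $2.5 \times 10^{-5}$, which is why a perturbation treatment is adequate.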

In addition to this relativistic correction the spin of the electron,
introduced by Goudsmit and Uhlenbeck (Sec. 9.9) before the advent of
wave mechanics, appeared as an astonishing feature of a relativistic
quantum theory developed in 1928 by Dirac. He showed that the most
natural way to bring wave-mechanical theory into harmony with the
theory of relativity is to adopt quite a different wave equation and that
when this is done, the new equation leads automatically to effects
equivalent to those deduced from electron spin. It need not be assumed that
the electron is spinning or turning on its axis. According to this theory
of Dirac's, the electron does behave as if it had an internal angular
momentum of the sort just described and an associated magnetic moment.
Furthermore, if a wave packet is formed representing an electron, there
is in it, in general, something like a closed current, or eddy, of probability
that can be regarded as analogous to an actual spinning motion.

In the Dirac theory there are four wave functions for an electron
instead of one. The complete Dirac theory is seldom used, however, in
spectroscopic calculations. A simplified approximate form usually
suffices, in which only two functions, $\psi_\alpha$ and $\psi_\beta$, are used, one for each
direction of the spin moment. For a first approximation, the effect
of the spin magnetic moment upon the energy may be ignored. When
this is done, the separate wave functions for a one-electron atom may
be simply the functions described in the last section, except that a fourth
quantum number $m_s$ is now to be added; a time-free wave function
including spin may be denoted by the symbol $\psi_{nlm_lm_s}$. Since $m_s$ may
have either of two values, the effect of spin in this approximation is
to double the number of quantum states.
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14.7 The Spin-Orbit Interaction 


By virtue of its spin, the electron has a magnetic moment (Sec. 9.9)
of magnitude $\mu_s = -2(e/2m)\sqrt{(\tfrac{1}{2})(\tfrac{3}{2})}\,\hbar$. We may then think of the
electron as a small magnetic dipole of moment $\boldsymbol{\mu}_s$ directed opposite the
spin. For simplicity consider a one-electron atom with nuclear charge $Z$.
If the electron is moving relative to the nucleus, from the point of view
of a reference frame in which the electron is momentarily at rest, the
nucleus is moving with velocity $\mathbf{v}$ and produces therefore a magnetic
field $\mathbf{B}_l$ at the location of the electron. By Eq. (3.34) the magnetic field
seen by an electron by virtue of its own orbital motion through the
electric field $\mathbf{E}_N$ of the nucleus is

$$ |\mathbf{B}_l| = \left|\frac{1}{c^2}(\mathbf{v} \times \mathbf{E}_N)\right| = \frac{1}{emc^2}\,\frac{A_l}{r}\,\frac{dP}{dr} \tag{14.17} $$

where $P = -Ze^2/4\pi\epsilon_0 r$ and we have made use of the fact that the orbital
angular momentum $\mathbf{A}_l = \mathbf{r} \times m\mathbf{v}$.

The energy of a magnetic dipole in a magnetic field varies with
orientation, being least for the dipole moment parallel to the field. The
orientation energy, referred to as energy zero when the magnetic moment
and field are mutually perpendicular, is given by

$$ E_{ls} = -\mu B \cos(\boldsymbol{\mu},\mathbf{B}) = -\boldsymbol{\mu} \cdot \mathbf{B} $$

For the spinning electron in the field of the nucleus, we have by Eqs.
(9.21) and (14.17),

$$ E_{ls} = \frac{Ze^2}{4\pi\epsilon_0 m^2c^2r^3}\,\mathbf{A}_s \cdot \mathbf{A}_l \tag{14.18} $$

The calculation above is still incomplete because the electron is
being accelerated, and hence the frame of reference in which it is
momentarily at rest changes continually. It was shown by Thomas that this
circumstance reduces $E_{ls}$ to one-half of the value given by Eq. (14.18).
When this reduced value is introduced in the hamiltonian and the
perturbation calculation is made, it is found that there is a shift $\Delta E_{ls}$ in
energy due to the spin-orbit interaction of

$$ \Delta E_{ls} = \frac{e^2}{8\pi\epsilon_0 m^2c^2a_B^3}\,\frac{Z^4\,\mathbf{A}_s \cdot \mathbf{A}_l}{n^3\,l(l + \tfrac{1}{2})(l + 1)} \tag{14.19} $$

where $a_B$ is the Bohr radius ($l \neq 0$).
Fig. 14.6 As a result of the spin-orbit interaction, $\mathbf{A}_l$ and $\mathbf{A}_s$
precess around $\mathbf{A}_j$, which remains fixed in space in the absence
of torques acting on it.


The torque $\boldsymbol{\mu}_s \times \mathbf{B}_l$ does not change the magnitude of the spin
momentum $\mathbf{A}_s$ but alters its direction. The resultant angular momentum,
$\mathbf{A}_l + \mathbf{A}_s = \mathbf{A}_j$, remains fixed both in magnitude and in direction in
the absence of an imposed magnetic field; meanwhile the vectors $\mathbf{A}_l$ and
$\mathbf{A}_s$ precess around it (Fig. 14.6).


14.8 The Quantum Number j 


The resultant angular momentum $\mathbf{A}_j$ can be shown to possess only the
quantized values $j(j + 1)\hbar^2$ for its square, where $j = l \oplus s = l \pm \tfrac{1}{2}$.
Only positive values of $j$ are allowed, so that for $l = 0$, $j = \tfrac{1}{2}$ only. The
symbol $\oplus$ is used to represent a quantized vector addition such that
$a \oplus b$ has the values $a + b$, $a + b - 1$, ..., $|a - b|$. Thus the
resultant (spin + orbital) angular momentum for the electron has the
magnitude

$$ A_j = \sqrt{j(j+1)}\,\hbar \qquad \text{with } j = |l \pm \tfrac{1}{2}| \tag{14.20} $$

The $z$ component of $\mathbf{A}_j$ must have one of the quantized values

$$ (A_j)_z = m_j\hbar \qquad -j \le m_j \le j \tag{14.21} $$

where $m_j$ takes on the integrally spaced values $j$, $j - 1$, ..., $-j$.
Thus for $j = \tfrac{3}{2}$, $m_j = \tfrac{3}{2}, \tfrac{1}{2}, -\tfrac{1}{2}, -\tfrac{3}{2}$.
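The quantized addition rule and the allowed $m_j$ values lend themselves to a short enumeration. The sketch below uses exact fractions so that half-integers are not subject to rounding; the function names `allowed_j` and `allowed_mj` are illustrative.

```python
from fractions import Fraction


def allowed_j(l, s=Fraction(1, 2)):
    """Values l + s, l + s - 1, ..., |l - s| of the quantized sum of l and s."""
    top, bottom = l + s, abs(l - s)
    values = []
    j = top
    while j >= bottom:
        values.append(j)
        j -= 1
    return values


def allowed_mj(j):
    """Integrally spaced values j, j - 1, ..., -j."""
    return [j - k for k in range(int(2 * j) + 1)]


for l in (0, 1, 2):
    for j in allowed_j(l):
        print(f"l = {l}, j = {j}: m_j = {[str(m) for m in allowed_mj(j)]}")
```

For $l = 0$ only $j = \tfrac{1}{2}$ appears, while every $l > 0$ yields the two values $l + \tfrac{1}{2}$ and $l - \tfrac{1}{2}$.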

Before the spin-orbit interaction was included in the hamiltonian,
the quantum states of the single-electron atom could be specified in
terms of the four quantum numbers $n$, $l$, $m_l$, and $m_s$; for these states
fixed values are assigned to the $z$ components of $\mathbf{A}_s$ and $\mathbf{A}_l$ separately.
With the spin-orbit interaction taken into account the true quantum
states are characterized by a fixed value of $A_j$ and of $(A_j)_z$. The
appropriate four quantum numbers are then $n$, $l$, $j$, and $m_j$. The
indefiniteness of all components of $\mathbf{A}_l$ and $\mathbf{A}_s$ when $\mathbf{A}_j$ is quantized is an example
of a general principle. When the vector sum of two or more angular
momenta is quantized, the individual momenta cease to have any definite
components, although their squares retain definite magnitudes, for
example, $l(l + 1)\hbar^2$. This corresponds to the fact that in classical
mechanics the individual vectors may assume various directions without
disturbing their vector sum.

The energy depends now on $j$, as well as on $n$ and $l$. Thus each
$nl$ level with $l > 0$ is split by the spin-orbit effect into two. From the
fact that $\mathbf{A}_j = \mathbf{A}_l + \mathbf{A}_s$ we have $\mathbf{A}_j \cdot \mathbf{A}_j = (\mathbf{A}_l + \mathbf{A}_s) \cdot (\mathbf{A}_l + \mathbf{A}_s) = A_l^2 + A_s^2 + 2\mathbf{A}_s \cdot \mathbf{A}_l$, or

$$ \mathbf{A}_s \cdot \mathbf{A}_l = A_sA_l\cos(\mathbf{A}_s,\mathbf{A}_l) = \tfrac{1}{2}(A_j^2 - A_l^2 - A_s^2) = \frac{j(j+1) - l(l+1) - s(s+1)}{2}\,\hbar^2 \tag{14.22} $$

Substitution of Eq. (14.22) into Eq. (14.19) leads to the result that the
shift $\Delta E_{ls}$ in energy level due to the spin-orbit interaction is ($l \neq 0$)

$$ \Delta E_{ls} = \frac{e^2\hbar^2}{16\pi\epsilon_0 m^2c^2a_B^3}\,\frac{Z^4[j(j+1) - l(l+1) - s(s+1)]}{n^3\,l(l + \tfrac{1}{2})(l + 1)} \tag{14.23} $$

where the first fraction is $Rhc\,\alpha^2/2$, or $3.62 \times 10^{-4}$ eV. For states with
$l > 0$ the spin-orbit interaction splits each energy level into two, raising
the energy for $j = l + \tfrac{1}{2}$ and lowering it for $j = l - \tfrac{1}{2}$. To distinguish
between these levels, one commonly adds the value of $j$ as a subscript
to the letter used to denote $l$; thus the two levels arising from the splitting
of the 3p level are written $3p_{1/2}$ and $3p_{3/2}$.
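To get a feeling for the size of the splitting, the spin-orbit shift can be evaluated for a specific level. The sketch below takes the shift in the form $\Delta E_{ls} = (Rhc\,\alpha^2/2)\,Z^4[j(j+1) - l(l+1) - s(s+1)]/[n^3 l(l+\tfrac{1}{2})(l+1)]$, with the prefactor $3.62 \times 10^{-4}$ eV quoted in the text; the function name is an illustrative choice.

```python
PREFACTOR_EV = 3.62e-4  # R h c alpha^2 / 2 in eV, as quoted in the text


def spin_orbit_shift_ev(n, l, j, Z=1, s=0.5):
    """Spin-orbit energy shift in eV for a one-electron atom (l > 0 only)."""
    if l == 0:
        raise ValueError("the spin-orbit formula applies only for l > 0")
    bracket = j * (j + 1) - l * (l + 1) - s * (s + 1)
    return PREFACTOR_EV * Z ** 4 * bracket / (n ** 3 * l * (l + 0.5) * (l + 1))


upper = spin_orbit_shift_ev(2, 1, 1.5)   # 2p, j = 3/2 (raised)
lower = spin_orbit_shift_ev(2, 1, 0.5)   # 2p, j = 1/2 (lowered)
splitting = upper - lower

print(f"2p splitting in hydrogen = {splitting:.3e} eV")
```

For the 2p level of hydrogen the two shifts differ by about $4.5 \times 10^{-5}$ eV, which sets the scale of the fine structure.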

Since s states have no orbital angular momentum (l = 0), there is 
no spin-orbit correction for an s level. However, in the relativistic 
theory the wave equation contains other small terms besides those 
giving rise to the spin-orbit effect. These terms cause the energy to 
vary somewhat with l; in particular, there is a so-called special spin 
correction for l = 0 states. As a consequence of the corrections neglected 
here, the fine structure of the hydrogen levels is complex. It has been 
the subject of intensive experimental and theoretical research which has 
been fruitful in elucidating several subtleties of nature. 


14.9 Spatial Degeneracy of the Wave Function 


For a particle in free space with no force field defining the $z$ direction,
the axis for the polar coordinates can be drawn in any desired
direction. Thus the set of quantum states and of associated wave functions
is in some degree a matter of arbitrary choice. This arbitrariness of
the wave functions is another aspect of the general phenomenon of
degeneracy, and its mathematical aspects deserve a short further
discussion.

Whenever several wave functions are associated with the same
value of the energy $E$, any linear combination of them is another
possible wave function for the same energy. Thus, let $\psi_1, \psi_2, \psi_3, \ldots, \psi_k$
be solutions of Eq. (14.3a) for $E = E_n$. Then the linear
combination $\psi'$ where

$$ \psi' = c_1\psi_1 + c_2\psi_2 + c_3\psi_3 + \cdots + c_k\psi_k \tag{14.24} $$

$c_1, c_2, c_3, \ldots, c_k$ being any constants, is also a solution. Thus $\psi'$
is as valid a wave function to represent a quantum state for the energy
$E_n$ as is any one of the functions $\psi_1, \psi_2, \ldots, \psi_k$. The possible
wave functions associated with $E_n$ are thus infinite in number. It is
possible, however, to find a number $k$ of them in terms of which all
others can be written as linear combinations. These $k$ functions can
be chosen in infinitely many ways, but their number is always $k$.
Degeneracy of the quantum states has an exact analog in classical
theory. For example, a drumhead can vibrate at a certain one of its
higher frequencies with any diameter as a nodal line, such as $A_1$, $B_1$,
$B_2$, etc., in Fig. 14.7. The mode of vibration at this frequency is
thus not unique. It is possible, however, to choose any two modes
as fundamental modes, for example, $A_1$ and $A_2$ or $B_1$ and $B_2$, and to
write the displacement during any vibration at the frequency in
question as resulting from the superposition of vibrations of suitable
amplitudes in the two chosen fundamental modes. The type of
vibration that actually occurs in a given case depends upon the manner
in which the drumhead is set into vibration. Similarly, when an atom
is in a degenerate state characterized by a definite energy, the exact
quantum state or mixture of states that it occupies depends upon its
past history.

Fig. 14.7 Possible nodal lines of a vibrating drumhead.

In the case of the one-electron atom, if the axis of polar coordinates
is taken in a different direction, the factor $R_{nl}(r)$ in the expression
for $\psi_{nlm_l}$ is the same as before, but the geometrical significance of the
directional factor $e^{im_l\phi}P_l^{m_l}(\cos\theta)$ is different. It can be shown,
however, that the new directional factor can be written as a linear
combination of the directional factors relative to the original axis and, in fact,
in terms of those factors, $2l + 1$ in number, which are labeled with
the same value of $l$.

The infinite variety of the quantum states for given $n$ and $l$
corresponds to the infinity of different possible orientations of a Bohr
orbit, and it is closely related to the isotropy of space.


Appendix 14A - Solutions of the Radial Equation 


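To solve Eq. (14.8) for $R(r)$, it is convenient to set

$$ \alpha = \frac{\sqrt{-8m_rE}}{\hbar} \qquad \lambda = \frac{Ze^2}{4\pi\epsilon_0\hbar}\sqrt{\frac{m_r}{-2E}} \qquad \rho = \alpha r $$

Then Eq. (14.8) becomes

$$ \frac{d^2R}{d\rho^2} + \frac{2}{\rho}\frac{dR}{d\rho} + \left[\frac{\lambda}{\rho} - \frac{1}{4} - \frac{l(l+1)}{\rho^2}\right]R = 0 \tag{14A.1} $$

First we seek an asymptotic solution for $\rho \to \infty$. In this case

$$ \frac{d^2R}{d\rho^2} - \frac{R}{4} = 0 \qquad \text{and} \qquad R(\rho) = B_1e^{\rho/2} + B_2e^{-\rho/2} $$

Since the positive exponent leads to $R \to \infty$ as $\rho \to \infty$, we set $B_1 = 0$
and assume that $R(\rho)$ can be written in the form

$$ R(\rho) = e^{-\rho/2}F(\rho) \tag{14A.2} $$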
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where F(p) must satisfy the equation 


OF E ENET [5 pals: 2 |F =0 (144.3) 


dp? * \p dp p p 
One can solve this equation by the power-series method, assuming that 

F(p) = p(t + ip + a? + - - -) = p bs a;p? (144.4) 

j=0 

where p° appears ahead of the summation to take care of the singularity 
atp = 0. Substitution of Eq. (14A.4) into Eq. (14A.3) yields 

$$\sum_{j=0}^{\infty}\left[(s+j)(s+j-1)a_j\rho^{s+j-2} + 2(s+j)a_j\rho^{s+j-2} - (s+j)a_j\rho^{s+j-1} + (\lambda-1)a_j\rho^{s+j-1} - l(l+1)a_j\rho^{s+j-2}\right] = 0$$

or

$$[s(s+1) - l(l+1)]a_0\rho^{s-2} + \sum_{j=1}^{\infty}\left\{[(s+j)(s+j+1) - l(l+1)]a_j - (s+j-\lambda)a_{j-1}\right\}\rho^{s+j-2} = 0$$

Since the coefficient of each power of $\rho$ must vanish separately,
it follows that

$$s(s+1) - l(l+1) = 0 \qquad \text{indicial equation} \tag{14A.5a}$$

$$(s+j-\lambda)a_{j-1} = [(s+j)(s+j+1) - l(l+1)]a_j \tag{14A.5b}$$

The indicial equation requires that $s = l$; the solution $s = -(l+1)$
is excluded since $s$ must not be negative if $R$ is not to have a
singularity at $r = 0$. Therefore the recurrence relation (14A.5b) becomes

$$a_j = \frac{l+j-\lambda}{(l+j)(l+j+1) - l(l+1)}\,a_{j-1} \qquad j = 1, 2, 3, \ldots \tag{14A.6}$$


If $R$ is to go to zero at $r = \infty$, the series for $F(\rho)$ must terminate,
which occurs if $\lambda = l + j$. Thus, if $\lambda = n$ is an integer greater than $l$,
the series ceases after $n - l$ terms and $R$ goes to zero at infinity. But
$\lambda$ is defined to be

$$\lambda = \frac{Ze^2}{4\pi\epsilon_0\hbar}\sqrt{\frac{m_r}{-2E}}$$

Therefore, the Schrödinger equation is satisfied only for those values
of $E$ given by

$$E_n = -\frac{m_re^4Z^2}{32\pi^2\epsilon_0^2\hbar^2n^2} = -\frac{m_re^4Z^2}{8\epsilon_0^2h^2n^2} \tag{14A.7}$$


Insertion of this value of $E_n$ into the definition of $\alpha$ yields

$$\alpha_n = \left(\frac{8m_r^2e^4Z^2}{32\pi^2\epsilon_0^2\hbar^4n^2}\right)^{1/2} = \frac{2m_re^2Z}{4\pi\epsilon_0\hbar^2n} = \frac{2Z}{na_0} \tag{14A.8}$$

where $a_0$ is the Bohr radius ($= 4\pi\epsilon_0\hbar^2/m_re^2 = 0.529 \times 10^{-10}$ m).
The solutions $R_{nl}(r)$ of Eq. (14.8) consist of the product of (1) a
normalization constant $C_{nl}$, (2) $e^{-\alpha_nr/2}$, and (3) a polynomial in
$\alpha_nr$ of $n - l$ terms. This polynomial in turn is the product of
$(\alpha_nr)^l$ and the associated Laguerre polynomial $L_{n+l}^{2l+1}(\alpha_nr)$.
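
As a numerical check, the recurrence relation (14A.6) can be iterated directly. The following Python sketch is illustrative only: the function name `radial_series_coefficients` and the normalization $a_0 = 1$ are assumptions made here, not part of the text. It sets $\lambda = n$ and stops as soon as a coefficient vanishes, which should leave $n - l$ nonzero terms.

```python
from fractions import Fraction


def radial_series_coefficients(n, l):
    """Coefficients a_0, a_1, ... of F(rho)/rho**l from Eq. (14A.6) with lambda = n.

    Assumes the arbitrary normalization a_0 = 1; the constant C_nl is not computed.
    """
    if not (isinstance(n, int) and isinstance(l, int) and 0 <= l < n):
        raise ValueError("need integers with 0 <= l < n")
    coeffs = [Fraction(1)]
    j = 1
    while True:
        numerator = l + j - n                              # l + j - lambda
        denominator = (l + j) * (l + j + 1) - l * (l + 1)  # always positive for j >= 1
        a_j = Fraction(numerator, denominator) * coeffs[-1]
        if a_j == 0:                                       # series terminates at j = n - l
            break
        coeffs.append(a_j)
        j += 1
    return coeffs


if __name__ == "__main__":
    for n, l in [(1, 0), (2, 0), (2, 1), (3, 0), (3, 1)]:
        print(n, l, [str(c) for c in radial_series_coefficients(n, l)])
```

Exact rational arithmetic is used so that the terminating zero coefficient is detected without rounding error.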


Problems 


1. Show that the most probable value of r for the ground state of
hydrogen is the Bohr radius $a_0$ and that the expectation value of r is
$3a_0/2$.

2. Calculate the expectation values of the potential energy P and the 
kinetic energy K for a hydrogen atom in the ground state. Compare 
these with the values of P and K in the Bohr model. 

3. Show that the sum of the values of $\psi^*\psi$ for the six ($m_l = 1, 0, -1$;
$m_s = +\frac{1}{2}, -\frac{1}{2}$) 2p states of hydrogen is spherically symmetric, i.e., is a
function of r alone.

4. Classically how far could a hydrogen electron with the energy
characteristic of the ground state get from the proton? Show that the
quantum-mechanical prediction of finding the electron of a hydrogen
atom in the ground state at this classical limit or farther from the
proton is almost 25 percent (actually $13e^{-4}$).

5. Show that the operator for the square of the orbital angular momen- 
tum, which is given in Eq. (13.17) in cartesian coordinates, is given by 
Eq. (14.12) in spherical polar coordinates. 

6. Show that the commutator operator $\hat{A}_x\hat{A}_y - \hat{A}_y\hat{A}_x$ is equal to $i\hbar\hat{A}_z$.

7. Show that the hamiltonian operator for the one-electron atom
commutes with the operator for the z component of the angular
momentum; i.e., show that $\hat{H}(\hat{A}_l)_z - (\hat{A}_l)_z\hat{H} = 0$.

8. Show that for a Bohr orbit the ratio of the spin-orbit interaction
energy to the electrostatic potential energy P is of the order of
$|P/2mc^2| = |E_n/mc^2|$.

9. A solid cylinder of mass m and radius R has a charge q spread uni- 
formly over the curved sides of the cylinder. If this checker model of 
the electron rotates about its axis with angular velocity $\omega$, calculate the
gyromagnetic ratio. 

10. By operating directly on the function $f(\theta) = 5\cos^3\theta - 3\cos\theta$
with operators $\hat{A}_l^2$ and $(\hat{A}_l)_z$ in spherical coordinates, show that $f(\theta)$ is
an eigenfunction of both operators. Find the corresponding eigenvalues.

11. Show that $15\sin^2\theta\cos\theta\,e^{2i\phi}$ is an eigenfunction of the angular
momentum operators $\hat{A}_l^2$ and $(\hat{A}_l)_z$ by direct substitution in Eqs. (13.16)
and (14.12). What are the corresponding eigenvalues?

12. After a series of measurements on the n = 1 state of hydrogen the
angular part of the wave function has the form $\sqrt{3/4\pi}\,\sin\theta\sin\phi$.

(a) Compute $\langle(\hat{A}_l)_z\rangle$.

(b) Find the probability that a measurement of $(\hat{A}_l)_z$ yields the
value $\hbar$.


13. Show that if $pc \ll m_0c^2$, the relativistic correction [Eq. (14.16)]
is given by

$$\Delta E_r = -\frac{\hbar^4}{8m_0^3c^2}\int \psi_n^*\,\nabla^4\psi_n\,dv$$

Hint: Write the hamiltonian as $H = \sqrt{p^2c^2 + m_0^2c^4} - m_0c^2 + P$,
expand as a series in p, and apply first-order perturbation theory.

14. Use first-order perturbation theory to calculate the shift in the
ground state of the hydrogen atom due to the finite radius of the proton
if it is assumed that the proton charge is distributed uniformly
throughout a sphere of radius $a = 1.2 \times 10^{-15}$ m.

Ans: $\Delta E \approx 7 \times 10^{-9}$ eV


15. An electron is confined inside a sphere of radius a by a potential
$P(r) = 0$ for $r < a$ and $P(r) = \infty$ for $r > a$. Show that the
eigenfunctions for states with zero angular momentum are given by
$\psi(r) = \text{const}\,(\sin kr)/kr$. Determine the relation between k and the
energy, and show that the energy eigenvalues are $n^2h^2/8ma^2$.

16. If $P(r) = 0$ for $r < a$ and $P(r) = P_0$ for $r > a$, find the smallest
value of $P_0$ for which there is a bound state with no angular momentum
and energy $P_0$.

Ans: $h^2/32ma^2$

17. The three-dimensional-oscillator problem (Chap. 13, Prob. 10) can
also be solved in spherical polar coordinates. Take $P = \frac{1}{2}m\omega^2r^2$ and
solve the Schrödinger equation in these coordinates. What are the
eigenvalues for angular momentum?

18. The wave function $\psi_{211}$ represents a circulating current. Making
use of Eq. (12.41), calculate the magnetic moment associated with this 
current and show it is 1 Bohr magneton. 

19. Set up and solve the Schrödinger equation in terms of r and $\phi$
for the two-dimensional analog of the hydrogen atom, taking the
potential energy to be $P = -e^2/4\pi\epsilon_0r$. Show that the radial distribution
function has its maximum at one-fourth of the Bohr radius and that the 
ground-state energy of the planar atom is 4 times that of the hydrogen 
atom. [See Zaslow and Zandler, Am. J. Phys., 35: 1118 (1967).] 

20. When an electron moves in an “orbit” around a nucleus, the axes
of a frame in which the electron is instantaneously at rest appear to
precess relative to a frame fixed to the nucleus with an angular
frequency $\boldsymbol{\omega}_T = -\mathbf{v} \times \mathbf{a}/2c^2$, where v is the velocity and a the
acceleration of the electron relative to the nucleus. [This is the
Thomas precession (Sec. 14.7).] Derive this equation. Hint: Consider
three inertial frames: S, fixed on the nucleus; S′, fixed on the electron
and moving with speed v along the x axis of S; and S″, moving with
speed a dt along the y axis of S′.


chapter fifteen 


Atomic Structure


In Chap. 14 we considered atoms containing a single electron. We
now discuss properties of atoms (and molecules) that contain two or more
electrons. The Rutherford-Bohr model of the atom was built out of raw
material from the field of physics. Originally, however, the atom was the
child of chemistry. The ultimate theory of atomic structure must explain
the facts of chemistry, which are just as cogent as the facts of physics. In
the present chapter, we deal with the periodic table of elements; in later
chapters we return to the study of spectra. The two topics are closely
related, for wave mechanics furnishes the key to the theoretical
understanding of both.


15.1 The Pauli Exclusion Principle


If there were no repulsion between the electrons, the theory of complex
atoms would be very simple. Each electron, being subject only to the
field of the nucleus, might be in the same sort of electronic state as if it
were alone in the atom. Actually, however, the effect of electronic
repulsion must be large, particularly for an electron relatively far from
the nucleus. However, as a first approximation we may assume that the
effect on any one electron of all the other electrons is to reduce the
effective charge below the nuclear charge Ze. The total negative charge
on all electrons in a neutral atom is numerically equal to the positive
charge on the nucleus, so that at points far outside the atom the electrons
screen off the nuclear field and the net field is zero. Inside the atom,
as the nucleus is approached, the screening becomes less and less, until
near the nucleus the field of force becomes essentially that due to the
nucleus alone. Even the innermost electrons, however, are affected
by the presence of the others, for the work required to remove any
electron to infinity is considerably diminished by the repulsion of the others.

Suppose, for example, that the total negative charge on the electrons
were spread out continuously with spherical symmetry, its (negative)
density $\rho$ being a function only of the distance r from the nucleus.
Then, by a familiar result in electrostatics, the charge in a shell of radius
$r_1$ and thickness $dr_1$ (Fig. 15.1), of magnitude $4\pi\rho r_1^2\,dr_1$, would produce
a potential of magnitude $(\rho r_1^2\,dr_1)/\epsilon_0r$ at any external point r and a
corresponding electric field. Inside the shell it would produce no
electric vector but only a uniform potential $(\rho r_1^2\,dr_1)/r_1\epsilon_0$. The total
electrostatic potential at a distance r from the nucleus would thus be


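$$V(r) = \frac{Ze}{4\pi\epsilon_0r} + \frac{1}{\epsilon_0r}\int_0^r \rho r_1^2\,dr_1 + \frac{1}{\epsilon_0}\int_r^\infty \rho r_1\,dr_1 \tag{15.1}$$

Here e is the numerical electronic charge; Ze is the positive charge on
the nucleus and is related to $\rho$ by the equation $\int_0^\infty 4\pi\rho r_1^2\,dr_1 = -Ze$.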
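
The two integrals in Eq. (15.1) are easy to evaluate numerically. The Python sketch below is an illustration under stated assumptions, not the method of the text: it works in reduced units where $e/4\pi\epsilon_0 = 1$ and lengths are in Bohr radii, it replaces the infinite upper limit by a finite cutoff `r_max`, and it uses a hypothetical charge cloud (a single electron in a hydrogen 1s state around Z = 1) chosen because the result is known in closed form, $(1 + 1/r)e^{-2r}$ in these units. The function names are invented for this example.

```python
import math


def trapezoid(f, a, b, steps=4000):
    """Composite trapezoidal rule for the integral of f from a to b."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for k in range(1, steps):
        total += f(a + k * h)
    return total * h


def screened_potential(r, Z, rho, r_max=30.0):
    """Eq. (15.1) in reduced units where e/(4*pi*eps0) = 1, so that 1/eps0 = 4*pi.

    rho(r1) is the (negative) charge density in units of e per cubic Bohr radius;
    r_max is an assumed cutoff standing in for the infinite upper limit.
    """
    inner = trapezoid(lambda r1: rho(r1) * r1 ** 2, 0.0, r)   # charge inside radius r
    outer = trapezoid(lambda r1: rho(r1) * r1, r, r_max)      # shells outside radius r
    return Z / r + 4.0 * math.pi * inner / r + 4.0 * math.pi * outer


def hydrogen_1s_density(r1):
    """Illustrative cloud: one electron in a 1s state, total charge -1."""
    return -math.exp(-2.0 * r1) / math.pi


if __name__ == "__main__":
    for r in (0.01, 1.0, 3.0, 10.0):
        print(r, screened_potential(r, 1, hydrogen_1s_density))
```

Close to the nucleus the result approaches the bare nuclear value Z/r, and far outside the neutral atom it falls to zero, which is the screening behavior described above.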


Fig. 15.1 A shell of radius $r_1$ and thickness $dr_1$ centered on the nucleus.


Fig. 15.2 The Hartree field and charge-cloud density for argon, plotted
against r in Bohr radii ($a_0 = 0.529 \times 10^{-10}$ m). The potential
energy of an additional electron inside the argon atom would be
$-e^2Z_p/4\pi\epsilon_0r$, where $Z_p$ is shown in the lower curve. K, L, M indicate the
approximate locations of the n = 1, 2, and 3 shells, respectively.


This classical picture suggested the following simplified wave- 
mechanical approach to the problem, which has been very useful. The 
total electronic charge is imagined to be spread out in a spherically sym- 
metrical charge cloud analogous to that described for one electron in 
Sec. 14.4, and the combined field of potential P(r) due to the nucleus 
and to this electronic charge is calculated by Eq. (15.1). As an example, 
the calculated radial density nr of the cloud charge for a neutral argon 
atom is plotted in Fig. 15.2; P is also plotted in a certain manner, as 
explained under the figure. 

The field as thus obtained may be used as it stands in calculating the 
deflection of an external electron such as a cathode ray passing through 
the atom. An internal electron, however, will in reality be repelled only 
by the other electrons, not by itself. To correct for this fact, we omit the 
part of the charge cloud that is due to the internal electron under 
consideration. 

For a first approximation to serve as the starting point for perturba- 
tion theory, the electrons are then assumed to move without repelling each 
other in this modified central field, the average effect of their repulsion 
having been included in the field. Because of the spherical symmetry 
of the field, the electronic wave functions are almost as simple as in the 
hydrogen atom. In classical mechanics, the vector angular momentum 
would remain constant. Analogously, in wave mechanics the electronic
states are characterized by the same angular-momentum properties
as in a one-electron atom; the wave functions contain, in fact, the same
mathematical functions of $\theta$ and $\phi$ that are represented in Eqs. (14.11)
for a single electron, only the radial factor $R_{nl}$ being different. The
radial factor is still characterized, however, by a number n, and, as in
a coulomb field, it has n − l numerical maxima, separated by zeros
if n − l > 1. The electronic states may thus be characterized by four
quantum numbers $n$, $l$, $m_l$, and $m_s$, where:


1. $n$ is the principal (or radial) quantum number, which is restricted to positive
integer values 1, 2, 3, 4, etc.

2. $l$ is the orbital (or azimuthal) quantum number, which may take on the values
0, 1, 2, ..., n − 1. The orbital angular momentum of the electron has
the magnitude $A_l = \sqrt{l(l+1)}\,\hbar$.

3. $m_l$ is the magnetic quantum number and takes on the positive and negative
integer values l, l − 1, ..., 0, ..., −l. The z component of the orbital
angular momentum is given by $(A_l)_z = m_l\hbar$.

4. $m_s$ is the spin quantum number, which is +½ or −½. The z component of
the spin angular momentum of the electron is $m_s\hbar$.


An assignment of all electrons to electronic quantum states will 
then define, in the zero order of perturbation theory, a quantum state 
for the atom as a whole. In assigning the electrons, however, account 
must be taken of a most important principle enunciated by W. Pauli in 
1925 before the development of wave mechanics. In its early formula- 
tion the Pauli exclusion principle is: 


No two electrons in any one atom can be in the same quantum state.


Since each electronic quantum state may be characterized by four 
quantum numbers $n$, $l$, $m_l$, and $m_s$ (or $n$, $l$, $j$, and $m_j$), we may say:


No two electrons in any one atom may have the same four quantum numbers.


For discussing atomic structure it is convenient to use the restric- 
tive form of exclusion principle presented above. However, it is pos- 
sible to formulate the Pauli principle in a much more general form. In 
quantum mechanics identical quantum particles, such as electrons, can- 
not be distinguished from one another if they occupy a region such that 
their wave functions overlap. For example, within a helium atom we 
cannot say “electron 1 is here, and electron 2 is there.” The wave 
function for the system predicts the probability of finding an electron 
here and another one there, but it cannot predict which electron is which. 
This wave function depends on the general coordinates $x_i$ and $x_j$ of the
electrons, where $x_i$ includes the x, y, z, and spin coordinates. Since the
probability distribution is unchanged by exchanging the two electrons,
we have

$$|\Psi(x_i, x_j)|^2 = |\Psi(x_j, x_i)|^2$$

from which we have

$$\Psi(x_i, x_j) = \pm\Psi(x_j, x_i)$$


It is an experimental fact that for all spin-½ particles (electrons,
protons, neutrons, muons, etc.) the negative sign must be chosen, 
while for particles with zero or integer spin the positive sign must be 
chosen. In wave-mechanical language the Pauli exclusion principle 
may be stated in the form: 


The wave function for the state of a system containing two or more electrons must
be completely antisymmetric in the generalized coordinates of the electrons; i.e., the sign
of the wave function must change if any two electrons are interchanged.
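
The link between antisymmetry and the earlier form of the exclusion principle can be seen in a small numerical sketch. The Python code below is only an illustration: the single-particle functions `phi_0` and `phi_1` are hypothetical one-dimensional functions chosen for convenience, spin is not modeled, and the helper name `antisymmetrize` is invented here. It builds an antisymmetric two-electron function from two single-particle states, shows that it changes sign under exchange, and shows that it vanishes identically when both electrons are assigned to the same state.

```python
import math


def antisymmetrize(phi_a, phi_b):
    """Return Psi(x1, x2) = [phi_a(x1) phi_b(x2) - phi_a(x2) phi_b(x1)] / sqrt(2)."""
    def psi(x1, x2):
        return (phi_a(x1) * phi_b(x2) - phi_a(x2) * phi_b(x1)) / math.sqrt(2.0)
    return psi


# Hypothetical single-particle functions, used only for illustration.
def phi_0(x):
    return math.exp(-x * x / 2.0)


def phi_1(x):
    return x * math.exp(-x * x / 2.0)


if __name__ == "__main__":
    psi = antisymmetrize(phi_0, phi_1)
    print(psi(0.3, 1.2), psi(1.2, 0.3))   # equal magnitude, opposite sign
    same = antisymmetrize(phi_0, phi_0)
    print(same(0.3, 1.2))                 # zero: both electrons in one state
```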


The wave-mechanical treatment of complex atoms is beyond the scope 
of this book, but in Appendix 15A some important features of that topic 
are illustrated, and a way of introducing the spin coordinate is presented. 


15.2 Shells and Subshells 


In a neutral atom in its ground state the Z electrons occupy the Z 
lowest energy states available. The energy of an electron in the modified 
central field depends, as in a coulomb field, chiefly upon n, but in this 
more general case it varies also somewhat with l, increasing (becoming 
less negative) as l is increased because of the penetrating orbit effect 
(Sec. 15.6). For the moment we shall ignore the fine-structure splitting 
of the (n,l) energy levels; in this approximation the energy does not 
depend on $m_l$ and $m_s$.

The electrons in an atom normally fall into distinct groups differing 
in the value of the quantum number n. Electrons having the same n 
are said to belong to the same shell. Each shell is divided into
subshells according to the values of l. Any subshell may contain up to
2(2l + 1) electrons, but no more, since $m_l$ can range from the value l
down through zero to −l, making 2l + 1 different values; $m_s$ may be
either +½ or −½. The numbers of electrons in the various types of closed
(or full) subshells, together with their symbols, are:


Symbol    s    p    d     f
Number    2    6    10    14


Thus the full 4d subshell in tungsten (with n = 4, l = 2) contains 10
electrons. The maximum possible number of electrons in a whole shell
of any atom is 2n². To specify a zero-order quantum state for the atom,
it suffices to specify a sufficient number of sets of the quantum numbers
$n$, $l$, $m_l$, $m_s$, with no two sets alike in all four numbers, to accommodate
all the electrons in the atom. For the normal, or lowest-energy, state
of the atom, the electrons must be assigned to states of the lowest energy
possible. The first two electrons go into 1s states with n = 1, l = 0,
$m_l = 0$, and $m_s = \pm\frac{1}{2}$. Since no other states are possible with n = 1,
the next two electrons go into 2s states, with much higher energy, and
the next six into 2p states, with slightly higher energy still. The next
two go into 3s states with considerably higher energy, and so on. A
specification of n and l for all the electrons bound to a nucleus is called
the electronic configuration of the atom (or ion).
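
The counting rules above can be checked by listing the allowed sets of quantum numbers explicitly. The following Python sketch is illustrative; the function names are invented here, and $m_s$ is stored as +1 or −1 to stand for +½ and −½.

```python
def subshell_states(n, l):
    """All (n, l, m_l, m_s) sets in one subshell; m_s = +1 or -1 stands for +1/2 or -1/2."""
    return [(n, l, m_l, m_s) for m_l in range(l, -l - 1, -1) for m_s in (+1, -1)]


def shell_states(n):
    """All quantum-number sets in the shell with principal quantum number n."""
    states = []
    for l in range(n):                 # l = 0, 1, ..., n - 1
        states.extend(subshell_states(n, l))
    return states


if __name__ == "__main__":
    letters = "spdf"
    for n in (1, 2, 3, 4):
        per_subshell = {f"{n}{letters[l]}": len(subshell_states(n, l)) for l in range(n)}
        print(n, per_subshell, len(shell_states(n)))
```

Each subshell list has 2(2l + 1) entries and each shell list has 2n² entries, with no two entries alike in all four numbers.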

The electrons in any completely filled shell or subshell contribute 
zero resultant angular momentum, both orbital and spin, to the angular 
momentum of the atom as a whole. In such a closed subshell, for 
every electron with z component of spin angular momentum $+\frac{1}{2}\hbar$
($m_s = +\frac{1}{2}$) there is another electron with z component $-\frac{1}{2}\hbar$. Thus the
resultant spin angular momentum for the closed subshell is zero.
Similarly, for every electron with z component of orbital angular momentum
$m_l\hbar$ there is one with z component $-m_l\hbar$. The resulting charge
distribution for a closed subshell is spherically symmetric as well. It is these
facts which make hydrogen-like wave functions a reasonable first
approximation for the many-electron atom. Any resultant orbital angular
momentum and spin angular momentum and any asymmetry in charge
distribution from the electron distribution in an atom arises from
electrons which are in partially filled subshells. The vast majority of the
electrons in many-electron atoms are in closed subshells.

The shells are often referred to by the capital letters which became 
attached to them in the early days of x-ray study. These designations 
are shown below and also, in part, in Table 15.1, in which the numbers 
and notation for the first three shells are summarized: 


n                    1    2    3    4    5    6    7
X-ray designation    K    L    M    N    O    P    Q


Table 15.1 Electron shells and subshells

Shell                              K     L          M
n                                  1     2          3
Subshell, l                        0     0    1     0    1    2
Letter designation                 s     s    p     s    p    d
Number of electrons in subshell    2     2    6     2    6    10
Number of electrons in shell       2     8          18


15.3 The Periodic Table


The concept of atomic number was firmly established by the work of 
Moseley and others on x-rays. An atomic number Z may be unambig- 
uously assigned to each element. Thus the periodic table, drawn up 
by the chemists primarily on the basis of chemical facts, is established 
with the assurance that there are no unknown gaps in it. The value of
Ze represents the charge on the nucleus; Z is also equal to the number
of electrons surrounding the nucleus when the atom is neutral. The
chemical properties of an element depend upon the number of the circum- 
nuclear electrons in a neutral atom. Some understanding of the periodic 
table may be obtained by determining the arrangement of the electrons 
in the atom and the physical and chemical properties that follow from 
this configuration. 

A bird’s-eye view of the chief features to be explained is best 
gained by studying the arrangement of atoms in the periodic table inside 
the back cover. A striking feature is the recurrence of a noble, or 
“inert,” gas, forming a series of turning points in the progression of the 
elements. It was pointed out by Rydberg that the values of Z for the 
noble gases could be expressed by a simple numerical series: 


$$Z = 2(1^2 + 2^2 + 2^2 + 3^2 + 3^2 + 4^2 + \cdots)$$


viz., helium, $2 \times 1^2 = 2$; neon, 10; argon, 18; krypton, 36; and xenon, 54.
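
Rydberg's series is simply a running sum, which the short Python sketch below reproduces. The function name is invented for this illustration; the only input taken from the text is the pattern of terms, $2 \cdot 1^2$ once and each later $2k^2$ twice.

```python
from itertools import accumulate


def noble_gas_atomic_numbers(count):
    """Partial sums of Z = 2(1^2 + 2^2 + 2^2 + 3^2 + 3^2 + 4^2 + ...)."""
    terms = [2 * 1 ** 2]               # the first term appears once
    k = 2
    while len(terms) < count:
        terms.extend([2 * k ** 2, 2 * k ** 2])   # every later term appears twice
        k += 1
    return list(accumulate(terms[:count]))


if __name__ == "__main__":
    print(noble_gas_atomic_numbers(5))   # [2, 10, 18, 36, 54]
```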

The noble gases show little tendency for their atoms to join with 
other atoms in chemical combination. The elements standing in the 
table on either side of a noble gas, on the other hand, are strongly active 
chemically and have contrasting properties. Those closely following a 
noble gas, like lithium, beryllium, potassium, calcium, and so on, are 
metallic and strongly electropositive; they readily form positive elec- 
trolytic ions. Furthermore, their maximum valence in chemical com- 
pounds is equal to the number of steps by which they lie beyond the 
noble gas, e.g., 1 for sodium, 2 for magnesium, 3 for aluminum; sodium 
forms univalent ions in solution; magnesium, bivalent; aluminum,
trivalent. The elements closely preceding a noble gas, on the contrary, are
electrically nonconducting, perhaps even gaseous, and strongly elec- 
tronegative; they tend to form negative electrolytic ions, alone or in 
combination with other atoms, and they exhibit chemical valence equal 
to the number of steps by which they precede the noble gas in the table. 
An element of either of these two kinds scarcely combines with another 
element of the same kind; but an element closely preceding a noble gas 
in the table combines readily with any element closely following a 
noble gas. 


These facts invite the conclusion that there is something very 
peculiar about the arrangement of the electrons of a noble gas. If we
make the reasonable assumption that the chemical activity of an atom 
is conditioned somehow upon the magnitude of its external electric 
field, we may conclude that the atoms of the noble gases must be sur- 
rounded by very weak fields. If so, there should be little tendency for 
the atoms of these gases to combine into molecules or to condense 
into liquid or solid form; it is a fact that the noble gases are composed 
of monatomic molecules and also have very low boiling and freezing 
points. 

The properties of the elements adjacent to the noble gases are then 
accounted for if we suppose that the arrangement of the outer electrons 
in an atom of a noble gas is an especially stable one, i.e., an arrangement 
of especially low energy. An atom of an element following a noble gas 
in the table will then contain one or two extra electrons outside a noble- 
gas core; e.g., sodium has one and magnesium two electrons outside of 
a neon core; and these extra electrons may well be comparatively easy 
to detach. The tendency of such atoms to form positive ions would thus 
be explained. Furthermore, in the solid state these extra valence electrons 
may easily come loose under the attraction of neighboring atoms, func- 
tioning, therefore, as free electrons, so that the elements in question 
ought to be good conductors of electricity, which they are. 

An element such as chlorine, on the other hand, could arrange its 
electrons as they are arranged in argon if it had one more electron. Ele- 
ments closely preceding a noble gas in the periodic table might exhibit 
a tendency to pick up an extra electron, thereby forming a negative 
ion. Some of these elements do, in fact, form negative ions that are 
more stable, i.e., have lower energy, than the neutral atom; this is true 
of the halogens and of oxygen and sulfur. In the solid state, such atoms 
would contain no electrons with a tendency to become free; thus, the 
absence of electrical conductivity in these elements would be accounted 
for. 

These ideas furnish an explanation for the formation of a compound 
such as sodium chloride. In combining, the sodium atom loses 1 elec- 
tron, its remaining 10 electrons then forming the stable configuration 
that is characteristic of neon (but, doubtless, somewhat more compressed 
because the nuclear charge of sodium is higher). The chlorine atom 
adds the electron lost by the sodium to its own 17, making 18 electrons 
arranged in the stable argon configuration (but slightly expanded). 
The electrostatic attraction of the two ions thus formed then binds them 
tightly together into a molecule. When the molecule of sodium chloride 
thus formed is put into water, the attraction of the ions is weakened and 
the molecules fall apart into the constituent ions, each with its outer 
group of electrons in the arrangement characteristic of an inert gas. In
a crystal of sodium chloride, also, the grouping into molecules disappears;
a crystal of this type is composed of ions but not of molecules. Thus
many chemical and physical facts can be correlated if we make the
assumption that the arrangement of the electrons occurring in a noble
gas is a peculiarly stable one of low energy.

One of the principal problems of an explanatory theory of the atom 
is to account for the high stability of certain electronic configurations.
Then an explanation must be found for the systematic sequences that 
occur in the periodic table between the noble gases. As the atomic weight 
increases, these sequences become longer and the elements of a sequence 
become more similar in chemical properties; many of the rare earths 
are even difficult to separate chemically. A successful theory should 
lead automatically to all these relations between chemical properties and 
atomic number on the basis of as few assumptions as possible. 


15.4 The First Two Periods 


The key to the periodic table was finally furnished by wave mechanics, 
with the help of the Pauli exclusion principle and the principle of elec- 
tron spin. It will now be shown, taking the elements in the order of 
their atomic numbers, that the electronic groupings as inferred from 
wave mechanics enable us to understand the sequence of the elements. 
The neutral atom will be under discussion.


a. Z = 1: Hydrogen A single electron in an atom is normally in a 1s
electronic quantum state. Degeneracy exists because of the two possible
values of $m_s$.


b. Z = 2: Helium Two electrons combined with a nucleus for which
Z = 2 can both go into 1s states, with quantum numbers 1, 0, 0, +½ and
1, 0, 0, −½. Because the electronic charge cloud in helium is spherically
symmetrical, there is almost no electric field outside the atom. Atoms
of helium should therefore exhibit comparatively little tendency to
associate themselves into a liquid or solid phase, in agreement with
observation. In helium the K shell is complete; the electronic
configuration is 1s². All heavier atoms will be expected to contain, next to
the nucleus, a complete K shell of this sort.


c. Z = 3: Lithium The third electron in lithium outside the helium-like
core occupies a state with n = 2. The energy associated with a 2s
wave function is somewhat less than that of a 2p function for an electron
in the atomic central field (see Sec. 15.6); hence, the electronic
constitution of normal lithium is 1s²2s. Since s means l = 0, we have $m_l = 0$,
but $m_s = \pm\frac{1}{2}$, so that again there is a twofold degeneracy in the ground
state of lithium.

The energy for a 2s wave function lies much higher than that for
a 1s function. If the field were a coulomb field, as for a one-electron atom,


Hence, the 2s electron should be comparatively easy to remove. Lithium
forms positive ions easily, and it conducts electricity when in the solid
state, the 2s electrons functioning as “free” electrons. Lithium
combines chemically with a valence of 1, as shown by such compounds as


more energy than to remove the 2s, or valence, electron. In harmony
with this conclusion, the first ionization potential of lithium is observed

d. Z = 4: Beryllium Two electrons outside the K shell can both be in
2s states but with opposite spins. The resulting element should be
bivalent, since both 2s electrons should come off relatively easily. Such is
the case. The oxide, hydroxide, and chloride have the formulas
BeO, Be(OH)₂, BeCl₂. The first two ionization potentials of beryllium
are 9.3 and 18.2 V. To remove a third electron out of the 1s shell
requires 154 eV, and to remove the last one, 218 eV.

Again the spectral evidence confirms the theory. The arc spectrum 
of beryllium is a two-electron spectrum of singlets and triplets like that 
of neutral helium. In the spark spectrum, on the other hand, doublet 
lines like those from neutral lithium are found; these are emitted by 
singly ionized beryllium atoms. A few singlet lines are also known in
the spark spectrum; they are part of a two-electron spectrum emitted 
by doubly ionized beryllium atoms. Lines have also been found which 
are ascribed to triply ionized atoms. 


e. Z = 5: Boron With five electrons in the atom, one goes into a 2p
state. Boron is a trivalent element, witness B₂O₃, B(OH)₃, BCl₃. It
is not metallic, however; crystals of boron are good insulators.
Evidently, with three electrons present in the L shell, conditions are not
favorable to the formation of free electrons in the solid state. To
remove the 2p electron from a boron atom requires only 8.3 eV; to
remove the two 2s electrons in succession requires 25 and 38 eV, whereas
to remove one of the 1s electrons as well requires an additional 259 eV.


The next elements in order are most easily understood if we pass 
them by for the moment and consider neon next. 


f. Z = 10: Neon It is possible to put into an atom two 2s electrons
and six 2p electrons, or eight in all, with n = 2. The L shell is then
filled. With every possible value of $m_l$ and $m_s$ represented, the
electronic charge cloud is symmetrical about the nucleus, as it is in helium;
and now the symmetry is sufficiently complete to result in a gas having
a low boiling point. Neutral neon in its normal state has the electronic
constitution 1s²2s²2p⁶.


g. Z = 9: Fluorine (1s²2s²2p⁵) If Z = 9, the neutral atom contains
seven electrons in the L shell, or one less than enough to fill it. If 
one more electron were added, we should have a negative ion the exterior 
of which would be a closed shell, as in neutral neon. It cannot be 
expected that the same loss of energy would occur when an electron is 
added to a neutral atom as when it is added to a positive ion; but, on the 
other hand, the electron is added in the shell for n = 2 in an atom having 
a fairly strong nuclear field (Z = 9). We can thus understand the fact 
that fluorine forms univalent negative ions which are stable, i.e., have 
lower energy than the neutral atom, and that it exhibits a negative 
valence of 1 in chemical combination. There is little tendency for a 
second extra electron to be bound by the fluorine atom, for it would 
have to occupy a state with n = 3, the energy of which would lie
considerably higher.


h. Z = 8, 7: Oxygen (Z = 8, 1s²2s²2p⁴) commonly exhibits a negative
valence of 2 in chemical combination, as in lithium oxide, Li₂O; and
nitrogen (Z = 7, 1s²2s²2p³) is commonly trivalent, as in lithium nitride,
Li₃N. Often nitrogen is united with oxygen into a compound as in
LiNO₃.


i. Z = 6: Carbon (1s²2s²2p²) With four electrons in the L shell,
which can accommodate eight, carbon is an element for which it becomes
questionable whether it is energetically favorable to lose or gain
electrons in interacting with other elements. A frequent compromise is to
share electrons in a covalent bond (Sec. 21.2). In the form of graphite,
carbon exhibits fair metallic conductivity, but in diamond it
is an excellent insulator.
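
The filling sequence described in this section for Z = 1 to 10 can be written out mechanically. The Python sketch below is illustrative only: the function name and the plain-text output format (occupation numbers written after the subshell label, as in `1s2 2s2 2p3`) are choices made here, and the subshell order 1s, 2s, 2p with capacities 2, 2, 6 is taken from the discussion above.

```python
# Subshells of the first two periods in order of increasing energy,
# with capacities 2(2l + 1).
SUBSHELLS = [("1s", 2), ("2s", 2), ("2p", 6)]


def ground_configuration(Z):
    """Electronic configuration of a neutral atom with 1 <= Z <= 10."""
    if not 1 <= Z <= 10:
        raise ValueError("this sketch covers only Z = 1 to 10")
    parts = []
    remaining = Z
    for label, capacity in SUBSHELLS:
        if remaining == 0:
            break
        occupied = min(capacity, remaining)   # fill the lowest subshell first
        parts.append(f"{label}{occupied}")
        remaining -= occupied
    return " ".join(parts)


if __name__ == "__main__":
    names = ["H", "He", "Li", "Be", "B", "C", "N", "O", "F", "Ne"]
    for Z, name in enumerate(names, start=1):
        print(Z, name, ground_configuration(Z))
```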


15.5 Remainder of the Periodic Table 


The second octet of elements, from Z = 11 to Z = 18, parallels closely
the first octet. Sodium (Z = 11; 1s²2s²2p⁶3s) contains a single valence
electron outside the neon core. Sodium is univalent, as in NaOH and the
familiar NaCl. Its arc spectrum is a typical one-electron spectrum. In
general, it resembles lithium closely. Magnesium (Z = 12) is a bivalent
element similar to beryllium. It burns with a brilliant white flame to
form the oxide MgO. It has an arc spectrum of singlets and triplets
resembling that of helium. Aluminum (Z = 13) is trivalent, like boron,
but it is metallic and an excellent conductor of electricity. The
sesquioxide, Al₂O₃, occurs in crystalline form as sapphire and ruby. Silicon
(Z = 14) is a good deal like carbon. The dioxide SiO₂, however, which
occurs in quartz, is a substance of extremely high melting point, whereas
the analogous compound, CO₂, is a gas! Only an elaborately refined
application of wave mechanics can explain contrasts such as these.
Phosphorus (Z = 15), although chemically much more active, forms
compounds analogous to those of nitrogen. In the poisonous gas
phosphine, or PH₃, it is trivalent, just as nitrogen is in gaseous ammonia,
NH₃. Sulfur (Z = 16) corresponds to oxygen. In H₂S, it is bivalent,
just as oxygen is in H₂O. Chlorine (Z = 17) is univalent and easily
forms negative ions in solution; in general, it resembles fluorine closely
but is less active. Finally, in argon (Z = 18), we reach again an atom
composed of complete subshells, with electronic formula 1s²2s²2p⁶3s²3p⁶.
Thus, argon has completed K and L shells and, outside these, two more
completed subshells. The M shell is not yet full, since there are no 3d
electrons. Nevertheless, the symmetry of the 3p⁶ configuration is
evidently enough to make the external field around argon very weak, so
that it acts as a noble gas with a very low boiling point (−186°C).

At Z = 19, we might expect the addition of an electron in a 3d
state (n = 3, l = 2). But the next element, potassium, closely resembles
sodium and lithium, not only in chemical properties but also in its
spectra even to the finest details. Since atomic spectra in the visible
region are emitted by electrons in the periphery of the atom, they
furnish valuable information concerning the state of the outermost
electrons. A careful wave-mechanical calculation confirms the
conclusions from chemistry and spectroscopy that the valence electron in
potassium is in a 4s rather than a 3d state. A qualitative discussion
of the reason appears in Sec. 15.6.

The next element, calcium (Z = 20), corresponds rather well to
magnesium in the preceding octet, but beyond this point the sequence becomes
quite different. Beginning with scandium (Z = 21, 1s²2s²2p⁶3s²3p⁶4s²3d)
the 3d subshell begins to fill, expanding the n = 3 shell from the
provisionally stable s²p⁶ octet into the complete group of 18 electrons.
Atoms with an incompletely filled d shell belong to the group known
as transition elements, of which several are of great economic and scientific
importance. They are somewhat alike in chemical properties, which are
largely determined by the outer 4s electrons. For all 3d transition
elements the 3d and 4s energy states are very close together. Indeed
the outer electron configuration for chromium (Z = 24) is 3d⁵4s. For
nickel (Z = 28) the configuration is 3d⁸4s², but from copper (Z = 29)
onward the 3d level remains below 4s.

Between gallium (Z = 31) and the noble gas krypton (Z = 36)
the 4p subshell is filled. As is the case after every noble gas, the next
element starts a new shell; rubidium (Z = 37) has an electron
configuration consisting of a krypton core and a 5s electron. After the 5s
subshell is filled, the 4d, 5p, and 6s levels are filled. Beginning at
cerium (Z = 58) the 14 possible 4f electrons are added in turn, inside
already completed 5s and 5p subshells. The exteriors of all these atoms
are closely similar, so that we have a group of chemically similar elements,
the rare earths.

The electronic configurations of the elements in their normal states 
are given in the Appendix. Also tabulated are the ionization energies 
for the neutral atoms of each element, the ionization energy being the 
energy required to remove the least tightly bound electrons from the 
atom. These ionization energies are plotted in Fig. 15.3. A systematic
relation is evident between the quantum numbers of the outermost
electrons and the ionization potentials, with the p⁶ configuration of the
noble gases giving a particularly high ionization potential and the single
s electron in a new shell, characteristic of the alkalis, giving particularly
low ionization potential.

Fig. 15.3 The ionization energy in electron volts as a function of atomic number.
(Reprinted from H. E. White, “Introduction to Atomic Spectra.” Copyright 1934.
McGraw-Hill Book Company. Used by permission.)


15.6 Penetrating Orbits 


We now turn to the question of why, for a given principal quantum
number n, the states of higher angular momentum (higher l) lie above
those of smaller angular momentum in a multielectron atom. To be
specific, let us discuss the atom sodium (Z = 11), for which 10 electrons
fill all possible states with n = 1 and 2. These electrons produce a
spherically symmetrical charge cloud about the nucleus, the net effect of
which is to reduce the effective charge providing the field in which the
eleventh electron moves. The probability of finding one of the inner 10
electrons between r and r + dr is shown by the shaded region in Fig. 15.4.
On the same figure are shown the quantum-mechanical predictions for the
radial charge density 4πr²ψ*ψ for the eleventh electron in a 3s, a 3p,
and a 3d state. The probability of finding a 3d electron inside the core
is extremely small; for a 3p electron the probability is larger; for a 3s
electron it is much larger still. Clearly, for an electron inside the core
the effective nuclear charge is substantially greater than for an electron
outside (for which $Z_{\mathrm{eff}} = 1$). If the electron lies almost
entirely within the stronger field, its associated energy is expected to
be lower. Such an argument is not entirely conclusive, however; for in
the mathematical theory the energies are determined by the boundary
condition at infinity. We must remember that as the electron penetrates
the core, its total energy is not decreased, but instead there is a
conversion of potential energy into kinetic. However, in a coulomb field
of strength Ze²/4πε₀r² the energies of the corresponding states are all
lower by a factor of Z² than for the coulomb field from a unit charge. It
is thus not unreasonable that the energy level should sink as the degree
of penetration into the core increases. That this is true is shown by
detailed quantum-mechanical calculations; further, the fact is thoroughly
supported by all the evidence of atomic spectroscopy.

Fig. 15.4 Calculated charge density in the neutral sodium atom for
three alternative states (3s, 3p, 3d) of the valence electron; the density
due to the core is shown shaded. (Reprinted from H. E. White,
“Introduction to Atomic Spectra.” Copyright 1934. McGraw-Hill Book
Company. Used by permission.)

It is customary to speak of the s and p states in terms of penetrating
orbits, a term which remains from the days of Sommerfeld’s expansion
of the Bohr theory to elliptic orbits. While the idea of sharp orbits is
unacceptable in modern wave mechanics, the language of Sommerfeld’s
model continues to be widely used. Not only is the penetrating-orbit
effect enough to explain why for a given n states of smallest l lie lowest,
but it is also adequate to explain why low-angular-momentum states of
a higher shell may have a smaller energy than the high-angular-momentum
states of a shell of lower n. For example, we have seen that for
potassium the 4s state lies below the 3d. Similarly, for rubidium the
5s state lies below the 4d; for cesium the 6s lies below the 5d and the
4f as well.


As we have seen, the 4s level lies below the 3d for potassium (Z = 19).
For hydrogen, on the other hand, the n = 4 levels all lie higher than the
n = 3 levels; the dependence of the energy on l is very small and arises
from entirely different causes. For elements of Z = 2 to 5 the 3d level
remains below the 4s, but at carbon (Z = 6) the electron core has grown
sufficiently for the 4s level to fall below the 3d. From there to chromium
the 4s level lies lower, but at chromium (Z = 24) the two are extremely
close together, and one of the two 4s electrons goes to the 3d state,
giving a 3d⁵4s configuration. For manganese (Z = 25) through nickel
(Z = 28) the 4s level falls below the 3d, but at copper (Z = 29) the 3d
state sinks below the 4s and remains there throughout the rest of the
periodic table. Similar crossovers occur for the 5s and 4d levels and for
the 6s, 5d, and 4f levels.
15.7 Low-lying Energy States


In discussing the periodic table, we have assumed that in a neutral heavy
atom there are 2 electrons characterized by n = 1, 8 with n = 2, 18
with n = 3, and so forth. Further, we have assumed that each occupies
an energy state and satisfies a wave function which may be crudely
approximated by the corresponding hydrogenic energy and $\psi_{nlm}$. The
most direct experimental evidence for the validity of these assumptions
lies in x-ray phenomena. We begin by considering the absorption of
x-rays in matter. In contrast with the apparently chaotic state of
affairs in regard to the absorption of light in the visible or near-visible
portions of the spectrum, we find comparative simplicity in the laws for
x-ray attenuation. Measurements of the attenuation coefficient for a
monochromatic beam of parallel rays are readily made by use of the
ionization spectrometer (Fig. 7.6) with a tube voltage low enough to
eliminate second-order reflections. For a given crystal angle θ and
wavelength λ, the ionization current is measured both with and without
a sheet of absorbing material of known density ρ and thickness x placed
in the path of the beam, say between the two slits S₁ and S₂. These
measurements give, respectively, I and I₀ in the equation

$$I = I_0 e^{-\mu x} = I_0 e^{-(\mu/\rho)\rho x} \tag{15.2}$$

from which either the linear attenuation coefficient μ or the mass
attenuation coefficient μ/ρ can be computed. Figure 15.5 shows the mass
attenuation coefficient of lead in the wavelength range 0.1 < λ < 1.2 Å.
Beginning at point o, μ/ρ rises rapidly with increasing wavelength, until
point a, corresponding to λ = 0.1405 Å and μ/ρ = 8 (about), is reached, at
which the value of μ/ρ suddenly drops to point a′. This is the K
absorption limit, which we associate with the photoelectric ejection of
electrons from the n = 1, or K, shell. Photons with wavelength shorter
than the K absorption limit can and do eject electrons from the K shell,
while photons with longer wavelengths do not have enough energy to
eject a K electron. Thus the K absorption limit gives a measure of the
energy with which a K electron is bound to the atom. For lead $\lambda_K$,
the wavelength of the K absorption edge, is 0.1405 Å, and the
corresponding energy is $E_K = hc/\lambda_K$ = 88,000 eV.
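The conversion from an absorption-edge wavelength to a binding energy can be checked numerically. The following is a minimal sketch, assuming the exact SI values of h, c, and the electron volt; the function name `edge_energy_ev` is introduced here for illustration and does not come from the text.

```python
# Convert an absorption-edge wavelength to the corresponding photon energy,
# E = h c / wavelength, expressed in electron volts.
PLANCK_H = 6.62607015e-34        # J s (exact SI value)
LIGHT_C = 2.99792458e8           # m/s (exact SI value)
ELECTRON_VOLT = 1.602176634e-19  # J per eV (exact SI value)


def edge_energy_ev(wavelength_angstrom: float) -> float:
    """Photon energy in eV for a wavelength given in angstroms."""
    wavelength_m = wavelength_angstrom * 1e-10
    return PLANCK_H * LIGHT_C / wavelength_m / ELECTRON_VOLT


# K absorption edge of lead, wavelength taken from the text
k_edge_lead_ev = edge_energy_ev(0.1405)
print(f"K edge of lead: {k_edge_lead_ev:.0f} eV")
```

Rounded to two significant figures, the result agrees with the 88,000 eV quoted for the K electron of lead.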
With further increase in wavelength, the absorption again increases
rapidly, being mostly due to the ejection of electrons from the L (n = 2)
shell, until at λ = 0.780 Å there is another sharp drop. After a brief
rise there is another absorption edge at 0.813 Å, and a third at 0.950 Å.
These three discontinuities are known respectively as the $L_{\mathrm{I}}$,
$L_{\mathrm{II}}$, and $L_{\mathrm{III}}$ absorption edges. They reveal that
there are three distinct energy
Fig. 15.5 K and L absorption edges of lead.

All elements with Z > 10 have one K absorption jump and three
L discontinuities; those with the n = 3 shell full have five M edges, and
so forth. Thus the x-ray absorption pattern is similar for all elements,
with the wavelength at which any given edge appears becoming shorter
as Z increases. The fact that there are three groups of electrons
associated with n = 2 is clearly not compatible with the idea that all the 2s
electrons have one energy and all the 2p electrons another energy.
Similarly for n = 3, we have l = 0, 1, and 2, but there are five
characteristic M energies, and for n = 4 we find seven energies, which is
more than l = 0, 1, 2, 3 might suggest. To explain the existence of the
additional energy states we take into account the level splitting between
states with the same n and l but different j (Secs. 14.6 and 14.7).
Equation (14.23) gives a good approximation for x-ray levels provided an
appropriate effective Z is used to take account of the shielding of other
electrons.


15.8 X-ray Energy Levels 


In terms of the spin-orbit interaction it is now possible to explain the
number and types of x-ray absorption edges. For the K electrons l = 0,
so that there is no orbital angular momentum and hence no spin-orbit
interaction. The only allowed value of j is 1/2, and we find a single K
level. For the L shell (n = 2) we have for l = 0 only the possibility
j = 1/2, while for l = 1 there are possibilities j = 1/2 and j = 3/2. We
find that the entire pattern fits in place if we assume that the
$L_{\mathrm{I}}$ absorption discontinuity is associated with the state
having l = 0, j = 1/2; the $L_{\mathrm{II}}$ edge with l = 1, j = 1/2; and
the $L_{\mathrm{III}}$ limit with l = 1, j = 3/2. For the $L_{\mathrm{I}}$
level $m_j$ can be ±1/2, allowing two electrons in this state. For the
$L_{\mathrm{II}}$ level $m_j$ = ±1/2, and again two electrons are allowed.
The $L_{\mathrm{III}}$ state has j = 3/2, and so $m_j$ can be 3/2, 1/2,
−1/2, or −3/2, a total of four possibilities. Thus in the $nljm_j$
description, there are in a filled L shell twice as many electrons in the
$L_{\mathrm{III}}$ level as in the $L_{\mathrm{I}}$ or $L_{\mathrm{II}}$
states. One would expect that there would be a bigger drop in the
absorption coefficient when four electrons are involved than two; this is
indeed the case in Fig. 15.5.

When the spin-orbit interaction is included, it becomes convenient
to think of each subshell as defined by a given nl being split into two
nlj subshells, each composed of all electrons having the same values of
n, l, and j. In the absence of electric or magnetic fields all electrons in
a given nlj subshell have the same energy. Each closed subshell
contains 2j + 1 electrons, one for each value of $m_j$. Table 15.2 shows the
symbols for the various inner subshells and the maximum number of
electrons allowed in each.
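The counting rule just stated can be sketched in a few lines. This is an illustrative enumeration only, assuming nothing beyond the rules in the text (l runs from 0 to n − 1, j = l ± 1/2 with only j = 1/2 for l = 0, and 2j + 1 electrons per closed nlj subshell); the function name `nlj_subshells` is introduced here and is not the book's notation.

```python
from fractions import Fraction


def nlj_subshells(n: int):
    """Return a list of (l, j, capacity) for every nlj subshell of shell n."""
    levels = []
    for l in range(n):
        if l == 0:
            # With no orbital angular momentum only j = 1/2 is possible.
            j_values = [Fraction(1, 2)]
        else:
            # Spin-orbit splitting gives j = l - 1/2 and j = l + 1/2.
            j_values = [Fraction(2 * l - 1, 2), Fraction(2 * l + 1, 2)]
        for j in j_values:
            capacity = int(2 * j + 1)  # one electron for each value of m_j
            levels.append((l, j, capacity))
    return levels


for n, shell in [(1, "K"), (2, "L"), (3, "M"), (4, "N")]:
    levels = nlj_subshells(n)
    total = sum(capacity for _, _, capacity in levels)
    print(f"{shell} shell: {len(levels)} levels, {total} electrons")
    for l, j, capacity in levels:
        print(f"   l = {l}, j = {j}: {capacity} electrons")
```

The enumeration gives one K level, three L levels, five M levels, and seven N levels, with capacities of 2, 2, and 4 for the three L levels, in line with the absorption edges described above.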
Table 15.2 Symbols for electron shells, subshells, and x-ray levels

Fig. 15.6 Robinson's magnetic spectrograph for studying the
photoelectric action of x-rays.

Robinson and his collaborators, whose apparatus is shown
diagrammatically in Fig. 15.6. A beam of x-rays of frequency ν enters through
a thin window W a highly evacuated brass box BB and falls upon a
target T of the material under investigation. Photoelectrons are expelled
from the surface of T in all directions and with various velocities. The
whole apparatus is placed in a known magnetic field B, at right angles
to the plane of the paper, which can be varied at will; the photoelectrons
describe circles in this field. Some of them pass through the narrow slit
S and eventually strike the photographic plate PP. If the electrons
leaving T have velocities v₁, v₂, . . . , they will move in circles of radii
r₁, r₂, . . . and will strike the plate at points L₁, L₂, . . . . As shown
in the figure, the arrangement is such as to “focus” electrons leaving
the different parts of the target with the same velocity onto the plate
at such positions as L₁ and L₂, the diameter of the circle being the
distance between S and L₁ or L₂.

Values of the kinetic energy K of the photoelectrons can be
calculated from measured values of r, for comparison with $h(\nu - \nu_A)$,
where $\nu_A$ is any absorption limit. Robinson found on his plates a number
of “lines” representing groups of photoelectrons. The corresponding
absorption limits were in complete agreement with values determined
spectroscopically. He found one K level, three L levels, five M levels,
and, in the case of uranium, five of the seven N levels, the pairs of levels
$N_{\mathrm{IV,V}}$ and $N_{\mathrm{VI,VII}}$ being too close together to be
resolved in his apparatus.
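The step from a measured radius to a kinetic energy can be sketched as follows. This is a hedged illustration, not Robinson's own reduction: it assumes the standard relation p = eBr for a charge moving on a circle in a uniform field, together with the relativistic expression K = [(pc)² + (mc²)²]^½ − mc². The field and radius used in the example are hypothetical values chosen only to give an energy in the x-ray photoelectron range.

```python
import math

LIGHT_C = 2.99792458e8               # m/s (exact SI value)
ELECTRON_REST_ENERGY_EV = 510998.95  # m c^2 for the electron, in eV


def momentum_times_c_ev(b_tesla: float, radius_m: float) -> float:
    """pc in eV for an electron on a circle of radius r in a field B.

    From p = e B r, pc expressed in eV is (e B r c) / e = c B r.
    """
    return LIGHT_C * b_tesla * radius_m


def photoelectron_energy_ev(b_tesla: float, radius_m: float) -> float:
    """Relativistic kinetic energy in eV from the orbit radius."""
    pc = momentum_times_c_ev(b_tesla, radius_m)
    return math.sqrt(pc**2 + ELECTRON_REST_ENERGY_EV**2) - ELECTRON_REST_ENERGY_EV


def nonrelativistic_energy_ev(b_tesla: float, radius_m: float) -> float:
    """Low-velocity approximation K = p^2 / 2m, for comparison."""
    pc = momentum_times_c_ev(b_tesla, radius_m)
    return pc**2 / (2.0 * ELECTRON_REST_ENERGY_EV)


# Hypothetical example: B = 0.01 T, circle diameter 0.10 m (r = 0.05 m)
kinetic_ev = photoelectron_energy_ev(0.01, 0.05)
approx_ev = nonrelativistic_energy_ev(0.01, 0.05)
print(f"relativistic K = {kinetic_ev:.0f} eV, p^2/2m = {approx_ev:.0f} eV")
```

For energies of a few tens of keV the two expressions differ by a few percent, which is why the relativistic form is assumed here for comparison with $h(\nu - \nu_A)$.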

The technique of measuring the velocities of photoelectrons ejected
by x-rays has been developed to the point where this method is now one
of the most reliable for the precise determination of physical constants.
By using double-focusing electron spectrometers and other improvements
it is now possible to resolve lines differing in energy by only
one one-hundredth of the energy difference formerly required. For example,
Hagström, Nordling, and Siegbahn have found that the K electron of
sulfur is bound 5.5 eV more tightly in the 6+ state of ionization (Na₂SO₄)
and 4.3 eV more tightly in the 4+ valence state (Na₂SO₃) than in pure
sublimed sulfur.


Appendix 15A - Wave Mechanics for Two 
Identical Particles with Spin One-half 


Suppose that there are two identical particles in the square well of
Sec. 13.3. Assume for the moment that they do not interact with each
other and that no such phenomenon as spin need be considered. Let
the coordinates of the particles be $x_1$ and $x_2$. Then, in classical
mechanics, the combined (kinetic) energy K of the particles equals
$(p_1^2 + p_2^2)/2m$, where $p_1$ and $p_2$ are their respective momenta.
Replacing the momenta by operators as in Eq. (13.12a), we have as the
time-free wave equation for the two particles

$$-\frac{h^2}{8\pi^2 m}\left(\frac{\partial^2\psi}{\partial x_1^2} + \frac{\partial^2\psi}{\partial x_2^2}\right) = E\psi \tag{15A.1}$$

Here $\psi$ is a function of both $x_1$ and $x_2$; it has the significance
that $|\psi|^2\,dx_1\,dx_2$ is the probability of finding particle 1 in
$dx_1$ and, simultaneously, particle 2 in $dx_2$. Our present problem is
simple, however, in that solutions of (15A.1) can be found which are
products of functions of $x_1$ and $x_2$ separately. Such a solution,
satisfying the boundary conditions and normalized so that
$\int_0^L dx_1 \int_0^L dx_2\,|\psi|^2 = 1$, is

$$\psi_{nk} = \psi_n(x_1)\,\psi_k(x_2) = \frac{2}{L}\sin\frac{n\pi x_1}{L}\,\sin\frac{k\pi x_2}{L} \qquad 0 < x_1, x_2 < L$$

where $\psi_n$ and $\psi_k$ are one-particle functions and n and k are any
two positive integers. The associated energy of the system is

$$E_{nk} = \frac{h^2}{8m}\,\frac{n^2 + k^2}{L^2} \tag{15A.2}$$


This solution represents particle 1 as being in state n and particle
2 as in state k. A new and very important form of degeneracy now appears,
however, which may be called exchange degeneracy. For, if $x_1$ and $x_2$
are interchanged in $\psi_{nk}$, provided n ≠ k, another wave function
corresponding to the same value of the energy is obtained, viz.,

$$\psi_{kn} = \frac{2}{L}\sin\frac{k\pi x_1}{L}\,\sin\frac{n\pi x_2}{L}$$


15A.1 Interacting Particles


These functions as written are not suitable for use in a perturbation
calculation. For, if an attempt is made to correct for a small
perturbing term $f(x_1,x_2)$ in the energy operator, a difficulty is
encountered arising from the fact that $\psi_{nk}$ and $\psi_{kn}$ are
associated with the same energy. Equations analogous to (13.42) are
obtained, and, among them, one in which n is replaced by nk and k by kn;
after removing the prime from $E'_{nk}$ in this equation, as was done in
proceeding to (13.42a), the equation takes the form

$$(E_{kn} - E_{nk})\,b_{nk} + \int_0^L dx_1 \int_0^L f(x_1,x_2)\,\psi_{nk}\psi_{kn}\,dx_2 = 0$$

Here, however, $E_{kn} - E_{nk} = 0$, whereas the integral may not vanish.

The simplest and usual procedure in such cases is to substitute
suitable new combinations of the degenerate wave functions. Instead
of $\psi_{nk}$ and $\psi_{kn}$, let us employ as zero-order functions two
combinations of these functions so chosen that the matrix component of the
given perturbation energy $f(x_1,x_2)$ between the new functions vanishes.
The perturbation matrix is then said to be diagonalized with respect to
these particular zero-order functions, a completely diagonal matrix being
one that has zeros for all its elements except those on the leading diagonal
(top left to bottom right). When such functions are used, the difficulty
disappears.

The necessity of choosing suitable zero-order functions in cases of 
degeneracy has an exact classical analog. In the case of the vibrating 
drumhead described in Sec. 14.8, if the symmetry is destroyed by the 
addition of a small eccentrically placed weight w, the effect on the fre- 
quency can be calculated by means of perturbation theory; but it is now 
necessary to start from a particular choice of the zero-order modes of 
vibration (see Fig. 15A.1). One of the modified modes of vibration will 
closely resemble a zero-order mode in which the nodal line passes through 
the position of the added weight; the other new mode will have its nodal 
line in a perpendicular direction. One effect of the weight is thus to 
fix the nodal lines. Another effect is to replace the single zero-order
frequency by two slightly unequal frequencies associated with two
alternative patterns of vibration. Thus the degeneracy is removed by
the added weight.

Fig. 15A.1 Nodal lines on a loaded vibrating drumhead.

In the present case, the necessary combinations of $\psi_{nk}$ and
$\psi_{kn}$ to serve as zero-order starting points for perturbation theory
are found to be a symmetric one and an antisymmetric one defined as follows:

$$\psi_{nkS} = \frac{1}{\sqrt{2}}\,(\psi_{nk} + \psi_{kn}) \tag{15A.3a}$$

$$\psi_{nkA} = \frac{1}{\sqrt{2}}\,(\psi_{nk} - \psi_{kn}) \tag{15A.3b}$$


If $x_1$ and $x_2$ are interchanged in these functions, it is obvious that
$\psi_{nkS}$ is unaltered, so that this function is symmetric for exchange
of the two particles; whereas $\psi_{nkA}$ just changes sign and is,
therefore, antisymmetric for such exchange. The factor $1/\sqrt{2}$ is
inserted to preserve the normalization. [Symmetry for particle exchange
should not be confused with symmetry in the coordinates of one particle; an
example of the first would be $\psi = (x_1 - x_2)^2$, of the second,
$\psi = x_1^2 x_2$.]

The new functions are orthogonal to all independent wave functions,
and also to each other; for a short calculation shows that

$$\int_0^L dx_1 \int_0^L \psi_{nkS}\,\psi_{nkA}\,dx_2 = 0$$

Furthermore, these functions have the property that the matrix
component with respect to them of any operator that is symmetric in $x_1$
and $x_2$ vanishes. For example, suppose the complete wave equation
contained a small potential term of the form $P = f[(x_1 - x_2)^2]$. Then
$\int_0^L dx_1 \int_0^L \psi_{nkS}\,P\,\psi_{nkA}\,dx_2 = 0$. This is most
quickly seen by noting that if $x_1$ and $x_2$ are interchanged as variables
of integration, which is always possible in a definite integral, the
integral changes sign; but the only number that is equal to its own
negative is 0.

It will be noted that in both $\psi_{nkS}$ and $\psi_{nkA}$ the assignment
of a particular particle to state n or to state k has disappeared. In two
important respects, however, these functions have contrasting properties.

Consider first the probability densities

$$\psi_{nkS}^2 = \tfrac{1}{2}(\psi_{nk}^2 + \psi_{kn}^2) + \psi_{nk}\psi_{kn}$$

$$\psi_{nkA}^2 = \tfrac{1}{2}(\psi_{nk}^2 + \psi_{kn}^2) - \psi_{nk}\psi_{kn}$$

In the right-hand members of these two expressions, the first term
represents the probability density on the assumption that the particles move
independently. For, if particle 1 is in state n, the chance of finding
$x_1$ in $dx_1$ is $\psi_n^2\,dx_1$, and similarly it is $\psi_k^2\,dx_2$
for $x_2$ in $dx_2$, so that the combined probability for both as
independent events is the product of these two expressions, or
$\psi_{nk}^2\,dx_1\,dx_2$; and similarly for the alternative arrangement
represented by $\psi_{kn}$. The second term on the right is a
wave-mechanical novelty. Wherever $x_1$ and $x_2$ have nearly equal values,
$\psi_{nk}\psi_{kn} = \psi_{nk}^2 = \psi_{kn}^2$, approximately, so that the
effect of the last term is almost to double $\psi_{nkS}^2$ but to reduce
$\psi_{nkA}^2$ almost to zero. Thus there exists an exchange effect that
causes the particles to favor close proximity when the wave function
describing their state is symmetric under exchange of their coordinates but
to avoid each other’s company when the wave function is antisymmetric. This
exchange effect is quite independent of any force-action between the
particles. It has no analog in classical theory.
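The behavior of the two densities at coincident coordinates is easy to verify numerically. The sketch below assumes the square-well product functions of this appendix with the well length L set to 1 in arbitrary units; the helper names (`psi`, `psi_product`, `density_symmetric`, `density_antisymmetric`) are introduced here for illustration only.

```python
import math

WELL_LENGTH = 1.0  # L, in arbitrary units (an assumption of this sketch)


def psi(n: int, x: float) -> float:
    """One-particle square-well function sqrt(2/L) sin(n pi x / L)."""
    return math.sqrt(2.0 / WELL_LENGTH) * math.sin(n * math.pi * x / WELL_LENGTH)


def psi_product(n: int, k: int, x1: float, x2: float) -> float:
    """Product function psi_nk: particle 1 in state n, particle 2 in state k."""
    return psi(n, x1) * psi(k, x2)


def density_symmetric(n: int, k: int, x1: float, x2: float) -> float:
    """Square of the symmetric combination (psi_nk + psi_kn)/sqrt(2)."""
    value = (psi_product(n, k, x1, x2) + psi_product(k, n, x1, x2)) / math.sqrt(2.0)
    return value * value


def density_antisymmetric(n: int, k: int, x1: float, x2: float) -> float:
    """Square of the antisymmetric combination (psi_nk - psi_kn)/sqrt(2)."""
    value = (psi_product(n, k, x1, x2) - psi_product(k, n, x1, x2)) / math.sqrt(2.0)
    return value * value


n, k = 1, 2
x = 0.3  # both particles at the same point
independent = psi_product(n, k, x, x) ** 2
print("independent-particle density:", independent)
print("symmetric density:           ", density_symmetric(n, k, x, x))
print("antisymmetric density:       ", density_antisymmetric(n, k, x, x))
```

At equal coordinates the symmetric density is twice the independent-particle value and the antisymmetric density vanishes, whatever pair of distinct states n and k is chosen.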

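In the second place, consider again the first-order energy corrections
due to a perturbation that is itself symmetric in the particles. For
concreteness, suppose that the particles carry equal electric charges q and
that their mutual electrostatic energy is P, where
$P = q^2/4\pi\epsilon_0 r$. Then, in analogy with Eq. (13.41),

$$\Delta E_S = \frac{1}{2}\iint \frac{q^2}{4\pi\epsilon_0 r}\,(\psi_{nk}^2 + \psi_{kn}^2)\,dx_1\,dx_2 + \iint \frac{q^2}{4\pi\epsilon_0 r}\,\psi_{nk}\psi_{kn}\,dx_1\,dx_2 \tag{15A.4a}$$

$$\Delta E_A = \frac{1}{2}\iint \frac{q^2}{4\pi\epsilon_0 r}\,(\psi_{nk}^2 + \psi_{kn}^2)\,dx_1\,dx_2 - \iint \frac{q^2}{4\pi\epsilon_0 r}\,\psi_{nk}\psi_{kn}\,dx_1\,dx_2 \tag{15A.4b}$$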
Here it is possible to give to the first term in $\Delta E_S$ and
$\Delta E_A$ a semiclassical interpretation. We can write

$$\iint \frac{q^2}{4\pi\epsilon_0 r}\,\psi_{nk}^2\,dx_1\,dx_2 = \int q[\psi_k(x_2)]^2\,dx_2 \int \frac{q}{4\pi\epsilon_0 r}\,[\psi_n(x_1)]^2\,dx_1$$

and the last integral, in $dx_1$, can be regarded as an expression for the
potential at the point $x_2$ due to the cloud charge representing the first
particle; whereas $q[\psi_k(x_2)]^2\,dx_2$ is the density of the second
charge cloud at this point. A similar interpretation can be put upon the
contribution from $\psi_{kn}^2$. Thus the first term in $\Delta E_S$ and
$\Delta E_A$ can be interpreted as the mutual potential energy of the two
charge clouds due to classical coulomb repulsion of the particles. The
second term on the right in (15A.4a,b), on the other hand, positive in
$\Delta E_S$ and negative in $\Delta E_A$, contains an integral, known as
the exchange integral, which has no direct analog in classical theory. Its
opposite sign in the two cases arises from the wave-mechanical tendency
already noted for the particles to cluster together in a symmetric state
but to avoid each other in an antisymmetric state, thereby resulting in
different mean values of the perturbing potential in the two cases. This
effect occurs in addition to other possible effects caused by mutual
attraction or repulsion of the particles, which may result in further
corrections to the energies.
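The splitting into a direct term plus or minus an exchange integral can be illustrated with a simple numerical quadrature. Because 1/r is singular at $x_1 = x_2$ in one dimension, this sketch swaps in a hypothetical smooth repulsion, a Gaussian in $x_1 - x_2$, as a stand-in for the coulomb energy; that choice, the well length of 1, the strength, the width, and all function names are assumptions made for illustration and are not taken from the text.

```python
import math

WELL_LENGTH = 1.0  # L, arbitrary units (assumption)


def psi(n: int, x: float) -> float:
    """One-particle square-well function sqrt(2/L) sin(n pi x / L)."""
    return math.sqrt(2.0 / WELL_LENGTH) * math.sin(n * math.pi * x / WELL_LENGTH)


def perturbation(x1: float, x2: float, strength: float = 1.0, width: float = 0.2) -> float:
    """Hypothetical repulsion, symmetric under exchange of x1 and x2."""
    return strength * math.exp(-((x1 - x2) / width) ** 2)


def direct_and_exchange(n: int, k: int, steps: int = 120):
    """Midpoint-rule estimates of the direct and exchange integrals."""
    h = WELL_LENGTH / steps
    direct = 0.0
    exchange = 0.0
    for i in range(steps):
        x1 = (i + 0.5) * h
        for j in range(steps):
            x2 = (j + 0.5) * h
            f = perturbation(x1, x2)
            nk = psi(n, x1) * psi(k, x2)  # psi_nk
            kn = psi(k, x1) * psi(n, x2)  # psi_kn
            direct += 0.5 * f * (nk * nk + kn * kn) * h * h
            exchange += f * nk * kn * h * h
    return direct, exchange


direct, exchange = direct_and_exchange(1, 2)
delta_e_symmetric = direct + exchange      # first-order shift, symmetric state
delta_e_antisymmetric = direct - exchange  # first-order shift, antisymmetric state
print("direct term:      ", direct)
print("exchange integral:", exchange)
print("symmetric shift:  ", delta_e_symmetric)
print("antisymmetric:    ", delta_e_antisymmetric)
```

For a repulsive perturbation of this kind the exchange integral is positive, so the symmetric state is raised more than the antisymmetric state, in keeping with the clustering and avoidance described above.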

It may be remarked that the exchange effect gives rise here to a
physical distinction that does not exist in classical theory. Even in the
presence of the perturbing potential, there would be two possible classical
motions differing only in that the two similar particles are interchanged,
and, once started, these motions could not be distinguished from each
other by observation. In wave mechanics, on the other hand, the two
corresponding quantum states differ in energy.

Similar features occur in three-dimensional cases also. The
exchange effect is limited, however, to exactly similar particles.


15A.2 Electron Spin. The Exclusion Principle


be complete, the treatment must 
now be extended to include electron spin. 
the handling of two Separate wave 
very intricate. It is more con- 
mbolism by which these separate 
ally into a single, composite wave 
8. A beam of Sr Kα₁ photons (λ = 0.8735 Å) falls on thin gold foil.
Using the data of Prob. 4, find the kinetic energies of the photoelectrons
ejected from the $L_{\mathrm{II}}$ and $L_{\mathrm{III}}$ levels.


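9. The mass-attenuation coefficients of aluminum, copper, and lead
for x-rays with λ = 0.880 Å are respectively 0.975, 9.12, and 13.5 m²/kg.
Find the mass per unit area of each of these metals required to reduce the
intensity of a beam of 0.880-Å photons by a factor of ½. What are the
thicknesses of the half-value layers if the densities are respectively
2700, 8940, and 11,350 kg/m³?


10. Consider the matrix operator
$\hat{\mathbf{s}} = \mathbf{1}_x \hat{s}_x + \mathbf{1}_y \hat{s}_y + \mathbf{1}_z \hat{s}_z$ with

$$\hat{s}_x = \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} \qquad \hat{s}_y = \frac{\hbar}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} \qquad \hat{s}_z = \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}$$

Show by direct calculation that $\hat{s}_x$, $\hat{s}_y$, and $\hat{s}_z$
satisfy the commutation rule for angular momentum, that is,
$[\hat{s}_x, \hat{s}_y] = i\hbar\hat{s}_z$. Show that
$\alpha = \begin{pmatrix} 1 \\ 0 \end{pmatrix}$ and
$\beta = \begin{pmatrix} 0 \\ 1 \end{pmatrix}$ are simultaneously
eigenfunctions of $\hat{s}^2$ and $\hat{s}_z$ and find the corresponding
eigenvalues.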


chapter sixteen 


X-ray Spectra


In the preceding chapter we discussed the atom as a nucleus surrounded
by a cloud of electrons. To a reasonable approximation we regard each


n = 1), eight L electrons (n = 2), and so forth. Data from photoelectric
absorption are compatible with this idealization. We now consider various


16.1 Low-lying Energy Levels 


In treating atomic states thus far, the energies of levels have been referred
to a zero corresponding to a free electron at rest at infinity; thus all bound


and

$$\Delta j = 0, \pm 1 \tag{16.1b}$$

Transitions for which Δn = 0 are rarely observed in x-ray spectra,
though they are common in optical spectroscopy. The conditions that l
must change by ±1 and that j must change by 0 or ±1 come naturally
out of the wave-mechanical calculation of the probability of an
electric-dipole transition between two states (Sec. 13.10).

In addition to the prominent x-ray lines, all of which arise from elec- 
tric-dipole transitions, there are a few weak lines which arise from mag- 
netic-dipole and electric-quadrupole transitions for which wave mechanics 
gives different selection rules. Other faint lines originate in transitions 
between states of double ionization (Sec. 16.6). 

There is no universally accepted system of names for x-ray lines,
though the designations in Fig. 16.1 are widely used. Except for the
most prominent lines, it is customary (and always specific) to list the
transition by giving the initial and the final atomic level. The general
basis for the nomenclature is essentially as follows. First, the series
(K, L, M, etc.) is given, corresponding to n for the initial state. Then
the strongest line in the series when observed under low resolution is
named α, the next strongest β, and so forth. Under higher resolution
the α line is in general a doublet; the more intense component is called
α₁ and the weaker α₂. In the L series the β₁ line is next strongest.
There are many weaker lines in the same wavelength region; these are
called β₂, β₃, . . . , β₁₅. A similar system is used for the γ lines. The
relative positions of the less prominent lines vary from element to
element, and a numbering system which appears entirely logical for one
element may seem chaotic for an element of substantially different
atomic number.

The Kα₁ line arises from a $K \rightarrow L_{\mathrm{III}}$ transition for
the atom, corresponding to an $L_{\mathrm{III}}$ electron jumping to a
vacancy in the K shell. The Kα₂ line is associated with an
$L_{\mathrm{II}}$ electron filling the K vacancy. Since there are four
(2j + 1) $L_{\mathrm{III}}$ electrons and two $L_{\mathrm{II}}$ electrons,
one might guess that the Kα₁ line is twice as intense as the Kα₂ line, an
inference well substantiated by experiment. Similarly the line Kβ₁
($K \rightarrow M_{\mathrm{III}}$) is twice as strong as Kβ₃
($K \rightarrow M_{\mathrm{II}}$), again because the statistical weights
(2j + 1) are in the ratio of 4:2. However, this simple proportionality to
the statistical weight is good only for the intensities of two lines of
very nearly the same frequency. Thus Kα₁ is much stronger than Kβ₁,
although the statistical weights of $L_{\mathrm{III}}$ and
$M_{\mathrm{III}}$ are both 4; the probability that an L electron will fill
a K vacancy is very much greater than that for an M electron. The relative
intensities of certain L and M x-ray lines can be calculated by applying
the Burger-Dorgelo-Ornstein sum rule (Sec. 17.3).


16.3 Electron Excitation of X-ray Levels 


One of the first experimental tests of the theory of x-ray spectra was made
by Webster in 1916. Using a rhodium target, he increased the energy
of the bombarding electrons and observed the intensity of the rhodium
Kα₁ line at 0.613 Å. A plot of the intensity of x-radiation at this
wavelength as a function of tube voltage is shown in Fig. 16.2. Such a plot
is called an isochromat. The curve shows that the intensity increases
gradually as the voltage across the x-ray tube increases, until at 23.2 kV
it rises abruptly. This can be explained as follows. Below 23.2 kV
the intensity at 0.613 Å is due to the continuous x-radiation, which
increases as the voltage is raised, as discussed in Sec. 7.8. None of this
radiation is characteristic of the rhodium; a similar intensity would be
expected for any target of comparable atomic number. At 23.2 kV
the incident electrons have sufficient energy to eject K electrons from
rhodium atoms. As soon as vacancies in the K shell are produced,
transitions from other states occur. In particular, electrons from the
$L_{\mathrm{III}}$ level undergo transitions to the K shell and give rise
to the rhodium Kα₁ line. As the energy of the bombarding electrons rises,
the number of K electrons ejected increases rapidly and the intensity of
the Kα₁ line rises steeply.

Fig. 16.2 An isochromat showing the intensity of the rhodium Kα₁ line
as a function of the potential difference applied to the x-ray tube.
(Axes: intensity at 0.613 Å versus tube voltage in kV.)

Photons of the rhodium Kα₁ line have an energy of 20.1 keV,
rather than 23.2 keV. The difference of 3.1 keV corresponds to the
energy of the L_III level. In order to excite the K lines it is necessary
to eject an electron from the K shell. These electrons cannot go to 
an occupied level; they must either be ejected from the atom or else 
go to an allowed optical level. The excitation potential for all Rh K 
lines is exactly the same, since in every case it is necessary to eject an 
electron from the K shell before any K line can be radiated. In the case 
of x-rays elements do not strongly absorb the frequencies which they 
emit. This is in sharp contrast to the optical case. The reason is simple: 
in optical spectra electrons are raised to vacancies in higher levels and 
return to the ground states; in x-ray spectra electrons must be raised to 
an unoccupied level. When there is a vacancy in an x-ray level, it is 
most likely to be filled by one of the electrons from an intermediate 
level. 
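
The energy bookkeeping in this section can be reproduced numerically. The
sketch below assumes the rounded conversion constant hc ≈ 12.398 keV·Å
(not stated in the text) and uses the wavelength and excitation voltage
quoted above; the function name is hypothetical. The results agree with the
quoted 20.1 keV and 3.1 keV to within about 0.1 keV.

```python
HC_KEV_ANGSTROM = 12.398  # assumed rounded value of h*c in keV·Å

def photon_energy_kev(wavelength_angstrom):
    """Photon energy in keV for a wavelength given in angstroms."""
    return HC_KEV_ANGSTROM / wavelength_angstrom

line_energy = photon_energy_kev(0.613)   # rhodium Kα₁ wavelength from the text
k_excitation = 23.2                      # keV, from the isochromat
print(round(line_energy, 1))                 # 20.2
print(round(k_excitation - line_energy, 1))  # 3.0, the L_III level energy
```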

Many isochromats have been run for other x-ray lines. The 
results are in excellent agreement with data from other experiments 
designed to measure the energies of low-lying atomic energy levels. 


16.4 X-ray Doublets and Screening Constants 


The manner in which the energy of a given inner electron level varies
with atomic number is shown in the Moseley plot of Fig. 16.3. A
striking feature of that figure is the fact that the separation of the Moseley
lines for the levels L_II-L_III, M_II-M_III, M_IV-M_V, etc., increases steadily
as Z rises, while the separation of the lines for L_I-L_II, M_I-M_II,
M_III-M_IV, etc., remains roughly constant. The intervals L_I-L_II,
M_I-M_II, . . . are known as screening doublets and correspond to
differences between levels having the same n, s, and j, but different l
values. The intervals L_II-L_III, M_II-M_III, . . . are called spin-relativity
doublets, or regular doublets, and correspond to differences between levels
with the same n, l, and s, but different j.

The general features of the doublet separations can be explained 
semiquantitatively on the basis of fairly simple arguments. To the 
extent that the energy of a given level can be attributed to the coulomb 
interaction of an electron with the nucleus and the average charge 
density of a symmetrical electron cloud, the energy of a level is given 
in first approximation by E_n = −Rhc(Z − σ₁)²/n². Here σ₁ is a
screening constant which takes into account the effect of the electron
cloud, so that Z − σ₁ is an effective nuclear charge.
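
As a rough numerical illustration of this first approximation, the sketch
below inverts E_n = −Rhc(Z − σ₁)²/n² to obtain an effective screening
constant from a measured level energy. It assumes Rhc = 13.6 eV (the value
used in the caption of Fig. 16.3) and applies it to the rhodium K level
(Z = 45, 23.2 keV) quoted elsewhere in this chapter. The function names are
hypothetical, and because relativistic and spin-orbit corrections are
ignored the result is only indicative.

```python
import math

RYDBERG_EV = 13.6  # Rhc in eV

def binding_energy_ev(z, sigma1, n):
    """Magnitude of E_n = -Rhc (Z - sigma1)^2 / n^2, in eV."""
    return RYDBERG_EV * (z - sigma1) ** 2 / n ** 2

def screening_constant(z, binding_ev, n):
    """Invert the formula above for sigma1, given a binding energy in eV."""
    return z - n * math.sqrt(binding_ev / RYDBERG_EV)

sigma_k = screening_constant(45, 23.2e3, 1)
print(round(sigma_k, 2))  # 3.7
```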


Fig. 16.3 Moseley plot for the x-ray levels where T is the term value in
wavenumber units; √(T/R) = √(E/13.6), where E is the energy of the level
in electron volts. (After H. E. White, "Introduction to Atomic Spectra."
Copyright 1934. McGraw-Hill Book Company. Used by permission.)


The splitting of an n,l level into two fine-structure levels for j = l + 1/2
and j = l − 1/2 is due to the spin-orbit interaction, for which Eq. (14.23)
is appropriate provided we replace Z by Z − σ₂, where σ₂ is a screening
constant which is less than σ₁. The reason σ₂ is smaller is that for the
spin-orbit interaction it is the net electric intensity experienced by the
electron which is important, not the energy. A symmetrical shell of
electrons external to the electron in question takes no part in reducing the
effective nuclear charge insofar as the electric intensity is concerned,
but it does insofar as energy is concerned. For the relativistic
correction, Eq. (14.16), it turns out that Z − σ₂ should replace Z, and then
Eq. (14.16) is a valid approximation, even for l = 0. When the
relativistic and spin-orbit corrections are added to E_n above, we have for
the energy E_nlj of an x-ray level characterized by n, l, j the result


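$$E_{nlj} = -Rhc\left\{\frac{(Z-\sigma_1)^2}{n^2} + \frac{\alpha^2 (Z-\sigma_2)^4}{n^4}\left[\frac{n}{l+\tfrac{1}{2}} - \frac{3}{4} - \frac{n\,[\,j(j+1) - l(l+1) - s(s+1)\,]}{2l\,(l+\tfrac{1}{2})(l+1)}\right]\right\} \qquad (16.2)$$


where α ≈ 1/137. (In a more detailed calculation other terms in higher
powers of α² occur.) We are now in a position to explain the various
doublet separations.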


a. Regular-doublet Law The energy difference ΔE between the two levels
of a spin-relativity doublet is proportional to the fourth power of Z − σ₂,
where σ₂ is the screening constant appropriate for the spin-orbit
interaction. Two energy states differing only in j correspond to the same
"orbits" and have the same first terms in Eq. (16.2). Increasing j by 1
clearly changes the energy by an amount proportional to (Z − σ₂)⁴. Since
only electrons inside the orbit in question influence σ₂, one expects that
σ₂ should be independent of Z. But σ₂ increases as one goes to subshells
further from the nucleus. Sommerfeld found that σ₂ is 3.5 for L_II,III,
8.5 for M_II,III, 13 for M_IV,V, 17 for N_II,III, 24 for N_IV,V, and
34 for N_VI,VII.
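
The regular-doublet law lends itself to a simple scaling estimate. The
sketch below is illustrative only: the function name is hypothetical, and
the two atomic numbers (47 and 49) are chosen merely as an example of
nearby elements. It uses Sommerfeld's σ₂ = 3.5 for the L_II,III doublet and
returns the factor by which the doublet separation is expected to grow.

```python
def regular_doublet_ratio(z_from, z_to, sigma2):
    """Expected ratio of doublet separations, using dE proportional to (Z - sigma2)^4."""
    return ((z_to - sigma2) / (z_from - sigma2)) ** 4

# L_II - L_III separation scaled from Z = 47 to Z = 49 with sigma2 = 3.5.
ratio = regular_doublet_ratio(47, 49, 3.5)
print(round(ratio, 3))  # 1.197
```

Multiplying a tabulated separation at the first atomic number by this
factor gives the estimate at the second.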


all corrections involving a2 
of the doublet 

Fig. 16.4 The screening constant σ₁ as a function of atomic number Z.
(After Sommerfeld.)


Since a and b involve different orbits, σ₁ is different in the two cases and
we have Δ√E = √(−E_a) − √(−E_b) = √(Rhc) Δσ₁/n, where Δσ₁ is the
difference in screening constants. The screening constant σ₁ increases
with Z (Fig. 16.4), as we expect. The changes in slope of the N curves
are associated with the filling of the 4f shell for 57 < Z < 72.
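
The relation Δ√E = √(Rhc) Δσ₁/n can be turned around to extract Δσ₁ from
two measured level energies. The sketch below is a hedged illustration: the
function name is hypothetical, Rhc is taken as 13.6 eV, and the two inputs
are the pair of mercury L-level energies (n = 2) quoted in Problem 8 of
this chapter.

```python
import math

RYDBERG_EV = 13.6  # Rhc in eV

def screening_difference(e_a_ev, e_b_ev, n):
    """Return n * (sqrt(E_a/Rhc) - sqrt(E_b/Rhc)) for binding energies in eV."""
    return n * (math.sqrt(e_a_ev / RYDBERG_EV) - math.sqrt(e_b_ev / RYDBERG_EV))

delta_sigma = screening_difference(14873.0, 14238.0, n=2)
print(round(delta_sigma, 2))  # 1.43
```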


16.5 The Fluorescence Yield and the Auger Effect


Thus far the one process we have discussed by which an excited atom
can go to a lower energy state is the emission of a photon. This
is by no means the only possibility. As early as 1909 it was found
that the number of K photons emitted from a metal appeared to be less
than one-third the number of K vacancies produced in the metal. If
n_K electrons are ejected from the K shell of some element and as a result
there are n_p K photons emitted, the K fluorescence yield ω_K is defined
to be


$$\omega_K = \frac{n_p}{n_K} \qquad (16.3)$$


The fluorescence yields of other levels are similarly defined; for any
state ω is the fraction of the excitations of the state which lead directly to
the emission of a photon. Several methods for measuring fluorescence
yields have been devised. It is found that ω_K increases with the atomic
number of the emitter, going from less than 0.10 for light elements to
more than 0.95 for uranium.

If only 40 K photons are emitted for 100 K electrons ejected in 
copper, what process leads to the filling of the other 60 vacancies? The 
answer emerged from the work of Auger in 1925. Using a Wilson cloud 
chamber, he studied the x-ray ejection of photoelectrons from argon 
atoms. He found that over 90 percent of the long photoelectron tracks 
were accompanied by a short track. To make a quantitative study 
of the latter, Auger diluted the argon with hydrogen to increase the 
length of what we now call the Auger tracks. He found that: 


1. The Auger track always originates at the same point as a photoelectron track. 


2. The direction of ejection of the Auger electron is random and independent 
of the direction taken by the photoelectron. 


3. The length of the Auger track is independent of its direction, of the direction 
of the photoelectrons, and of the frequency of the incident x-ray photons. 


The first point suggests that the photoelectron and the Auger 
electron come from the same atom. The other two are consistent with 
the hypothesis that the ejections of the photoelectron and the Auger 
electron are independent events, occurring sequentially rather than 
simultaneously. In Auger’s experiments the ejection of the photo- 
electron left the argon atom with a vacancy in the K shell. When one 
L electron filled this vacancy, a second L electron was ejected as the 
Auger electron, thus leaving the atom doubly ionized. Some persons
have called this process the internal photoelectric effect, imagining that
a Kα photon was absorbed on its way out of the atom by an L electron,
which was thus internally ejected as a photoelectron. There exists an
abundance of evidence that this is not the case. If this explanation were
correct, one would find Auger electrons associated not only with atoms
from which the photoelectron was ejected but also with other atoms.
However, we do not find Auger tracks originating at other atoms.
Also, from the absorption coefficient of argon for its own Kα radiation,
one expects that perhaps one photon in a million might be absorbed in
the same atom from which it is emitted. But Auger found that 93
percent of the argon atoms had Auger electrons associated with the
photoelectrons. It is generally agreed that no photon is emitted in the
Auger process; rather the excited atom undergoes a transition in which
one electron moves to a lower level and another electron is simultaneously
ejected. An adequate wave-mechanical description of Auger transitions
has been developed; it predicts transition probabilities of the
observed magnitude.
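
If every K vacancy is filled either radiatively or by an Auger transition,
the fluorescence yield follows directly from track counts of the kind Auger
recorded. The sketch below rests on that assumption; the function name is
hypothetical, and the counts are the ones quoted in Problem 5 of this
chapter.

```python
def fluorescence_yield(n_vacancies, n_auger):
    """Yield = 1 - (Auger fraction), assuming each vacancy decays one way or the other."""
    if n_vacancies <= 0 or not 0 <= n_auger <= n_vacancies:
        raise ValueError("counts must satisfy 0 <= n_auger <= n_vacancies, n_vacancies > 0")
    return 1 - n_auger / n_vacancies

print(round(fluorescence_yield(223, 109), 2))  # 0.51 (krypton counts)
print(round(fluorescence_yield(165, 153), 2))  # 0.07 (argon counts)
```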

In a K → L_III L_III transition the energy of the Auger electron is
the energy of the K level less the energy of the doubly ionized atom
described by L_III L_III. The latter is slightly more than twice the L_III
ionization energy because it requires more energy to remove an electron
from an ionized atom of atomic number Z than from the corresponding
state in a neutral atom. (In many cases it is a good approximation
to assume this is the energy needed to remove an electron from the same
state in an atom of atomic number Z + 1.) The energy of the Auger
electron depends on the properties of the ejecting atom only. Auger
transitions need not leave the atom with two vacancies in the same shell.
For example, one Auger transition which is of importance for the Lα
satellites is L_I → L_III M_V; in general, Auger transitions from the L_I
state are so probable that the L-series lines originating from L_I vacancies
are very much less intense than would otherwise be the case.
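
A crude estimate of an Auger electron's energy follows from the energy
balance just described. In the sketch below the energy of the doubly
ionized state is approximated by the plain sum of the two single-vacancy
energies, which the text notes is slightly too small, so the result is an
upper estimate. The function name is hypothetical, and the inputs are the
rough platinum level energies (K ≈ 78 keV, L ≈ 12 keV) given in Problem 2.

```python
def auger_energy_upper_estimate(e_initial, e_final_a, e_final_b):
    """Initial-vacancy energy minus the summed energies of the two final vacancies."""
    return e_initial - (e_final_a + e_final_b)

# K -> LL transition in platinum, all energies in keV.
print(auger_energy_upper_estimate(78, 12, 12))  # 54
```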

It is the Auger effect that competes with radiation as a process by
which an excited atom can lose energy. An Auger transition is one in
which an electron is emitted rather than a photon when an excited atom goes
to a lower energy state. When both electrons of a helium atom are
simultaneously excited from their 1s orbitals, Auger transitions occur in
which one electron is ejected and the other returns to the 1s state.
Such autoionization is not uncommon in atomic spectroscopy.


16.6 Multiple Ionization of Inner Shells 


It was perhaps fortunate that the spectral apparatus available to
Moseley and the early workers did not have the sensitivity and resolving
power of present-day spectrometers. The lines they observed were
the more intense and more easily resolvable lines of
x-ray spectra, due to transitions between states of single ionization
and called first-order lines.

With improvements in technique, other lines were discovered
which did not fit into the conventional diagram. The majority of
these lines were rather faint, were usually found close to, and on the
short-wavelength side of, the more intense diagram lines, and hence
were called satellite lines. A typical spectral curve of the satellite
structure is reproduced in Fig. 16.5. The satellite structure is observed
to be complex, containing numerous component lines of various
intensities. Most (if not all) first-order lines are accompanied by such
satellites; the total number of satellite lines now known far exceeds
the number of diagram lines.

Because of the low intensity of the satellites, reliable experi- 
mental information on their characteristics is extremely difficult to 
obtain. The excitation voltage of certain satellites, however, has 
been definitely shown to be somewhat greater than the excitation 
voltage of the accompanying first-order, or parent, line. In the case 
of the type of K satellites that is illustrated in Fig. 16.5 the energy 
of excitation is found to be equal to the energy required to eject a K 
electron and in addition an L electron from the atom. Hence, we may
assume that the initial state for the emission of these satellite lines is a
state of double ionization, in which the atom has an electronic vacancy
in both the K shell and the L shell. Such a state of the atom may be
called a KL atomic state. In a similar way, other states of double
ionization, such as KM, LM, etc., should be possible.

Fig. 16.5 The Kα doublet of Cu with accompanying satellites. The
satellite intensity, lower left, is obtained by subtracting the estimated
intensity due to other lines. (Parratt, Phys. Rev., vol. 50, p. 1, 1936.)

An atom in a KL state may undergo a radiative transition into
any one of a number of other states of double ionization, for example,
KL → KM (an electron dropping from the M shell into the L shell)
or KL → LL (an electron dropping from the L shell into the K shell).
Estimates of the atomic energy indicate that the loss of energy should
be slightly greater in the transition KL → LL than in the diagram
transition K → L, which gives rise to the Kα lines; hence, the former
transition should give rise to satellites close on the short-wavelength
side of the Kα lines. Similarly, the transition KL → LM should
give rise to satellites on the short-wavelength side of the Kβ lines.
Presumably, a cathode-ray electron can eject two electrons at once
from an atom. If this is the origin of the doubly ionized atoms,
theoretical estimates indicate that the intensity of satellites relative
to the parent lines should decrease in a continuous manner with
increasing atomic number. Such a variation with atomic number
is found by experiment to hold for satellites accompanying lines of the
K series but not for L or M satellites.

The intensity of the satellites accompanying the Lα line is
observed to decrease rather abruptly as the atomic number increases
from 47 to 50 and to increase again rather abruptly at about 75;
between atomic numbers 50 and 75, Lα satellites are practically
unobservable. This anomalous behavior was resolved in 1935, when
Coster and Kronig pointed out the importance of the Auger effect in
this connection. They suggested that the Lα satellites came primarily
from L_III M_IV and L_III M_V initial states and that these in turn were
produced by Auger transitions from L_I states. The transitions L_I →
L_III M_IV and L_I → L_III M_V can occur only if the energy of the L_I state
is greater than that of the doubly ionized states L_III M_IV and L_III M_V.
This condition is satisfied only for elements with Z < 50 and Z > 75.
Hence, if we suppose that few atoms in the states L_III M_IV,V are
produced directly by the cathode-ray bombardment, so that such states
are produced chiefly by Auger transitions, the absence of Lα satellites
from Z = 50 to Z = 75 is explained.

The occurrence of Auger transitions should also have an effect
upon the intensities of certain diagram lines. Thus, transitions for
which L_I is the initial state should be weakened by removal of atoms
from the L_I state through Auger transitions and hence should undergo
a rather abrupt change in intensity at Z = 50 and Z = 75 (about), in
agreement with experiment.


16.7 X-ray Spectra and the Outer Part of the Atom 


X-ray spectra are related primarily to the inner atomic structure. 
Concurrent changes that may occur elsewhere in the atom can largely 
be ignored because these changes have relatively little effect upon the 
energy levels arising from vacancies in the inner shells. Certain 
finer features of x-ray spectra, however, involve in their explanation 
a consideration of the outer part of the atom or even, in a solid or 
liquid, of the surrounding material. 

We have assumed that when an x-ray photon is absorbed, an
electron is removed entirely from the absorbing atom. If the atom
is isolated, however, as in a monatomic gas, it is possible for the
electron to stop in some outer vacancy in the atom; the absorbed energy
hν would then be less than if the electron were removed to infinity.
The K absorption edge of argon, studied with high resolving power,
is shown in Fig. 16.6. In addition to the photoelectric absorption
extending toward higher frequencies from the K absorption edge,
the curve shows a series of absorption lines on the long-wavelength
side of this edge, resulting from atomic transitions into one of the K
"resonance" levels. These lines should be closely spaced, however;
for the energy differences between the levels should be of the same
order of magnitude as the differences between the ordinary optical
levels, i.e., a few electron volts or less. The electronic configuration
for argon (Z = 18), 1s² 2s² 2p⁶ 3s² 3p⁶, is altered in nonionizing K
absorption to 1s 2s² 2p⁶ 3s² 3p⁶ np, a p state being the only kind to which
an electron can be excited from a 1s state by photon absorption (since
the selection rule allows l to change only by 1). The spacing of the
atomic levels under discussion will thus be almost the same as that of 
the optical P levels of potassium (Z = 19). Hence, if we ascribe the 
most intense line in Fig. 16.6 to the electronic transition 1s → 4p,
we can locate the positions of the other resonance lines and of their
series limit by using the known optical P terms for potassium. The
photoelectric absorption edge should have a finite "width" when
observed with such high resolving power; theory indicates that the
shape should be given by an arctangent curve, whose width is due to
the same factors that cause the width of each of the absorption lines
and also of the emission lines. The theoretical absorption edge is
drawn in the figure as the left-hand dotted curve, centered at the
calculated position of the P-series limit.

Fig. 16.6 The K absorption edge of argon and its interpretation in terms
of a series of resonance lines at wavelengths greater than that of the
continuous K absorption band. (Courtesy of L. G. Parratt.)

When an atom is brought into close proximity to other atoms,
x-ray phenomena involving only ionization in an inner shell are little
affected, but those involving also the atomic exterior are considerably
modified. In a solid, the outermost atomic electrons come to be
associated with the entire solid rather than with individual atoms,
and essentially continuous bands of electronic states replace the
discrete atomic states so far as these electrons are concerned (Sec.
21.5). The simplest case is that of the alkali metals. Here the set of
discrete states of the valence electrons in their individual atoms
is replaced by a band of electronic states several electron
volts wide in energy. Transition of electrons from this band to
vacancies in an x-ray level leads to a broad radiation band rather than
a sharp line. For example, the L emission from solid sodium,
corresponding to transitions from the 3s band to the unresolved 2p level,
is observed as a wedge-shaped band beginning at about 450 Å and
rising to a maximum near 390 Å.


16.8 Refraction and Reflection of X-rays 


a. Refraction The first positive evidence that x-rays are measurably
refracted came from the work of Stenström, who showed from accurate
measurements of wavelength that Bragg's law for the reflection of
x-rays from crystals does not yield identical values when the
wavelength of a given line is computed from different orders of reflection.
Hjalmar found, for example, that the apparent wavelength of the
Fe Kα₁ line as measured in the first order by reflection from a gypsum
crystal (2d = 15.155 Å) was 1.9341 Å, while measurements in the
sixth order gave 1.9306 Å, nearly 0.2 percent less.

This apparent failure of the Bragg formula was shown to be due
to refraction of the rays on entering the crystal, the index of refraction
being less than unity. In Fig. 16.7 is shown the path of a ray entering
at glancing angle θ and incident on the Bragg plane PP at an angle
θ′. Bragg's law in the form


$$n\lambda' = 2d\sin\theta' \qquad (16.4)$$


where λ′ is the wavelength in the crystal, gives the true law of reflection
at the crystal plane. To obtain the relation between θ and λ, the
wavelength in air, we use the law of optics, μ = λ/λ′ = cos θ/cos θ′, μ being
the refractive index from air to crystal. Substituting values of λ′ and
of sin θ′ = (1 − cos² θ′)^(1/2) in Eq. (16.4), expanding in powers of the
very small quantity 1 − μ, and keeping only the first power of this
quantity, it is found that


$$n\lambda = 2d\sin\theta\left(1 - \frac{1-\mu}{\sin^2\theta}\right) \qquad (16.4a)$$


Fig. 16.7 Refraction of x-rays entering the surface of a crystal.
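
The expansion leading to Eq. (16.4a) can be sketched step by step. The
lines below are a reconstruction of the intermediate algebra, writing
δ = 1 − μ for brevity (the same symbol δ is used for 1 − μ in the table
that follows) and dropping terms of order δ².

```latex
\begin{align*}
n\lambda' &= 2d\sin\theta', \qquad
\lambda' = \frac{\lambda}{\mu}, \qquad
\cos\theta' = \frac{\cos\theta}{\mu} \\
n\lambda &= 2d\,\mu\sqrt{1 - \frac{\cos^2\theta}{\mu^2}}
          = 2d\sqrt{\mu^2 - \cos^2\theta} \\
\mu^2 &= (1-\delta)^2 \approx 1 - 2\delta
  \;\Longrightarrow\;
  n\lambda \approx 2d\sqrt{\sin^2\theta - 2\delta} \\
n\lambda &\approx 2d\sin\theta\sqrt{1 - \frac{2\delta}{\sin^2\theta}}
          \approx 2d\sin\theta\left(1 - \frac{\delta}{\sin^2\theta}\right)
\end{align*}
```

The last step uses √(1 − x) ≈ 1 − x/2, which is valid as long as δ is small
compared with sin² θ.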


A few values of 1 − μ, or δ, obtained with use of this formula by
Larsson, from observations in which different orders of diffraction
were compared, are as follows:


              δ × 10⁶
λ, Å       Mica     Calcite

1.537      8.94      8.8
2.499     24.6      22.4
3.447     49.1      41.9
5.166    103
7.111    182
8.320    262


Good values of 1 − μ have also been obtained by refraction
through a prism and by measuring the critical angle for total reflection.
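
The link between δ and the critical angle can be made concrete. The sketch
below assumes the usual condition for total external reflection at glancing
incidence, cos θ_c = μ = 1 − δ, which is not written out in the text, and
applies it to the calcite value δ = 8.8 × 10⁻⁶ at 1.537 Å from the table
above. The function name is hypothetical.

```python
import math

def critical_angle_arcmin(delta):
    """Critical glancing angle, in minutes of arc, from cos(theta_c) = 1 - delta."""
    theta_c = math.acos(1.0 - delta)   # radians
    return math.degrees(theta_c) * 60.0

print(round(critical_angle_arcmin(8.8e-6), 1))  # 14.4
```

For such small δ the result is essentially √(2δ) radians.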


b. Reflection from Ruled Gratings Because the index of refraction of
materials is less than unity for x-rays, x-rays can be totally reflected at
small glancing angles from a surface such as glass or a metal. A ruled
grating might therefore be used for x-rays in the same way that a grating
is used in the optical region, provided that the glancing angle between
the x-ray beam and the ruled surface is less than the critical angle for
total reflection. Compton and Doan in 1925 were the first to make
measurements of this kind. Using a grating of speculum metal with 50
lines per millimeter, they found the wavelength of the Kα₁ line of
molybdenum to be λ = 0.707 ± 0.003 Å. Ten years later Bearden, using
gratings having 100 or 300 lines per millimeter ruled on glass sputtered
with gold, measured the wavelength of the Cu Kα₁ line (1.5406 Å) to an
accuracy better than 0.01 percent.

Measurements made with ruled gratings afford the most reliable
means of determining the absolute magnitudes of x-ray wavelengths,
since they involve no assumptions as to the homogeneity of a crystal;
the only elements that enter into the determination by this
method are the wave theory of light as propagated in a vacuum and
such well-tested operations as the measurement of angles and the
counting of lines under a micrometer microscope.
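
To give a feeling for the angles involved in grating measurements, the
sketch below evaluates the grazing-incidence grating condition
nλ = a[cos θ − cos(θ + α)] that is stated in Problem 13 of this chapter.
The function name and the choice of angstroms and minutes of arc as units
are assumptions made for illustration; the numerical inputs are those of
Problem 13(b).

```python
import math

ARCMIN = math.pi / (180.0 * 60.0)  # one minute of arc in radians

def diffracted_angle_arcmin(wavelength, spacing, theta_arcmin, order):
    """Glancing angle (theta + alpha) of the diffracted beam, in minutes of arc.

    wavelength and spacing must be in the same length unit.
    """
    cos_out = math.cos(theta_arcmin * ARCMIN) - order * wavelength / spacing
    if abs(cos_out) > 1.0:
        raise ValueError("no diffracted beam for this order")
    return math.acos(cos_out) / ARCMIN

spacing = 1.0e7 / 50  # grating space in angstroms for 50 lines per millimeter
for n in (1, -1):
    print(n, round(diffracted_angle_arcmin(2.5, spacing, 40.0, n), 1))
# prints 43.5 for n = 1 and 36.1 for n = -1
```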


Problems 


1. Indicate all allowed (electric-dipole) transitions between the N and
M x-ray levels.
2. If the K, L, and M energy levels of platinum lie at roughly 78, 12,
and 3 keV respectively, compute the approximate wavelengths of the
Kα and Kβ lines. What minimum potential difference across an x-ray
tube is required to excite these lines? At approximately what
wavelength is the K absorption edge? At approximately what wavelength
are the three L edges?

Ans: 0.187 Å, 0.165 Å, 78 kV, 0.159 Å, 1 Å
3. The Kβ₁ line (K → M_III) of rhodium (Z = 45) has a wavelength of
0.5445 Å. Combining this with Webster's data from the K isochromat
(Sec. 16.3), calculate the energy of the M_III level of Rh. Check the
reasonableness of your answer by interpolation, using Table 16.1.
4. (a) If silver Kα photons strike a molybdenum absorber, find the
kinetic energy with which K electrons are ejected.

(b) If the molybdenum atom undergoes an Auger transition, K > 

LyLy1, calculate the initial kinetic energy of the Auger electron, assuming 
that the excitation energy of the state of double ionization is just the 
sum of the individual energies. Does this assumption lead to too high 
or too low an energy for the Auger electron? 
5. (a) In 1925 to 1926 Auger found that of 223 instances in which an
incident photon ejected a photoelectron from the K shell of Kr, 109
of the photoelectron tracks were accompanied by an Auger electron
track. Find the K fluorescence yield in krypton.

(b) Calculate the K fluorescence yield for argon (Z = 18) if out
of 165 K photoelectron tracks, 153 are accompanied by Auger tracks.

Ans: (a) 0.51; (b) 0.07
6. Auger's experiments on the photoelectric absorption of photons in
xenon (Z = 54) lead to the results that ω_K = 0.71 while ω_L ≈ 0.25.
(His data did not permit him to distinguish from which L subshell a
photoelectron arose.) If he studied 270 ejections from the K shell and
175 from the L shell, how many photoelectrons from each shell were
accompanied by Auger electrons?

7. Use the regular-doublet law, Sommerfeld's value for σ₂ (Sec. 16.4),
and the L_II − L_III separation in Ag from Table 16.1 to calculate the
L_II − L_III separation in In. (The measured value is 227.5 eV.)

8. From the screening-doublet law and the data for Pb in Table
16.1, estimate the energy of the L_II level in Hg if the L_I level of mercury
has an energy of 14.873 keV. (The observed energy for L_II is 14.238
keV.)

9. Negative muons captured by protons make transitions from n = 2
levels to the ground state. The photons emitted fall on a copper foil.
From what Cu level is a photoelectron emitted with the smallest energy?
What is the approximate kinetic energy of such a photoelectron?

10. The critical angle for x-ray reflection from glass is 15 minutes of
arc for photons with λ = 1.54 Å. What is the index of refraction of
glass at this wavelength?


Ans: 0.9999905


11. The phase velocity of x-rays in a crystal is slightly greater than
c, and so the index of refraction of the crystal may be written as 1 − δ,
where δ < 10⁻⁵.

(a) Show that when refraction at the crystal surface is taken into
account, Bragg's law is replaced by


$$n\lambda = 2d\left(1 - \frac{\delta}{\sin^2\theta}\right)\sin\theta$$


(b) Show that if λ₁ and λ₂ are the apparent wavelengths obtained
for a given line by Bragg's law in orders n₁ and n₂, δ can be calculated
from


$$\delta = \frac{\lambda_1 - \lambda_2}{\lambda_1}\,\frac{n_2^2}{n_2^2 - n_1^2}\,\sin^2\theta_1$$


which is known as Stenström's formula.

12. Calculate the percentage error introduced in nλ by the use of the
simple form of Bragg's law in place of the more exact form (given in the
preceding problem) for the case in which δ = 10⁻⁵ and θ = 15°.

Ans: 0.015 percent

13. (a) Show that when x-rays are totally reflected at almost grazing
incidence from a ruled grating, the condition for an interference maximum
is nλ = a[cos θ − cos(θ + α)], where a is the grating space, θ the glancing
angle, and θ + α the angle the reflected beam makes with the reflecting
face of the grating. What is the physical significance of negative values
of n?

(b) If 2.5-Å x-rays are incident at a glancing angle of 40 minutes
of arc on a grating ruled with 50 lines per millimeter, find the glancing
angle θ + α for each of the two first-order maxima.


Ans: 43.5 and 36.1 minutes of arc


chapter seventeen


Atomic Spectra


As the starting point for discussing the optical spectra of multielectron
atoms, it is convenient to utilize a vector model in which the electron
cloud about a nucleus of charge Ze arises from Z individual electrons,
each in an electronic state of the kind discussed in Sec. 15.1 and
characterized by four quantum numbers n, l, m_l, and m_s. Although the
electrons lose their identity in the electron cloud, we can imagine each
contributing spin angular momentum, orbital angular momentum,
and the associated magnetic moments to the atom as a whole. The
total angular momentum of the electron system is designated by A_J.

In classical mechanics the vector angular momentum of any isolated
system remains constant in time. Analogously, the quantum states
of any atom not subject to external forces can be so defined that when the
atom is in one of them, the total angular momentum A_J of its electron
system has the fixed magnitude


$$A_J = \sqrt{J(J+1)}\,\hbar \qquad (17.1)$$


and the component of A_J about any chosen z axis drawn through its center of
mass has the fixed value


$$(A_J)_z = M_J\hbar \qquad (17.2)$$


where J and M_J are two quantum numbers characteristic of the quantum
state. The number J is a positive integer or half integer or zero, and
M_J has one of the 2J + 1 integrally spaced values between M_J = −J
and M_J = J, inclusively. Thus, M_J is integral or half integral
according as J is. For example, if J = 0, M_J = 0; if J = 1/2, M_J = 1/2
or −1/2; if J = 1, M_J = 1 or 0 or −1; etc.

The rule for determining whether J and M_J are integral or half
integral can be discovered by considering the electronic quantum states
described in Sec. 15.1. In those states there is, for each electron, an
integral value of the quantum number m_l and also a value of m_s which
is either 1/2 or −1/2; and the total angular momentum of the electrons
about the chosen axis is [Σ(m_l + m_s)]ħ, where the sum Σ is to be taken
for all electrons in the atom. Thus,


$$M_J = \sum (m_l + m_s) \qquad (17.3)$$

From this equality and the character of the values of m_l and m_s it is
at once obvious that M_J and hence also J are integral when the atom
contains an even number of electrons and half integral when it contains
an odd number. It can also be seen that the maximum possible value
of M_J, and hence also of J, is Σl + N/2, where N is the total number
of electrons in the atom; for the maximum value of any individual
m_l is l, and for each electron m_s cannot exceed 1/2.
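
The counting rules for M_J are easy to verify by enumeration. The sketch
below uses exact fractions so that half-integral values are represented
without rounding; the function names are hypothetical.

```python
from fractions import Fraction

def m_j_values(j):
    """Return the 2J + 1 allowed values of M_J, from -J to +J in unit steps."""
    j = Fraction(j)
    if j < 0 or (2 * j).denominator != 1:
        raise ValueError("J must be a non-negative integer or half integer")
    return [-j + k for k in range(int(2 * j) + 1)]

def max_j(l_values):
    """Largest possible J for electrons with the given l values: sum(l) + N/2."""
    return sum(l_values) + Fraction(len(l_values), 2)

print(m_j_values(Fraction(1, 2)))  # [Fraction(-1, 2), Fraction(1, 2)]
print(max_j([0, 0, 1]))            # 5/2
```

An odd number of electrons always yields a half-integral bound, in line
with the parity rule stated above.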

The relationship of the quantum numbers J and M_J can be visualized
with the help of a vector diagram, just as the relation of l and m_l
was visualized for a single electron (Sec. 14.5). As in the case of the
one-electron atom, the vector angular momentum has no definite azimuth 
around the axis, and its components perpendicular to the axis have no 
definite values. 


a. Selection Rules Wave mechanics leads to the following selection
rules for J and M_J, which are of great importance in spectroscopy. In
any atomic jump associated with the dipole emission or absorption of a
photon, J may remain unchanged or it may increase or decrease by unity
but not by any larger amount. That is, if J₁ and J₂ are the values of J
for the two states between which the jump occurs, then either J₁ = J₂
or J₁ − J₂ = 1 or J₁ − J₂ = −1. The same rule holds also for M_J.
No jump can occur, moreover, from a state with J = 0 to another state with
J = 0. In symbols,


$$\Delta J = 0 \text{ or } \pm 1 \ (\text{and not } 0 \to 0) \qquad \Delta M_J = 0 \text{ or } \pm 1 \qquad (17.4)$$


These selection rules are not always strictly obeyed, however; sometimes
jumps occur by quadrupole emission for which ΔJ = ±2 or ΔM_J = ±2.
Spectral lines due to such jumps are mostly very weak.


These selection rules have analogies in classical theory (and were
first proposed on this basis). Orbital angular momentum, in classical
mechanics, is associated with rotation. Suppose that a body capable
of rotation contains an electrical vibrator of some sort, and let this
vibrator emit electromagnetic radiation of frequency ν when the body is
at rest. Then, if the body is set rotating with angular velocity ω, the
radiation emitted will contain just the three frequencies ν, ν + ω,
ν − ω. The reason is similar to that for the analogous effect of a magnetic
field upon a classical vibrating electron, which was described in
Sec. 6.3 in connection with the classical explanation of the Zeeman
effect. The three classical frequencies ν + ω, ν, ν − ω correspond (in
emission) to the three possible changes in M_J. This is an example of
Bohr's correspondence principle.


b. Angular Momenta of Closed Subshells  From the description of
atomic subshells in Sec. 15.2 it is clear that in a closed subshell every
electron is matched by another whose quantum state differs only in that
the sign of m_l is reversed; and the values of m_s are similarly paired off.
Thus M_J = Σm_l + Σm_s = 0. This fact suggests that J = 0 also, and this
surmise can be verified from the theory. Furthermore, Σm_l = 0 and
Σm_s = 0, separately, and, correspondingly, it can be shown that the
total orbital angular momentum of the electrons in a full subshell likewise
vanishes, and so does the resultant angular momentum of spin.

Thus, in considering the total angular momentum of an atom, all 
closed subshells can be ignored. An atom of a noble gas in its normal 
state, in which the electrons are all grouped into closed subshells, is thus 
necessarily in a state with J = 0. An atom containing one or more 
peripheral electrons outside the inner closed subshells will have angular 
momenta, orbital, spin, or total, that are determined entirely by the 
peripheral electrons, with no contributions from the closed subshells. 
The consideration of atomic angular momenta is thereby greatly 
simplified. 


17.2 Alkali-type Spectra 


At this point we may to advantage describe the gross features of the
simplest type of spectra, that emitted by atoms containing a single valence
electron outside one or more closed subshells. Among neutral atoms of
this sort may be mentioned those of the alkali metals, lithium, sodium, 
potassium, rubidium, and cesium. Other examples are singly ionized 
atoms of beryllium, magnesium, calcium, strontium, or barium; doubly 
ionized atoms of boron or aluminum; and so on. 

In an atom of this type only the valence electron is active, and only 
its quantum numbers vary. For the moment, let us disregard electron 
spin. The energy of an atom containing a single outer electron depends 
on the quantum numbers n and l. The latter specifies the orbital angular
momentum not only of the valence electron but, in this case, also
of the entire electron system of the atom, since contributions from the
inner closed subshells vanish. However, when there are two or more
electrons outside of closed subshells, this total orbital angular momentum
is the resultant of that associated with the individual electrons. The total
orbital angular momentum of the electron system is √(L(L + 1)) ħ (Sec.
17.6), where L is zero or integer; for the alkalis L = l, of course. Energy
levels of atoms are commonly labeled with capital letters according to
the following scheme:

    L         0    1    2    3    4    5    6    7
    Symbol    S    P    D    F    G    H    I    K


In general, it is customary to use for the electron system of an atom 
capital letters L, S, J to characterize the resultants of the angular 
momenta represented by l, s, and j for the individual electrons. 


Fig. 17.1 The energy levels between 50,000 and 2000 cm⁻¹ of the five alkali metals
and of hydrogen. All fine structure is ignored.


The gross features of the energy levels of the alkalis are shown in
Fig. 17.1; the doublet fine structure, characteristic of all save the S
levels and discussed in the following section, is not shown. In this
figure the energies are given in terms of wavenumbers, i.e., the number
of waves per unit length, 1/λ, since it is wavelength which the optical
spectroscopist measures. For the greatest L the energy is close to that
of the corresponding hydrogen level. For smaller L the energy levels
lie lower, by virtue of the penetrating-orbit effect (Sec. 15.6).

Spectral lines are emitted when the valence electron makes a transition
from one level to another, the energy of the photon being equal to
the energy difference between the two states. Only certain transitions
compatible with the selection rules are allowed. The quantum number
L is usually subject to the selection rule

$$\Delta L = 0, \pm 1 \qquad \text{for jumping electron } \Delta l = \pm 1 \tag{17.5}$$

Hence, no dipole jumps occur, for example, between S and F levels.
In the case of a single active electron, where L = l, ΔL = ±1 only.

For the alkalis the most prominent lines arise from transitions from
P to S levels. They are called principal series lines. A line arising
from an S → P transition belongs to one of the sharp series (Sec. 9.2),
while transitions from D to P levels lead to diffuse series lines. Bergmann
found a series of sodium lines in the infrared which originated in
F → D transitions and for which the energies were close to those of some
hydrogen lines. This was called a fundamental series. The identification
thus made of the terms in the alkali spectra constitutes the historical
reason for the use by spectroscopists of the mysterious letters S, P, D,
F (from the words sharp, principal, diffuse, fundamental) to represent
various values of L (or the corresponding small letters for one electron).
For higher values of L it was agreed later to continue down the alphabet,
skipping J.


17.3 Fine Structure in Alkali Spectra 


By virtue of the spin-orbit interaction the energy of the atom depends
also on the relative orientation of the spin and orbital angular momenta,
Eq. (14.19). For an alkali atom the resultant spin angular momentum is
√((1/2)(3/2)) ħ, since the contributions from the closed cores are zero; only
the valence electron gives the electron system a net spin angular momentum.
Once again it is customary to use capital letters for quantum
numbers for the complete electron system, and S is used to designate the
total spin quantum number for the atom. In this case S = 1/2.

For a sodium atom in its ground state the valence electron has
n = 3, l = 0, and s = 1/2; for the electron system L = 0, S = 1/2, J = 1/2.
It is customary to write this state 3s ²S_1/2 or 3 ²S_1/2 (read "three doublet
ess one-half"), where the first number stands for n, the superscript is
2S + 1, the letter gives L, and the subscript J. Other states are specified
in the same way; for example, 4 ²D_3/2 conveys the information that
the outer electron has n = 4, the spin quantum number S is 1/2, L = 2,
and J = 3/2.

The effect of spin-orbit interaction is to split the energy levels or
terms into two, one with J = L + 1/2 and the other with J = L − 1/2.
Only the S terms (L = 0) are single, with J = 1/2. Almost always the
level with the larger J is found experimentally to have the higher energy.
As an example, a diagram of the energy levels for neutral sodium is
shown in Fig. 17.2. Each level is represented in this diagram by a
short horizontal line. The diagram shows the splitting of the P levels;
the subscripts on S, P, etc., denote J values distinguishing fine-structure
sublevels, which are not shown separately for D and F. Plotted are
binding energies measured downward from zero for the state of ionization
(with no kinetic energy in the removed electron); these terms are
expressed in wavenumber units. The numbers written opposite the
levels are the values of n. The most important spectral lines are indicated
by oblique lines with wavelengths in Å. One weak line is shown
(3S ← 3D)¹ that violates the ΔL = 0, ±1 selection rule.

Fig. 17.2 Energy-level diagram for neutral sodium, showing the origin of the stronger
spectral lines. Hydrogen levels are shown for comparison. (Wavelengths in
angstrom units; the numbers opposite the levels are values of n.)


By far the most prominent features of the sodium spectrum are the
D lines in the yellow region of the spectrum. They come from
3 ²S_1/2 ← 3 ²P_1/2 and 3 ²S_1/2 ← 3 ²P_3/2 transitions. There are four
quantum states (M_J = 3/2, 1/2, −1/2, −3/2) associated with the P_3/2 level
and two with P_1/2. It is essentially twice as likely that an excited Na
atom be in a P_3/2 level as in the corresponding P_1/2 level and, as a
consequence, the shorter-λ member of the
Na D doublet is twice as intense as the other member. Transitions
between two given nL terms give rise to several spectral lines, forming
a spectral multiplet (Fig. 17.3). In accord with the selection rule
ΔJ = 0, ±1, jumps are possible between the single level of any S term,
²S_1/2, and either of the two levels in a P term, ²P_1/2 and ²P_3/2. Between a
P and a D term, with levels ²D_3/2 and ²D_5/2, three jumps are possible, viz.,

    ²P_3/2 ← ²D_5/2      ²P_3/2 ← ²D_3/2      ²P_1/2 ← ²D_3/2

(The jump ²P_1/2 ← ²D_5/2 is forbidden, since ΔJ = 2.) The resulting trio
of lines is known as a compound doublet. Thus the chief spectrum of the
alkali metals consists of spectral doublets and compound doublets, hence the
superscript 2 in the level designation.

¹ It is customary for spectroscopists to write the final state first and then the
initial one. We shall follow this custom, using an arrow between level designations
as a reminder.

The relative intensities of the lines of a compound doublet may be
calculated from the Burger-Dorgelo-Ornstein sum rule:

The sum of the intensities of all the lines of a multiplet which arise from the same
initial state or terminate at the same final state is proportional to the statistical weight
2J + 1 of the initial or final state respectively.

For the case of the ²P ← ²D multiplet, let the relative intensities of the
lines be α, β, γ, as shown in Fig. 17.3. Applying the sum rule to the
initial state gives β/(α + γ) = 3/2 and to the final state (α + β)/γ = 2,
from which α:β:γ = 1:9:5. This sum rule is of great importance in
the practical analysis of spectra, since the spectroscopist is first faced
with a large number of lines which must be organized very carefully
before a term diagram such as Fig. 17.2 can be established. The sum
rule gives reliable predictions only for small multiplet splitting.
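
The arithmetic behind the ratio 1:9:5 can be checked with a short sketch. It assumes the labeling α for ²P_3/2 ← ²D_3/2, β for ²P_3/2 ← ²D_5/2, and γ for ²P_1/2 ← ²D_3/2; the function names are illustrative only and are not taken from the text.

```python
from fractions import Fraction


def weight(j):
    """Statistical weight 2J + 1 of a level with quantum number J."""
    return 2 * j + 1


def compound_doublet_ratios():
    """Solve the two sum-rule conditions for alpha : beta : gamma.

    Assumed labeling (hypothetical, chosen for this sketch):
      alpha: 2P_3/2 <- 2D_3/2, beta: 2P_3/2 <- 2D_5/2, gamma: 2P_1/2 <- 2D_3/2
    """
    j12, j32, j52 = Fraction(1, 2), Fraction(3, 2), Fraction(5, 2)
    # Initial (D) levels: beta / (alpha + gamma) = weight(5/2) / weight(3/2)
    r_initial = weight(j52) / weight(j32)
    # Final (P) levels: (alpha + beta) / gamma = weight(3/2) / weight(1/2)
    r_final = weight(j32) / weight(j12)
    alpha = Fraction(1)  # fix the overall scale
    # Substituting beta = r_initial * (alpha + gamma) into the second condition
    gamma = alpha * (1 + r_initial) / (r_final - r_initial)
    beta = r_initial * (alpha + gamma)
    return alpha, beta, gamma


if __name__ == "__main__":
    print(compound_doublet_ratios())
```

Exact fractions are used so that the weights 6/4 and 4/2 are not blurred by floating-point rounding.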


Fig. 17.3 Transitions between doublet levels and (below) relative
intensities of the lines. If S lies above P, the order of the
lines is reversed; similarly if P lies above D.

As confirmation of the association of multiplet structure and the
term diagram, the wavenumbers and separations of the first four lines
of each of the chief series in the spectrum of neutral sodium are listed
in Table 17.1. The theoretical conclusions are seen to be confirmed by
the observations. The sharp series, 3P ← nS, consists of spectral doublets,
and, as shown in the last column of Table 17.1, the frequency difference
is constant within the experimental errors, representing the difference
between the 3 ²P_1/2 and 3 ²P_3/2 levels. This fact helps also to confirm the
identification of the S levels as made above.

The principal series of lines, 3S ← nP, also consists of doublets but
with frequency differences that represent the spacings between the two
levels in various P terms and hence decrease rapidly with increasing n.
The separation of the two levels in a term with quantum numbers n and
l, due to the spin-orbit effect, is calculated from Eq. (14.23) to be

$$\Delta\bar\nu = \frac{E_{l+1/2} - E_{l-1/2}}{hc} = \frac{Z_{\text{eff}}^4 e^2 \hbar^2 \left[(l+\tfrac12)(l+\tfrac32) - (l-\tfrac12)(l+\tfrac12)\right]}{(hc)\,16\pi\epsilon_0 m^2 c^2 a_0^3\, n^3\, l(l+\tfrac12)(l+1)} = \frac{\alpha^2 R Z_{\text{eff}}^4}{n^3 l(l+1)} = 5.84\,\frac{Z_{\text{eff}}^4}{n^3 l(l+1)}\ \text{cm}^{-1} \tag{17.6}$$

where Z_eff = effective atomic number
      R = Rydberg constant
      α = fine-structure constant, Eq. (9.10)

Thus in a one-electron atom the spin-orbit separation decreases as 1/n³.
The observed decrease in the spectrum of sodium is even more rapid,
presumably because the effective value of Z decreases as n increases.
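
As a rough numerical illustration of the last form of Eq. (17.6), the sketch below evaluates α²R Z_eff⁴/[n³ l(l + 1)] and inverts it to estimate an effective Z from an observed doublet interval. The constants are standard CODATA values and the function names are hypothetical; the use of R∞ rather than the Rydberg constant for a particular nucleus is an assumption that does not matter at this precision.

```python
ALPHA = 7.2973525693e-3      # fine-structure constant (CODATA value)
RYDBERG_CM = 109737.31568    # Rydberg constant R_infinity in cm^-1 (assumed here)


def doublet_splitting_cm(z_eff, n, l):
    """Spin-orbit doublet separation in cm^-1 from the last form of Eq. (17.6)."""
    if l < 1:
        raise ValueError("S terms (l = 0) are single; no doublet splitting")
    return ALPHA**2 * RYDBERG_CM * z_eff**4 / (n**3 * l * (l + 1))


def effective_z(splitting_cm, n, l):
    """Invert Eq. (17.6): the Z_eff that would reproduce an observed interval."""
    return (splitting_cm * n**3 * l * (l + 1) / (ALPHA**2 * RYDBERG_CM)) ** 0.25


if __name__ == "__main__":
    # Hydrogen 2p doublet (Z_eff = 1, n = 2, l = 1)
    print(round(doublet_splitting_cm(1, 2, 1), 3))
    # Z_eff implied by the 17.2 cm^-1 interval of the sodium 3P term
    print(round(effective_z(17.2, 3, 1), 2))
```

Under these assumptions the sodium 3P interval corresponds to an effective Z of roughly 3.5, well below the nuclear charge of 11, which is consistent with strong screening by the closed shells.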

The lines of the diffuse series, 3P ← nD, are compound doublets.
One line should be very weak, however. The two brightest lines result
from the jumps 3 ²P_1/2 ← n ²D_3/2 and 3 ²P_3/2 ← n ²D_5/2. If Eq. (17.6) holds
roughly for sodium, because of the factor l(l + 1) in the denominator,
the separation nD_5/2 − nD_3/2 should stand to the separation nP_3/2 − nP_1/2
in the ratio 1/(2 × 3):1/(1 × 2) or as only 1:3; and, as n increases, the
D separation should rapidly diminish further. With ordinary resolving
power the lines of the diffuse series are observed as apparent doublets
with a frequency difference that is nearly constant and equal to the
difference 3P_3/2 − 3P_1/2 (Table 17.1). Doublets of exactly this separation
are formed by the faint line 3P_3/2 ← nD_3/2 and the brighter line
3P_1/2 ← nD_3/2.

Thus, the theory accounts very well for the principal features of the
spectrum emitted by neutral sodium atoms. It is equally successful with
the other alkali metals, the spectra of which are qualitatively very similar
to that of sodium.

Table 17.1 Doublet intervals in the Na I spectrum

Series                       J     Initial-level    λ, Å         ν̄, cm⁻¹      Interval
                                   term value,                                Δν̄, cm⁻¹
                                   cm⁻¹

Principal:                   1/2   24,493.47         5,895.92    16,956.18    17.20
3 ²S_1/2 ← n ²P_J            3/2   24,476.27         5,889.95    16,973.38
3S term value,               1/2   11,182.77         3,303.32    30,266.88     5.63
41,449.65 cm⁻¹               3/2   11,177.14         3,302.99    30,272.51
                             1/2    6,409.38         2,853.03    35,040.27     2.52
                             3/2    6,406.86         2,852.83    35,042.79
                             1/2    4,153.14         2,680.44    37,296.51     1.25
                             3/2    4,151.89         2,680.34    37,297.76

Sharp:                       1/2   15,709.8         11,381.2      8,783.7     17.2
3 ²P_J ← n ²S_1/2            3/2                    11,403.6      8,766.5
Term values, cm⁻¹:           1/2    8,249.0          6,154.23    16,244.5     17.2
3 ²P_1/2  24,493.47          3/2                     6,160.75    16,227.3
3 ²P_3/2  24,476.27          1/2    5,077.0          5,148.84    19,416.5     17.2
                             3/2                     5,153.40    19,399.3
                             1/2    3,437.6          4,747.94    21,055.9     17.2
                             3/2                     4,751.82    21,038.7

Diffuse:                     1/2   12,276.8          8,194.82    12,216.7     17.2
3 ²P_J ← n ²D†               3/2                     8,183.26    12,199.5
                             1/2    6,900.9                      17,592.6     17.2
                             3/2                                 17,575.4
                             1/2    4,412.9                      20,080.6     17.2
                             3/2                                 20,063.4
                             1/2    3,062.4                      21,431.1     17.2
                             3/2                                 21,413.9

† Only the mean value of the D_3/2 and D_5/2 levels is shown and only the two strong lines.
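
The internal consistency of the principal-series entries can be checked with a small sketch: each line wavenumber should equal the 3S term value minus the term value of the initial nP level, and the doublet interval is the difference of the two lines. The numbers are copied from Table 17.1 (the 3 ²P_3/2 value is taken as 24,476.27 cm⁻¹, as quoted under the sharp series); the variable and function names are hypothetical.

```python
T_3S = 41449.65  # 3S term value in cm^-1, from Table 17.1

# Term values in cm^-1 of the n 2P_1/2 and n 2P_3/2 levels, keyed by n.
P_TERMS = {
    3: (24493.47, 24476.27),
    4: (11182.77, 11177.14),
    5: (6409.38, 6406.86),
    6: (4153.14, 4151.89),
}


def principal_lines(t_lower=T_3S, p_terms=P_TERMS):
    """Return {n: (nu for J=1/2, nu for J=3/2, doublet interval)} in cm^-1."""
    rows = {}
    for n, (t_half, t_three_half) in p_terms.items():
        nu_half = round(t_lower - t_half, 2)
        nu_three_half = round(t_lower - t_three_half, 2)
        rows[n] = (nu_half, nu_three_half, round(nu_three_half - nu_half, 2))
    return rows


if __name__ == "__main__":
    for n, row in principal_lines().items():
        print(n, row)
```

The computed wavenumbers and intervals reproduce the ν̄ and Δν̄ columns of the principal series, which is the combination-principle check the table is meant to illustrate.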
The separation of the doublet levels increases rapidly with increasing
Z. In Table 17.2 are shown the wavelengths and the wavenumbers
of the D lines or their analogs, i.e., the first lines of the principal series,
for all the alkali metals and also the doublet differences for these lines.
The enormous increase in the doublet separation with Z for alkali metals,
in spite of the progressive increase in n, is in strong contrast with the
more moderate variation in the wavelength of the lines in question.
This results from great sensitiveness of the spin-orbit effect to the
character of the central field near the nucleus.

Table 17.2 First lines of the principal series for the alkali metals

                               Li        Na        K         Rb        Cs

Z                              3         11        19        37        55
n                              2         3         4         5         6
λ(²S_1/2 ← ²P_1/2), Å          6,707.8   5,895.9   7,699.0   7,947.6   8,943.5
λ(²S_1/2 ← ²P_3/2), Å                    5,890.0   7,664.9   7,800.2   8,521.1
ν̄(²S_1/2 ← ²P_1/2), cm⁻¹       14,904    16,956    12,985    12,579    11,178
ν̄(²S_1/2 ← ²P_3/2), cm⁻¹                 16,973    13,043    12,817    11,732
Δν̄, cm⁻¹                       0.34      17        58        238       554


17.4 The Spectrum of Helium 


During the eclipse of 1868 Janssen detected helium lines in the solar
spectrum. Lockyer suggested the name helium (helios is the Greek word
for sun) for this element, spectral lines from which were observed many
years before Ramsay first isolated helium on the earth in 1895. Long
before the helium spectrum was understood, measurements of the wavelengths
of the spectral lines had led to the energy-level diagram shown
in Fig. 17.4, in which there are two sets of levels. No transitions were
observed from a level in one set to a level in the other. At first it was
believed that there might be two kinds of helium, called orthohelium and
parahelium, one associated with each of the term systems. The parahelium
terms led to series of singlet lines, while the orthohelium terms
led to series of triplets. When efforts to separate these two types of
helium proved futile, the two systems of terms were eventually explained
in an entirely different way, viz., an orthohelium atom has two electrons
with spins parallel and a parahelium atom has two electrons with spins
antiparallel (see Sec. 15A.2).

The resultant spin angular momentum for two electrons is given
by √(S(S + 1)) ħ, where

$$S = s_1 \oplus s_2 = \tfrac12 \oplus \tfrac12 \tag{17.7}$$

which has possible values 1 or 0. Here ⊕ implies quantized vector
addition, as in Sec. 14.8. For orthohelium S = 1; for parahelium S = 0.

Fig. 17.4 Term diagram for neutral helium showing the division of terms into
noncombining singlet and triplet terms.

An outstanding feature of the triplet terms of orthohelium is that
there is no ground state corresponding to that of the singlet system.
According to the Pauli exclusion principle, two electrons in the n = 1,
l = 0, m_l = 0 state cannot have the same value of m_s. Thus only for
parahelium can the ground state involve a 1s² configuration; for orthohelium
the lowest configuration is 1s2s.

For parahelium S = 0, so J = L ⊕ S = L. Since for each value
of L there is a single value of J, the energy levels are singlets with no
fine structure and are designated by ¹S_0, ¹P_1, ¹D_2, etc. For orthohelium
S = 1, and so J = L ⊕ S has the values L + 1, L, L − 1 (but never
minus values). Thus, except for L = 0 levels, each nL state has three
fine-structure levels. Orthohelium levels are triplets, designated by such
symbols as ³S_1, ³P_0, ³P_1, ³P_2, ³D_1, and so forth.
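
The quantized addition J = L ⊕ S used above can be sketched in a few lines. The helper names below are illustrative, and the labels are written in a flat ASCII form (for example "3P_2" for ³P_2) purely for convenience.

```python
from fractions import Fraction

LETTERS = "SPDFGHIK"  # spectroscopic letters for L = 0, 1, 2, ... (J is skipped)


def allowed_j(L, S):
    """J values of L (+) S: |L - S|, |L - S| + 1, ..., L + S."""
    L, S = Fraction(L), Fraction(S)
    j = abs(L - S)
    values = []
    while j <= L + S:
        values.append(j)
        j += 1
    return values


def term_symbols(L, S):
    """Level labels of the form '<2S+1><letter>_<J>' for integer L."""
    multiplicity = int(2 * Fraction(S) + 1)
    return [f"{multiplicity}{LETTERS[L]}_{j}" for j in allowed_j(L, S)]


if __name__ == "__main__":
    print(term_symbols(1, 0))  # parahelium P term
    print(term_symbols(1, 1))  # orthohelium P term
    print(term_symbols(0, 1))  # orthohelium S term
```

Note that an L = 0 triplet term still yields a single level, which is why the orthohelium S levels show no fine structure even though they carry the superscript 3.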


17.5 Many-electron Wave Theory 


When an atom contains two or more electrons outside closed subshells,
it is generally true that in a radiative transition only one electron is
active but the atomic energy levels between which the jump occurs are
themselves influenced by the presence of the other electrons. As a
basis for the discussion of such cases, we describe briefly the various
interactions which must appear in the complete hamiltonian function
required to calculate the wave function for a neutral atom. Although
the exact relativistic wave equation for a multielectron atom is not known,
an approximate equation can be written in which the following terms may
occur:


1. Terms representing the kinetic energy of the electrons; for the jth electron
   this takes the form −(ħ²/2m)∇_j².

2. Potential-energy terms representing the electrostatic interaction energy of
   each electron with the nucleus.

3. Potential-energy terms representing the mutual electrostatic energy of the
   electrons.

4. Spin-orbit terms of the form of Eq. (14.23) for electrons with l > 0.

5. Mixed spin-orbit terms representing interaction between each spin magnetic
   moment and the orbital motions of other electrons.

6. Spin-spin interactions between the spin magnetic moments of the electrons.

7. Interactions between the magnetic moments arising from the orbital
   motion of the electrons.

8. Relativistic corrections, presently only partly known.

9. Terms allowing for nuclear motion, for interaction between nuclear spin and
   the electron magnetic moments, interactions of electrons with the nuclear
   quadrupole moments, and so forth.

Fortunately, interactions associated with classes 5 to 9 are ordinarily
small compared with the first four. The largest term in the
potential energy is that of the nuclear electrostatic interaction (class 2),
followed by that of class 3. The latter may be split into two parts: a
spherically symmetrical one and a nonspherically symmetric one. The first
part we have taken into account in terms of shielding constants, which
have the effect of reducing the nuclear charge from Z to Z_eff. The
second part is associated with electrons outside of closed subshells and
has not been treated as yet. Particularly for light elements, the effect
of the nonspherically symmetric electrostatic field is the largest of the
remaining terms, but for heavy elements the spin-orbit interaction is
greatest. In developing the theory one considers the largest terms first
and then treats smaller terms as perturbations. We shall discuss first
the situation in which the residual electrostatic interaction dominates
(LS coupling) and later that in which the spin-orbit interaction dominates
(jj coupling).


a. Electronic Configurations and Coupling Schemes  The possible states
of a single electron in the modified central field are characterized by
quantum numbers n and l. An assignment of n's and l's for all Z electrons
in the atom defines an electronic configuration. For example, a
C atom in a 1s²2s²2p3d configuration has a 1s² core, two electrons in
2s states, one in a 2p and another in a 3d state. Within this configuration
a number of different energy states (or energy levels) are possible, each
with its own total energy. Often the words term or term value are used
in place of energy level, a hangover from the work of Rydberg and Ritz
(Sec. 9.3). When a radiative transition occurs, the photon has energy
hν equal to the energy difference between initial and final states. For
optical spectra, only the electrons outside of closed subshells undergo
transitions.

An important selection rule holds for configuration changes. In
general, radiative transitions occur only between configurations that
differ in just one of the n,l electronic states; furthermore, the values of
l in the initial and final states must differ by unity. Thus transitions
may occur between levels belonging to a 2s3p configuration and a 2s3d
configuration, but not between 2s3p and 2p3d because both l's are different,
and not between 2s3p and 2s4f because Δl = 2. The selection
rule Δl = ±1 diminishes enormously the number of transitions that
have to be considered in spectroscopy.†
† The rule holds strictly so long as the wave function corresponds exactly to
a single configuration. In some cases, it contains components of appreciable
magnitude belonging to other configurations, for which Σl differs
by an even integer. In such cases two-electron jumps may occur
in which two electronic l's change, with Δl = 2 for the second.
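
The configuration rule just stated (exactly one n,l state changes, and its l changes by unity) can be expressed as a small check. This is a sketch under simplifying assumptions: configurations are written as space-separated orbitals such as "2s 3p", closed inner subshells are omitted, and the function names are hypothetical. It ignores the configuration mixing mentioned in the footnote.

```python
from collections import Counter

# l value for each spectroscopic letter (s, p, d, f, g, h, i, k)
L_OF = {letter: l for l, letter in enumerate("spdfghik")}


def orbitals(config):
    """Parse e.g. '2s 3p' into a multiset of (n, l) pairs."""
    return Counter((int(orb[:-1]), L_OF[orb[-1]]) for orb in config.split())


def dipole_allowed(initial, final):
    """True if exactly one (n, l) state differs and its l changes by 1."""
    a, b = orbitals(initial), orbitals(final)
    lost = list((a - b).elements())    # states present only initially
    gained = list((b - a).elements())  # states present only finally
    if len(lost) != 1 or len(gained) != 1:
        return False
    return abs(lost[0][1] - gained[0][1]) == 1


if __name__ == "__main__":
    print(dipole_allowed("2s 3p", "2s 3d"))  # one electron, delta l = 1
    print(dipole_allowed("2s 3p", "2p 3d"))  # both l's change
    print(dipole_allowed("2s 3p", "2s 4f"))  # delta l = 2
```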
17.6 LS, or Russell-Saunders, Coupling


In atoms that are not too heavy, the residual electrostatic interaction
has a relatively large effect. For simplicity of discussion, let it be
assumed for the present that spin-orbit interactions are initially negligible.
Consider two or more electrons outside of closed subshells. By
virtue of the residual electrostatic repulsions, which are not directed
toward (or away from) the nucleus, the individual orbital angular
momenta of these electrons are no longer constant, but their resultant
is unaffected by these forces. This resultant angular momentum A_L
has a magnitude √(L(L + 1)) ħ, where

$$L = l_1 \oplus l_2 \oplus l_3 \oplus \cdots \tag{17.8a}$$

Here ⊕ means quantized vectorial addition (Fig. 17.5) such that if
l_1 = 2 and l_2 = 3, L = l_1 ⊕ l_2 = 5, 4, 3, 2, or 1. If l_3 = 1, L = l_1 ⊕ l_2 ⊕
l_3 = 6, 5, 4, 3, 2, 1, or 0. The (degenerate) quantum states can also
be chosen so that the component of A_L in the direction of any chosen
axis is M_L ħ, where the 2L + 1 alternative values of M_L are integrally
spaced from −L to L. Thus Lħ is the maximum observable value of the
component of A_L in any direction.
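
The successive quantized addition in Eq. (17.8a) can be illustrated with a minimal sketch that couples integer l values one at a time. The function names are hypothetical, and the sketch lists only which L values are possible, not how many independent terms share each L.

```python
def couple(l1, l2):
    """Possible resultants of l1 (+) l2: |l1 - l2|, ..., l1 + l2."""
    return list(range(abs(l1 - l2), l1 + l2 + 1))


def couple_all(ls):
    """Possible total L for l1 (+) l2 (+) l3 (+) ..., coupled successively."""
    resultants = {ls[0]}
    for l in ls[1:]:
        resultants = {L for partial in resultants for L in couple(partial, l)}
    return sorted(resultants)


if __name__ == "__main__":
    print(couple_all([2, 3]))     # l1 = 2, l2 = 3
    print(couple_all([2, 3, 1]))  # adding l3 = 1
```

Coupling in stages, rather than all at once, mirrors the way the text builds the three-electron case from the two-electron result.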

In addition to orbital angular momentum √(l(l + 1)) ħ, each electron
has spin angular momentum √((1/2)(3/2)) ħ. As the result of a strong
quantum-mechanical exchange correlation between the spins of electrons,
there is an effect which tends to align electron spins parallel to each
other and which, further, tends to keep electrons with parallel spin away
from each other. The spin angular momenta have a resultant A_S of
magnitude √(S(S + 1)) ħ, where

$$S = s_1 \oplus s_2 \oplus s_3 \oplus \cdots = \tfrac12 \oplus \tfrac12 \oplus \tfrac12 \oplus \cdots \tag{17.8b}$$

The component of A_S in the z-direction has a value M_S ħ, where M_S is
any one of the integrally spaced numbers S to −S. Since each electronic
spin can contribute at most ħ/2 to the axial component, the maximum
possible value of M_S, and hence the value of S, is N/2 for an atom
containing N electrons outside of closed subshells. Both S and M_S are
thus integers or zero, or half integral, according as N is even or odd.
In some configurations, two or more independent sets of LS quantum
states occur having the same values of L and S.

Fig. 17.5 Schematic diagram suggesting how l_1 ⊕ l_2 gives L = 0, 1, or 2 when
l_1 = 1, l_2 = 1 and gives L = 1, 2, or 3 when l_1 = 2, l_2 = 1.

Zero-order L,S,M_L,M_S wave functions are easily formed as linear
combinations of the n,l,m_l,m_s functions. The L,S,M_L,M_S quantum states
are not usually characterized by definite values of the individual m_l's
and m_s's; the individual axial components of electronic angular momentum
are not definite (except in a few cases). This corresponds to the
fact in classical mechanics that fixing the vector sums A_L and A_S does
not fix the individual vector momenta A_l and A_s. Repulsion between
electrons would usually have little average effect on the magnitude of
any individual A_l; it would merely speed the electron up and then slow
it down again during its orbital motion. A component of this repulsion
normal to the orbital plane, on the other hand, would cause a precession
of the individual A_l vectors about the fixed direction of their resultant
A_L, in analogy with the effect of a torque applied to a gyrostat. Such
precessional motions symbolize vividly the necessity of abandoning the
m_l's and m_s's and quantizing only the resultant momentum when full
account is taken of electronic repulsion. The motions themselves, however,
have no analog in the wave-mechanical description of the quantum
states.

Simple selection rules hold for L and S; this is the most important
property of such quantum numbers. The rules are

$$\Delta L = 0 \text{ or } \pm 1 \ (\text{and not } 0 \to 0) \tag{17.9a}$$

$$\Delta S = 0 \tag{17.9b}$$

The restriction that ΔS = 0 corresponds to the fact in classical
mechanics that the existence of spin and of its associated magnetic
moment has no appreciable (direct) effect upon the emission of radiation.
Because of this selection rule, in the absence of all spin-orbit interaction,
the atomic energy levels fall into broad, noncombining classes, each
distinguished by a different value of S; levels belonging to one class
combine in the emission or absorption of spectral lines only with other
levels belonging to the same class, as we saw for helium. The selection
rule for L refers, however, only to dipole emission of radiation. Weak
lines violating these rules are frequently seen, for example, λ 3427 in
Fig. 17.2.

The occurrence of atomic states characterized by quantum numbers
L and S obeying the selection rules just stated is called LS or
Russell-Saunders coupling (of the electronic momenta).


The LS Terms  In zero order, only a single energy level belongs to a
given configuration such as 3p4d. The residual electrostatic interaction
then splits this single level into several levels belonging to different
pairs of values of L and S. That this interaction should separate levels
differing in L is reasonable. In classical theory, different values of L
would imply different electronic orbits, and the average values of the
repulsive potential energy of the electrons should therefore also be different.
It may seem strange, however, that states differing only in S
should likewise be separated in energy by electronic repulsions. The
reason for such an effect lies in the different spatial symmetries of the
wave functions associated with different values of S. The separation
has little to do with the mutual energy of the spin magnets, although
this would cause a slight separation if other causes were not active.
In the mathematics, the separation of levels having different S results
from exchange integrals of the electronic mutual energies. These integrals
have no classical analog. Wave mechanics thus introduces a distinction
that is absent in classical theory.


17.7 LS Multiplets of Levels 


To explain the observed fine structure of the spectral lines, account must
be taken of the spin-orbit interactions as a second perturbing energy much
smaller (in light atoms) than the residual electrostatic energy. The
spin-orbit interactions disturb both the orbital and the spin momenta
but not the grand vector sum of all angular momenta. In the classical
analog, the vectors A_L and A_S, unless actually parallel, would precess
steadily about their resultant A_J, which has the magnitude √(J(J + 1)) ħ.
Here

$$J = L \oplus S$$

so that the possible values of J that can occur in a given LS term are
all those integrally spaced values that satisfy the inequality |L − S| ≤
J ≤ L + S. As usual, if the z direction is established by some field,
(A_J)_z = M_J ħ, where M_J = J, J − 1, . . . , −J. Thus, if L = 0, it follows
that J = S; if S = 0, J = L; in either case the level is single. If
L > S, J takes on all the 2S + 1 integrally spaced values from L + S
down to L − S, inclusive; whereas if L < S, J takes on the 2L + 1
integrally spaced values from S + L to S − L. Because of these relations,
the observed number of levels composing a given multiplet obviously
furnishes important information about the values of L or of S that should
be assigned to the term. The observed number of levels r must equal
either 2L + 1 or (more commonly) 2S + 1; hence, either L = (r − 1)/2
or, more likely, S = (r − 1)/2. A further test must then be made by
noting whether or not the value of L or S so inferred is consistent with
the selection rules as applied to transitions between the given multiplet
of levels and other multiplets for which L or S may be assumed to be
known.

Fig. 17.6 The fine-structure splitting arising from a 4p4d configuration.
The Landé interval rule is illustrated by the spacings of the triplet levels.
(From R. B. Leighton, "Principles of Modern Physics." Copyright 1959.
McGraw-Hill Book Company. Used by permission.)
Unless L or S is zero, the single energy level belonging to a given
LS term is thus split by the spin-orbit interactions into sublevels characterized
by different values of J. The various L,S,J levels arising in
this way from the 4p4d configuration are shown in Fig. 17.6. This
configuration gives rise, by electrostatic interaction and the exchange
correlation between spins, to singlet and triplet P, D, and F terms.
The triplet levels in turn are split by the spin-orbit interaction as shown.

All allowed transitions between two multiplets of levels, taken
together, give rise to a group of spectral lines which may be called a
spectral multiplet.


17.8 Spacing of the LS Multiplet Levels 
For the spacing of the levels in an LS multiplet, wave mechanics furnishes
a useful formula known as the Landé interval rule. The
increase in the atomic energy due to the spin-orbit effect, for any one
of the J levels of a given LS level multiplet, is provided by extending
Eq. (14.23) to the many-electron system

$$\Delta E = \tfrac{1}{2} B\,[J(J + 1) - L(L + 1) - S(S + 1)] \qquad (17.10)$$

Here B is a constant that varies from one level multiplet to another.
The difference between the energy of a level for J and that for J + 1
is the difference in the corresponding values of ΔE, or

$$E_{J+1} - E_J = \tfrac{1}{2} B\,[(J + 1)(J + 2) - J(J + 1)] = B(J + 1) \qquad (17.11)$$


This equation expresses Landé’s interval rule: 


The energy differences between two successive J levels are proportional, in a given
LS term, to the larger of the two values of J.

The rule is of great help in determining the values of J that are to be
assigned to various observed levels.
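As a quick check of the interval rule, the sketch below evaluates the spin-orbit shift of Eq. (17.10) for each J of a term and takes differences between neighbors. The function names `delta_e` and `intervals`, and the choice of B = 1 as the unit of energy, are assumptions made for this illustration.

```python
from fractions import Fraction

def delta_e(J, L, S, B=1):
    """Spin-orbit shift of Eq. (17.10); B defaults to 1 (arbitrary units)."""
    J, L, S = Fraction(J), Fraction(L), Fraction(S)
    return Fraction(B) * (J * (J + 1) - L * (L + 1) - S * (S + 1)) / 2

def intervals(L, S, B=1):
    """Spacings E(J+1) - E(J) between successive J levels of one LS term."""
    L, S = Fraction(L), Fraction(S)
    js = []
    j = abs(L - S)
    while j <= L + S:
        js.append(j)
        j += 1
    return [delta_e(b, L, S, B) - delta_e(a, L, S, B) for a, b in zip(js, js[1:])]

# L = 2, S = 1 (J = 1, 2, 3): each spacing equals B times the larger J of the pair
print(intervals(2, 1))
```

Each spacing returned is B times the larger J of the pair, which is the content of Eq. (17.11).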

According to Eq. (17.10), ΔE is positive for some values of J and
negative for others. The weighted average of ΔE vanishes, provided
each level is weighted in proportion to the number, equal to 2J + 1,
of the $M_J$ states composing it. The total energy of the weighted-average
level is given by the formula

$$\bar{E} = \frac{\sum_J (2J + 1)\,E_J}{\sum_J (2J + 1)} \qquad (17.12)$$

It is to such a weighted-average level that a Rydberg formula really
refers when it is written without regard to the fine structure of the terms.
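The statement that the (2J + 1)-weighted average of ΔE vanishes can be verified numerically. The sketch below is illustrative only: the function name `weighted_mean_shift` is an assumption, and the common factor ½B of Eq. (17.10) is omitted because it does not affect whether the average is zero.

```python
from fractions import Fraction

def weighted_mean_shift(L, S):
    """(2J+1)-weighted mean of J(J+1) - L(L+1) - S(S+1) over one multiplet."""
    L, S = Fraction(L), Fraction(S)
    total = Fraction(0)
    total_weight = Fraction(0)
    j = abs(L - S)
    while j <= L + S:
        weight = 2 * j + 1          # number of M_J states in the level
        total += weight * (j * (j + 1) - L * (L + 1) - S * (S + 1))
        total_weight += weight
        j += 1
    return total / total_weight

for L, S in [(1, 1), (2, 1), (3, Fraction(3, 2))]:
    print(L, S, weighted_mean_shift(L, S))
```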




17.9 The Arc Spectrum of Mercury 


The familiar arc spectrum of mercury presents spectroscopic features of
great interest. The principal levels and many of the lines are shown
in the usual way in Fig. 17.7, wavelengths being given in angstrom
units. It is instructive to note how a configuration such as 6s6p gives
rise, by electrostatic interaction, to singlet and triplet P terms, with the
latter split into $^3P_0$, $^3P_1$, and $^3P_2$ levels by the spin-orbit
interaction. The mercury energy-level diagram presents a fair example of
LS coupling for two electrons. A number of intercombination lines (between
singlet and triplet terms) are found, however, so that the LS coupling is
not perfect in mercury. The configuration for the ground state is $6s^2$;
thus the normal state of a mercury atom is $^1S_0$. In four of the most
important terms, viz., 6s6p $^1P$, 6s6p $^3P$, 6s6d $^1D$, and 6s6d $^3D$,
the mere change of an electronic l with no change in n results in a
comparatively large increment of atomic energy.

Fig. 17.7 The most important energy levels and spectral lines for neutral mercury.
The intercombination line, $6s^2\ {}^1S_0 \leftarrow 6s6p\ {}^3P_1$,
λ = 2536.52, is one of the principal ultraviolet lines in the mercury
spectrum. Its strength is associated with the fact that the LS coupling
approximation becomes less good as Z increases. For Hg (Z = 80) the
spin-orbit interaction has become large, and the selection rules for L and
S begin to fail (see Sec. 17.11). The transition probability between the
levels is high, as is shown by the intensity of the line in absorption. In
addition, the 6s6p $^3P_1$ state becomes densely populated as electrons
drop into it from higher triplet states.

Atoms accumulate also in the $^3P_2$ and $^3P_0$ levels of the 6s6p
configuration. From these levels they cannot pass by a radiative jump
to the normal state, for $^1S_0 \leftarrow {}^3P_2$ would mean ΔJ = 2, and
$^1S_0 \leftarrow {}^3P_0$ would mean a jump from J = 0 to J = 0, both of
which are forbidden by the selection rules for J. Levels lying above the
normal state out of which radiative transitions are extremely improbable
are called metastable levels. An atom can stay in such a level for a long
time if it is not disturbed by outside influences. It may eventually be
brought back into the normal state, however, by a collision of the second
kind,* and it is presumably in this manner that the atoms in the $^3P_0$
and $^3P_2$ levels are eventually returned to their normal state.
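The argument for metastability rests only on the selection rule for J (ΔJ = 0 or ±1, with J = 0 to J = 0 excluded). A minimal sketch of that one rule follows; the function name `j_allowed` is an assumption, and the rules for L, S, and configurations are deliberately not included.

```python
def j_allowed(j_upper, j_lower):
    """Selection rule for J alone: delta J = 0 or +/-1, with 0 -> 0 excluded."""
    if j_upper == 0 and j_lower == 0:
        return False
    return abs(j_upper - j_lower) <= 1

# Radiative decay of the 6s6p 3P_J levels to the 1S_0 normal state (J = 0)
for j in (0, 1, 2):
    verdict = "allowed" if j_allowed(j, 0) else "forbidden"
    print(f"3P_{j} -> 1S_0: {verdict} by the J rule")
```

Only the J = 1 level passes the test, in agreement with the observation that the 2536 line originates from $^3P_1$ while $^3P_0$ and $^3P_2$ are metastable.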

The selection rule for L is pretty well obeyed. With values of L
assigned as in the diagram, S terms combine only with P terms, P
with S and D, and so on. However, a few weak lines, not shown on the
diagram, have been observed corresponding to ΔL = 2. All the
transitions shown in Fig. 17.7 are in harmony with the selection rule for
configurations. Only one electron changes its n and l, and always Δl = ±1.

The spectral lines can be grouped into series, if desired. Thus,
within the singlet system, all lines ending on the lowest $^1S$ level form the
singlet principal series. Only these and the line λ = 2536 Å can be
observed in absorption in mercury vapor. Of the lines ending on the
lowest $^1P$ level, those originating from $^1S$ terms form a sharp series,
those originating from $^1D$ terms, a diffuse series, just as in sodium; and
so on. Similar series can be picked out within the triplet system. It
is really not very interesting to group the lines of such a complex spectrum
into series, however, especially when the fine structure is as coarse as it
is in the mercury spectrum. Thus the great spectral sextet of ultraviolet
lines, λ = 2967 to λ = 3663, from the lowest $^3D$ to the lowest $^3P$ term,
would constitute together the first “line” of the triplet diffuse series.
The lines $^1P_1$–$^1D_2$ (λ 5791), $^3P_2$–$^3S_1$ (λ 5461), and
$^3P_1$–$^3S_1$ (λ 4358) are the familiar yellow, green, and blue lines
emitted from the mercury arc.

* Collisions resulting in the excitation of a previously unexcited atom or
molecule are called “collisions of the first kind,” while collisions in which an excited
atom or molecule is returned to its ground state, the energy of excitation appearing
as increased kinetic energy of the colliding particles, are known as “collisions of the
second kind.”

It may also happen that both valence electrons are displaced into
higher electronic states. A few levels ascribed to the $6p^2$ configuration
have been discovered.

Among other neutral atoms which have spectra similar to that of
mercury may be mentioned helium; then the alkaline earths, beryllium,
magnesium, calcium, strontium, and barium; and the close relatives of
mercury, zinc and cadmium. Ions with similar electronic exteriors are
C++, Al+, Si++, Pb++ (the number of plus signs indicating the number of
positive charges on the ion).

A second subclass of atoms with two optically active electrons is
formed by those which, in their normal states, contain two s and two
p valence electrons. In such cases the two s electrons usually (but not
always) stay put, only the two p electrons being active. Examples of
such atoms are neutral carbon, silicon, germanium, tin, and lead.
Furthermore, certain observed spectra of the same type have been ascribed
to singly (positively) ionized atoms of nitrogen, phosphorus, and bismuth.


17.10 Equivalent Electrons 


When two electrons have the same n and also the same l, they are called 
equivalent electrons. In configurations containing equivalent electrons, 
such as $5s^2$ or $5s^2p^2$, certain LS terms that might otherwise occur are
excluded through the operation of the exclusion principle.

Consider, for example, the configuration $6s^2$ in mercury. Both
electrons have the same value of n, l, and $m_l$, and so they must have
opposite spin quantum numbers $m_s$. The resultant S must be zero.
Indeed for any closed shell or subshell, L = 0 and S = 0; a $^1S_0$ state
results.

As an example of one way we may formally find the possible states
for equivalent electrons, i.e., electrons in the same nl subshell, consider
a $p^2$ configuration. If we represent $m_s = \tfrac{1}{2}$ by ↑ and
$m_s = -\tfrac{1}{2}$ by ↓, we begin by producing an array like that of
Table 17.3, in which we show all possible combinations of spin and
angular-momentum components consistent with the Pauli exclusion principle.
The resultant $M_S$ is $\sum m_s$, while $M_L$ is $\sum m_l$.

Table 17.3 Terms for two equivalent p electrons

    m_l = 1   m_l = 0   m_l = -1    M_L    M_S    Term
    ↑         ↑                      1      1     *  ($^3P$)
    ↑                   ↑            0      1     *
    ↑↓                               2      0     †  ($^1D$)
    ↑         ↓                      1      0     *
    ↓         ↑                      1      0     †
    ↑                   ↓            0      0     *
    ↓                   ↑            0      0     †
              ↑↓                     0      0     $^1S$
    ------------------------------------------------------
    ↓         ↓                      1     -1     *
    ↓                   ↓            0     -1     *
              ↑         ↓           -1      0     *
              ↓         ↓           -1     -1     *
              ↓         ↑           -1      0     †
              ↑         ↑           -1      1     *
                        ↑↓          -2      0     †

For this case the largest value of $M_S$ is 1, and the largest $M_L$
associated with $M_S = 1$ is also 1. This suggests a $^3P$ term for which
$M_L$ can be 1, 0, −1, and $M_S$ 1, 0, −1. The nine entries marked by
asterisks satisfy these requirements. (All nine must appear in the array;
which of the terms with $M_L = 1$, $M_S = 0$ is chosen is of no importance
here.) The highest remaining $M_S$ is 0, and the highest associated $M_L$
is 2, so that there is a $^1D$ term. This requires $M_L = 2, 1, 0, -1, -2$;
five appropriate combinations are marked by daggers. When this is done,
there remains unassigned a single $M_L = 0$, $M_S = 0$, which corresponds
to a $^1S$ state. Thus the possible terms for a $p^2$ configuration are
$^3P_{0,1,2}$, $^1D_2$, and $^1S_0$; many fewer than a pair of
nonequivalent p electrons (np, n′p) which has the possibilities
$^3D_{1,2,3}$, $^1D_2$, $^3P_{0,1,2}$, $^1P_1$, $^3S_1$, and $^1S_0$.
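The tabulation procedure just described can be mechanized. The following sketch is an illustration under stated assumptions rather than a procedure given in the text: the function name `ls_terms` and the letter string `LETTERS` are introduced here. It enumerates every Pauli-allowed assignment of $(m_l, m_s)$, tallies the resulting $(M_L, M_S)$ pairs, and then repeatedly removes the term with the largest remaining $M_S$ and the largest $M_L$ that goes with it.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations

LETTERS = "SPDFGHIK"  # spectroscopic letters for L = 0, 1, 2, ...

def ls_terms(l, n_electrons):
    """LS terms of n equivalent electrons with orbital quantum number l.

    Returns (multiplicity, letter) pairs, e.g. (3, 'P') for a triplet P term.
    """
    half = Fraction(1, 2)
    # One-electron states (m_l, m_s). The exclusion principle allows each to
    # be used at most once, hence combinations of distinct states.
    states = [(ml, ms) for ml in range(l, -l - 1, -1) for ms in (half, -half)]
    table = Counter()
    for chosen in combinations(states, n_electrons):
        ML = sum(ml for ml, _ in chosen)
        MS = sum(ms for _, ms in chosen)
        table[(ML, MS)] += 1

    terms = []
    while any(count > 0 for count in table.values()):
        # Largest remaining M_S, then the largest M_L associated with it.
        MS = max(ms for (ml, ms), c in table.items() if c > 0)
        ML = max(ml for (ml, ms), c in table.items() if c > 0 and ms == MS)
        terms.append((int(2 * MS + 1), LETTERS[ML]))
        # Remove the (2L + 1)(2S + 1) entries that belong to this term.
        ms = -MS
        while ms <= MS:
            for ml in range(-ML, ML + 1):
                table[(ml, ms)] -= 1
            ms += 1
    return terms

print(ls_terms(1, 2))  # two equivalent p electrons
```

For two equivalent p electrons the function returns the triplet P, singlet D, and singlet S terms found in Table 17.3.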

The possible terms for other configurations of equivalent electrons
can be obtained by a construction similar to that of Table 17.3. Actually
it is unnecessary to concern oneself with any term in which $M_L$ or $M_S$
is negative, since there is a one-to-one correspondence between the
negative and positive values. Thus all the entries below the line in Table
17.3 could have been omitted. Table 17.4 shows the possible terms for
various equivalent electron configurations and the total number of
independent states arising from each configuration.

Since the contribution of any filled subshell to L and S is always
zero, it is clear that if one electron is missing from a closed subshell,
the resultant L and S values for the closed-shell-minus-one-electron
system must be just l and ½, respectively. Thus a $p^5$ configuration leads
to a $^2P$ term, just as a $p^1$ configuration does. Similarly a $p^4$
configuration has the same terms as a $p^2$ one. In general, when a subshell
is more than half filled, the allowed terms are identical to those for a
number of electrons equal to the number of vacancies. Table 17.4 shows
several examples.

Table 17.4 LS terms for equivalent electrons

    Configuration      Terms                                               Ind. states
    $s^2$              $^1S$                                               1
    $p^2$, $p^4$       $^3P$, $^1D$, $^1S$                                 15
    $p^3$              $^4S$, $^2D$, $^2P$                                 20
    $d^2$, $d^8$       $^3F$, $^3P$, $^1G$, $^1D$, $^1S$                   45
    $d^3$, $d^7$       $^4F$, $^4P$, $^2H$, $^2G$, $^2F$, $^2D$(2), $^2P$  120
    $d^4$, $d^6$       $^5D$, $^3H$, $^3G$, $^3F$(2), $^3D$, $^3P$(2),     210
                       $^1I$, $^1G$(2), $^1F$, $^1D$(2), $^1S$(2)
    $d^5$              $^6S$, $^4G$, $^4F$, $^4D$, $^4P$, $^2I$, $^2H$,    252
                       $^2G$(2), $^2F$(2), $^2D$(3), $^2P$, $^2S$
    $f^2$, $f^{12}$    $^3H$, $^3F$, $^3P$, $^1I$, $^1G$, $^1D$, $^1S$     91
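The number of independent states for each configuration follows from simple counting: a subshell of given l has 2(2l + 1) one-electron states, and the exclusion principle lets each hold at most one electron. The sketch below (the function name `independent_states` is an assumption) computes the binomial coefficient and also shows the symmetry between a number of electrons and the same number of vacancies.

```python
from math import comb

def independent_states(l, n_electrons):
    """Ways to place n electrons in the 2(2l + 1) one-electron states of a
    subshell, with at most one electron per state."""
    return comb(2 * (2 * l + 1), n_electrons)

# (label, l, number of electrons) for one configuration from each table row
rows = [("s2", 0, 2), ("p2", 1, 2), ("p3", 1, 3), ("d2", 2, 2),
        ("d3", 2, 3), ("d4", 2, 4), ("d5", 2, 5), ("f2", 3, 2)]
for label, l, n in rows:
    print(label, independent_states(l, n))

# Electrons versus vacancies give the same count, e.g. p2 and p4
print(independent_states(1, 2) == independent_states(1, 4))
```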

The spectroscopist is interested not only in what terms are possible 
from a given configuration but also in how the energies of the terms are 
related. This is of particular importance for determining the ground 
state of atoms. For LS coupling there are two rules which apply to 
the ground configuration but not always to excited ones:

Hund's Rule. Of terms for equivalent electrons (1) the one with greatest
multiplicity 2S + 1 lies lowest, and the term energies increase as the multiplicity
decreases, and (2) of terms of a given multiplicity, the lowest is that with greatest L.

The Multiplet Rule. For multiplets formed from equivalent electrons, the smallest
value of J lies lowest if the subshell is half filled or less; the highest value of J is
lowest if the shell is more than half full.


For example, tungsten, $_{74}$W, has a $d^4$ configuration. Since the d
subshell can hold 10 electrons of which 5 can have the same spin, the
greatest S for the ground state is (4 × ½ =) 2. Since all 4 electrons
have the same spin, they must have different $m_l$, and the biggest possible
$M_L$ is then (2 + 1 + 0 − 1 =) 2. With L = 2 and S = 2, the smallest
J is 0 and hence the ground state for $_{74}$W is $^5D_0$. Osmium (Z = 76)
has a $d^6$ configuration, four electrons short of a closed subshell, and so
again the ground state has S = 2 and L = 2. However, this time the greatest
J lies lowest, the multiplet is “inverted,” and the ground state is $^5D_4$.
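The reasoning used for tungsten and osmium can be written as a short routine. This is a sketch under the assumption that LS coupling and the two rules above apply; the function name `hund_ground_level` is hypothetical. It fills $m_l = l, l - 1, \ldots, -l$ with parallel spins first, then pairs them in the same order, and finally picks J according to the multiplet rule.

```python
from fractions import Fraction

def hund_ground_level(l, n_electrons):
    """Return (S, L, J) of the ground level for n equivalent electrons."""
    n_orbitals = 2 * l + 1
    if not 0 <= n_electrons <= 2 * n_orbitals:
        raise ValueError("subshell cannot hold that many electrons")
    ml_order = list(range(l, -l - 1, -1))
    n_up = min(n_electrons, n_orbitals)    # as many parallel spins as allowed
    n_down = n_electrons - n_up            # the rest must pair up
    S = Fraction(n_up - n_down, 2)
    L = abs(sum(ml_order[:n_up]) + sum(ml_order[:n_down]))
    if n_electrons <= n_orbitals:          # half filled or less: smallest J
        J = abs(L - S)
    else:                                  # more than half filled: largest J
        J = L + S
    return S, L, J

print(hund_ground_level(2, 4))  # d^4, the tungsten example
print(hund_ground_level(2, 6))  # d^6, the osmium example
```

Both calls give S = 2 and L = 2, with J = 0 for $d^4$ and J = 4 for $d^6$, matching the $^5D_0$ and $^5D_4$ assignments in the text.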

If a given configuration contains both equivalent and nonequivalent
electrons, the LS fine structure can be obtained by beginning with the
equivalent electron terms and then considering the contributions of the
other electrons one by one. For example, an excited state of nitrogen
involves a $p^2d$ configuration. Adding a d electron to the $p^2$
configuration yields for the $^3P_{0,1,2}$ term alone the possibilities
$^4F_{3/2,5/2,7/2,9/2}$, $^4D_{1/2,3/2,5/2,7/2}$, $^4P_{1/2,3/2,5/2}$,
$^2F_{5/2,7/2}$, $^2D_{3/2,5/2}$, and $^2P_{1/2,3/2}$; many other levels
come from combining the d electron with the $^1D_2$ and $^1S_0$ terms.


17.11 Coupling of the jj Type


As the nuclear charge increases, the spin-orbit effects become rapidly 
larger; as a consequence, the J levels tend less and less to group them- 
selves into recognizable LS multiplets and the selection rules for L and 
S fail more and more. Finally, in very heavy atoms the spin-orbit 
effects may predominate over the residual electrostatic interaction 
to such an extent that an approximation occurs to the other type of 
coupling mentioned in Sec. 17.5, known as jj coupling. 

In the zero-order stage of perturbation theory, let the electrostatic
interaction of the electrons be ignored, except insofar as allowance
is made for its average effect as represented in the central field.
Each electron can be assumed to occupy one of the $n, l, j, m_j$ states.
A quantum state for the atom is then specified by assigning a set of
such quantum numbers for each of the N electrons in the atom;

$$n_1 l_1 j_1,\ n_2 l_2 j_2,\ \ldots,\ n_N l_N j_N$$

Electrons occupying closed shells may be ignored provided these shells
remain closed in all the radiative transitions that are considered.
(The electronic energy is independent of $m_j$.)

In considering only those energy levels belonging to a particular
configuration it suffices to write down only a set of j's: $j_1, j_2, \ldots, j_N$,
with the understanding that $j_1$ goes with $n_1 l_1$, and so on. An energy
level so specified may be called a jj level or term. The number of jj
levels belonging to a given configuration cannot exceed $2^N$; for each j
can assume at most two values, l + ½ and l − ½.

In addition to the selection rule for configurations, there is now
a selection rule for j: Δj = 0 or ±1. The special rules for jj coupling
can thus be summarized in the following form:

1. Only one n,l,j set of quantum numbers can change in a radiative transition:
“only one electron jumps at a time.”
2. For the jumping electron,

$$\Delta l = \pm 1 \qquad \Delta j = 0 \text{ or } \pm 1$$
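The bound of $2^N$ on the number of jj terms can be seen by listing them. The sketch below is illustrative and assumes nonequivalent electrons, so that no further restriction from the exclusion principle is applied; the names `j_values` and `jj_terms` are introduced here.

```python
from fractions import Fraction
from itertools import product

def j_values(l):
    """Possible j for one electron: l + 1/2 and, when l > 0, also l - 1/2."""
    half = Fraction(1, 2)
    return [l + half] if l == 0 else [l - half, l + half]

def jj_terms(ls):
    """All (j1, ..., jN) sets for nonequivalent electrons with orbital
    quantum numbers l1, ..., lN."""
    return list(product(*(j_values(l) for l in ls)))

# A p electron and an s electron, as in a 6p7s configuration
terms = jj_terms([1, 0])
print(terms)
print(len(terms) <= 2 ** 2)
```

For an s electron only j = ½ is possible, which is why the count can fall below $2^N$.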


Allowance may then be made for the residual electrostatic interaction
as a second and smaller perturbation. To prepare for this, we make
combinations of functions containing different electronic m's so as to
quantize the resultant angular momentum $A_J$ with the introduction
of the usual quantum numbers J and $M_J$. This may be visualized as
the addition of the electronic j vectors in various ways into a J vector.
In classical mechanics, the electrostatic forces would cause precessions
of the electronic resultant angular momenta $A_j$ about their vector
sum $A_J$.

The residual electrostatic interaction then has the effect of
separating slightly the J levels belonging to each jj term, forming a jj
multiplet of levels. A notation for these levels is easily invented.
Thus, analogous designations of a J level with J = 3 might be, in jj
or LS coupling, respectively,

    jj:  $(n_1 l_1, \ldots, n_N l_N)(j_1, \ldots, j_N)_3$
    LS:  $n_1 l_1 \cdots n_N l_N\ {}^5D_3$

As the residual electrostatic interaction grows stronger, however,
it tends to mix the various electronic zero-order functions together
in the wave function; then it becomes less useful to attach the numbers
$(n_1 l_1 j_1 \ldots)$ to the energy levels, and the associated selection rules
tend to fail. All degrees of coupling intermediate between the ideal
LS and jj types occur. However, the quantum numbers J and $M_J$
and their selection rules persist so long as the atom is free from external
fields.

An example showing the transition from LS to jj coupling is
shown in Fig. 17.8. The relative positions of certain levels are shown
for carbon (Z = 6), germanium (Z = 32) and lead (Z = 82),
corresponding levels being connected by dotted lines. In silicon (Z = 14),
the corresponding set of levels is observed to be arranged much as in
carbon, whereas tin (Z = 50) resembles lead in this respect. The
number of closed subshells underlying the active electrons is different
in the two cases, but this difference is immaterial to the relative
arrangement of the levels.

In Fig. 17.8 we note that the J value of any level remains the
same in the three spectra; in carbon the levels group themselves by
their L and S values into good LS multiplets, whereas in lead they form
jj groups, for which the j values are given in parentheses. The
observed radiative transitions as indicated by arrows in the figure
for carbon and lead include all that are allowed by the selection rules.
The student will find it instructive to note which lines occur with one
form of coupling and not with the other.

Fig. 17.8 A transition from LS to jj coupling. Within each multiplet of levels
the spacing is drawn to scale, except that the splittings within each $^3P$ multiplet
are exaggerated; however, the scale is different for each multiplet. For carbon
the $2p^2$ multiplet has terms from 90,878 to 69,231 cm$^{-1}$ and the 2p3s from 30,547
to 28,898 cm$^{-1}$; for lead the $6p^2$ range is 59,821 to 30,365 and the 6p7s from 24,863
to 10,383.

In the course of transition from LS coupling, in which the
electrostatic and spin interactions dominate, to jj coupling, where the
spin-orbit interactions are predominant, two-electron configurations
may sometimes be advantageously described by other coupling
schemes. For example, when one of the 3p electrons of silicon is
excited to the 4f state, the configuration is highly asymmetric in the
two electrons, which now exist in quite different charge-cloud
environments. Because of this we may speak of these indistinguishable
particles as though we could tell them apart. In this case the
spin-orbit interaction $l_1 s_1$ for the 3p electron exceeds the $l_1 l_2$
interaction. The latter in turn is greater than the $s_1 s_2$ and $l_2 s_2$
interactions. For this case we may think of $l_1 \oplus s_1 = j_1$,
$j_1 \oplus l_2 = K$, and finally $K \oplus s_2 = J$ (Fig. 17.9c). This leads
to jK coupling, described by $\{[(l_1 s_1)j_1, l_2]K, s_2\}J$, just as LS
coupling corresponds to $[(l_1 l_2)L, (s_1 s_2)S]J$ or jj coupling to
$[(l_1 s_1)j_1, (l_2 s_2)j_2]J$. In the jK notation a term is written
symbolically as $j_1[K]_J$; for the Si 3p4f example, the term written in LS
notation as 3p4f $^1F_3$ is relabeled in this form in the jK description.

When the $l_1 l_2$ interaction is stronger than $l_1 s_1$, the term may be
best described by $l_1 \oplus l_2 = L$, $L \oplus s_1 = K$, $K \oplus s_2 = J$
(Fig. 17.9b). This scheme $\{[(l_1 l_2)L, s_1]K, s_2\}J$ is called LK coupling
and may be written symbolically as $L[K]_J$.

Fig. 17.9 Schematic diagram showing LS, LK, jK, and jj coupling for a pair of
electrons.
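Whatever the order in which the angular momenta are added, the same set of J values must result. The sketch below illustrates this for the 3p4f example by chaining the vector-addition rule in the jK order and in the LS order. The function names `couple`, `jk_levels`, and `ls_levels` are assumptions made for this illustration.

```python
from fractions import Fraction

def couple(a, b):
    """Values allowed for the vector sum of a and b: |a - b|, ..., a + b."""
    a, b = Fraction(a), Fraction(b)
    out, x = [], abs(a - b)
    while x <= a + b:
        out.append(x)
        x += 1
    return out

def jk_levels(l1, l2):
    """(j1, K, J) for two electrons coupled as {[(l1 s1)j1, l2]K, s2}J."""
    half = Fraction(1, 2)
    return [(j1, K, J)
            for j1 in couple(l1, half)
            for K in couple(j1, l2)
            for J in couple(K, half)]

def ls_levels(l1, l2):
    """(L, S, J) for the same two electrons coupled as [(l1 l2)L, (s1 s2)S]J."""
    half = Fraction(1, 2)
    return [(L, S, J)
            for L in couple(l1, l2)
            for S in couple(half, half)
            for J in couple(L, S)]

# 3p4f: l1 = 1 for the p electron, l2 = 3 for the f electron
jk = jk_levels(1, 3)
ls = ls_levels(1, 3)
print(len(jk), len(ls))
print(sorted(J for *_, J in jk) == sorted(J for *_, J in ls))
```

The two schemes label the levels differently but give the same number of levels and the same collection of J values, consistent with the remark that J persists through all degrees of coupling.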


17.12 The Breadth of Spectral Lines 


No spectral line as observed is perfectly sharp, no matter how great
the resolving power of the spectrometer. A line devoid of structure
appears densest in the center and fades out on the edges. The full
width at half intensity Δ is defined as the separation (in λ or ν) between
two points, one on each side of the center, at which the intensity is
half as great as it is at the maximum (Fig. 17.10). (For convenience,
Δ is often called simply the width.) The principal causes of line
broadening are the following.

Fig. 17.10 The width Δ of a spectral line.


a. Doppler Effect The observed frequency of a spectral line is
changed by motion of the radiating atom in the line of sight, owing
to Doppler's principle, the apparent frequency increasing if the
motion is toward the observer and decreasing if the motion is away
from the observer.

In a luminous gas, such as the mercury vapor in the mercury-arc
lamp, the atoms are moving with a maxwellian distribution of velocity.
Therefore, a spectral line emitted by a gas must comprise, as observed,
a range of frequencies symmetrically distributed about the frequency
emitted by the atom when at rest; this range increases with increasing
temperature. The distribution of intensity throughout the line is
determined by Maxwell's distribution of velocities. According to
Rayleigh, the brightness of the line at a distance λ′ − λ from the
center is proportional to $e^{-b(\lambda' - \lambda)^2}$, where b is a constant
depending on the temperature and on the mass of the atom. From this formula
and the value of b it follows that the width Δ defined as above, if it
is due entirely to the doppler effect, should be

$$\Delta = 0.72 \times 10^{-6}\,\lambda \sqrt{\frac{T}{M}} \quad \text{wavelength units}$$

where T is the absolute temperature and M is the atomic or molecular
weight of the radiating atom or molecule.
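The numerical coefficient in the doppler width can be recovered from physical constants. The sketch below assumes the standard result that a maxwellian velocity distribution gives a Gaussian line of full width at half intensity $2\lambda\sqrt{2 \ln 2\; kT/mc^2}$; with the mass expressed in atomic mass units this yields a coefficient of roughly 0.72 × 10⁻⁶. The function names and the 300 K temperature in the usage line are assumptions chosen for illustration.

```python
import math

K_B = 1.380649e-23       # Boltzmann constant, J/K
AMU = 1.66053907e-27     # atomic mass unit, kg
C = 2.99792458e8         # speed of light, m/s

def doppler_coefficient():
    """Dimensionless factor in: width = factor * wavelength * sqrt(T / M)."""
    return 2.0 * math.sqrt(2.0 * math.log(2.0) * K_B / (AMU * C ** 2))

def doppler_width(wavelength, temperature, mass_number):
    """Full width at half intensity, in the same units as `wavelength`."""
    return doppler_coefficient() * wavelength * math.sqrt(temperature / mass_number)

print(doppler_coefficient())
# Hydrogen line at 6563 angstroms; T = 300 K is an assumed temperature
print(doppler_width(6563.0, 300.0, 1.008), "angstroms")
```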


b. Pressure Using the light from a single unresolved spectral line,
it is possible, under favorable conditions, to produce interference
fringes when the difference in the path of the two beams is as much
as several hundred thousand wavelengths. In terms of classical
theory, this fact was interpreted to mean that the wave train sent
out by any particular atom is continuous, without abrupt change of
phase, for at least that number of vibrations. In order that an atom
may emit wave trains of this length, it must remain undisturbed
for a corresponding period of time. This period is limited by the
mean free time between collisions with other atoms, which depends on the
pressure of the gas for a given temperature. The higher the pressure,
the shorter the wave trains and the more frequent the abrupt changes
of phase. An increase of temperature, at given density, increases
both the collision rate and the pressure. Thus in general, at high
gas pressure there is a broadening due to the increasing frequency
of phase changes resulting from collisions. Michelson confirmed
this by showing, from measurements with the interferometer, that
below a pressure of the order of a millimeter, the width of the
hydrogen line λ = 6563 Å is almost entirely due to the doppler effect but
that at higher pressures the line becomes considerably broader.

The observed broadening is enhanced further by direct
disturbances of the energy levels, or of the radiation process itself, when
another molecule comes close to the radiating atom.


c. Natural Line Breadth According to wave-mechanical theory, a
line ought also to exhibit a small “natural” width even when emitted
by an atom at rest. As an analogy, a classical oscillator, radiating
energy, would decrease continually in amplitude; it would emit,
therefore, a damped wave train of finite effective length. It can be
shown that a damped train of sine waves is equivalent to the
superposition of a large number of perfectly regular trains of great length,
with slightly differing frequencies. Such radiation, observed as a
spectral line, would, therefore, be broadened slightly. In the visible
region, the natural line breadth is mostly much less than 0.001 Å and
so is not detectable. It is easily observable, on the other hand, in
the case of x-rays.

Natural line breadth is associated with an indeterminateness in
the atomic energies. The principle of the conservation of energy is
always found to hold insofar as it can be tested; but measurement
of the energy takes time, and the precision of the result is thereby
limited. If Δt denotes the time available for measurement or
establishment of an energy, and if ΔE denotes the indefiniteness in the
energy, by the indetermination principle

$$\Delta E\,\Delta t \ge \hbar/2$$

For an atomic or molecular energy level, we may take Δt = τ,
where τ is the mean life of the level, whether limited by radiative
processes or otherwise, as by Auger transitions. Thus

$$\Delta E \approx \frac{\hbar}{\tau}$$

The quantity ΔE is sometimes regarded as a natural level width. For
a spectral transition into or out of the normal state, for which τ = ∞
and ΔE = 0, theory gives for the natural line breadth

$$\Delta\nu = \frac{\Delta E}{h} \approx \frac{1}{2\pi\tau}$$

where τ refers to the other state that is involved in the transition; then
$\Delta = \Delta\lambda = \Delta(c/\nu) \approx \lambda^2/2\pi c\tau$. In general,
if $\tau_1$ and $\tau_2$ refer to the two levels, respectively,

$$\Delta\nu \approx \frac{1}{2\pi}\left(\frac{1}{\tau_1} + \frac{1}{\tau_2}\right)$$

The formula $\nu = (E_1 - E_2)/h$ refers to the center of the line.
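To get a feeling for the magnitudes, the relation between natural breadth and mean life can be evaluated numerically. The sketch below is illustrative: the function names are assumptions, as are the sample wavelength of 5000 Å and the mean life of 10⁻⁸ s, which are not values taken from the text.

```python
import math

C = 2.99792458e8  # speed of light, m/s

def natural_width_frequency(tau1, tau2=math.inf):
    """Natural breadth in Hz from the mean lives (s) of the two levels.

    The default tau2 = infinity corresponds to a transition into or out of
    the normal state.
    """
    return (1.0 / tau1 + 1.0 / tau2) / (2.0 * math.pi)

def natural_width_wavelength(wavelength, tau1, tau2=math.inf):
    """Natural breadth in metres for a line at `wavelength` (metres)."""
    return wavelength ** 2 * natural_width_frequency(tau1, tau2) / C

# Assumed values: a visible line at 5000 angstroms, mean life 1e-8 s
width_m = natural_width_wavelength(5000e-10, 1e-8)
print(width_m * 1e10, "angstroms")
```

With these assumed inputs the breadth comes out well below 0.001 Å, in line with the statement that the natural breadth is not detectable in the visible region.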


d. Incipient Stark Effect In the spectra emitted by discharge
tubes, a common cause of broadening is the production of a small
Stark effect by strong electric fields, which are not great enough to
produce an observable splitting of the line but are sufficient to make
it appear perceptibly broader than it otherwise would.


Problems 


1. List the term designations ($^{2S+1}L_J$) of all possible terms which can
result from the following values of L and S: (a) L = 2, S = ½; (b) L = 4,
S = 1; (c) L = 3, S = $; (d) L = 0, S = 2; (e) L = 1, S = È.
2. Indicate the possible values of J associated with each of the following 
incomplete term designations: 'P, 2D, 1D, 3S, *P, ‘P, °H, 5P. Under 
what circumstance is the actual number of possible J values less than 
the multiplicity? For each of the terms above, indicate whether it 
arises from a configuration with an even or an odd number of electrons. 
3. Sodium atoms in the ground state 3s $^2S_{1/2}$ are bombarded by
electrons with 3.5 eV energy. With the help of Fig. 17.2 list the wavelengths
of all the spectral lines which can be excited. (Assume no excited atoms
are struck by the 3.5-eV electrons.)
4. Estimate the relative intensities of the lines of compound doublets
arising from transitions $^2D$–$^2F$ and $^2F$–$^2G$.
Saanel mAai a aeara by tes Ne sare meat Le lice 
of its “orbital” motion. Ton as a result 


Ans: 2) Wb/m$^2$
6. From the wavelengths of the sodium D lines calculate $Z_{\text{eff}}$
($= Z - \sigma_2$) in Eq. (17.6) for the 3p state.

Ans: 3.55
7. Using the intervals of Table 17.1 and Eq. (17.6), compute the
screening constant for the spin-orbit interaction of the 4p, 5p, and 6p
levels. Compare the results with the 3p value of 7.45 from the preceding
problem. How does the value of the screening constant for the
spin-orbit effect compare with that for the energy of the n,l state determined
by $E = Rhc(Z - \sigma)^2/n^2$?
8. A line arising from a transition to the ground state of an atom has
a width Δ of 0.5 cm$^{-1}$. If the width is almost entirely due to the width
of the initial state, calculate the approximate lifetime of this state.
9. Write the ground electron configuration for Si. What is the ground
state? List all the energy states which can arise from this configuration
in order of increasing energy, assuming LS coupling.
10. Write the ground electron configuration for $_7$N, determine the
ground state, and list all energy levels which arise from the electron
configuration in order of increasing energy (with the aid of Table 17.4).
11. Write the ground-state configuration and determine the
ground-state term for $_{22}$Ti, $_{23}$V, $_{27}$Co, and $_{28}$Ni, given that in
each case there are two electrons in the 4s level.
12. A level multiplet is observed with spacings in the ratio of 1:2:3:4.
What are possible values of L and S? Repeat for spacings of 3:5:7
and 3:5.
Partial Ans: $^5D$


13. Prepare a diagram similar to Fig. 17.6 to show the relative loca- 
tions of the levels of all terms arising in LS coupling for the configura- 
tions (a) 3p4p; (b) 4p4f; and (c) 2p°3s. 
14. On a figure similar to 17.3 indicate all six transitions allowed between
$^3D$ and $^3P$ levels. Calculate the relative intensities of the lines given
that the line $^3P_2 \leftarrow {}^3D_3$ is 84 times as intense as the line
$^3P_2 \leftarrow {}^3D_1$.
Ans: 84, 15, 1, 45, 15, 20
15. If the constant B in Eq. (17.10) is 5 times as great for the $^3P$ level
multiplet as for the $^3D$, show the relative frequencies of six lines of the
preceding problem on a sketch similar to the lower part of Fig. 17.3.
16. Find the possible terms for (a) a p? configuration and (b) a p‘
configuration.
17. Find the possible terms for a d? configuration.


chapter eighteen 


Atoms in a Magnetic Field 


In preceding chapters we have asserted without proof that the angular
momentum of the electron system for an atom is quantized in such a way
that when the z direction is fixed by a suitable field, the z component of the
angular momentum can have only values which differ by an integer times ħ.
Further we have introduced relations between spin and orbital angular
momentum and the corresponding magnetic moments. In the present chapter
we treat the Zeeman effect and related phenomena which provided many
of the clues from which the vector model of the atom was inferred.


18.1 Effects of a Magnetic Field on an Atom 


The quantum states of an atom are modified when the atom is subjected
to external magnetic or electric fields. States which are formerly
degenerate, i.e., associated with the same energy, may become separated, and
spectral lines arising from transitions between these states may split
into several components. An important case is the Zeeman effect,
discussed in Sec. 6.3. A few Zeeman patterns are shown in Fig. 18.1; only
the normal triplet is compatible with classical theory, in which the
magnetic moment arises from the orbital motion of the electrons.

Fig. 18.1 Photographs of some Zeeman patterns in a weak magnetic field, viewed
perpendicular to B. The brackets show the position of a normal triplet in the same
field. Panels: sodium principal doublet (anomalous patterns); zinc sharp triplet
(anomalous patterns). (From H. E. White, “Introduction to Atomic Spectra.”
Copyright 1934, McGraw-Hill Book Company. Used by permission.)

A bar magnet with magnetic moment M experiences a torque in a magnetic
field B, which we choose to be in the z direction. Referred to a zero of
energy when M and B are perpendicular, the potential energy is
$-M \cdot B$. Similarly, provided the field is not too
strong, an atom of magnetic moment $\mu_J$ has potential energy of
orientation given by $-\mu_J \cdot B$. If $E_0$ represents the energy of any atomic
level in the absence of a field, application of a magnetic field removes the 
(2J + 1)-fold degeneracy of the level and the energy associated with 
each value of $M_J$ becomes different. That is, the energy of the atom can
be written

$$E = E_0 - \mu_J \cdot B = E_0 - \mu_z B \qquad (18.1)$$

where $\mu_z$ is the z component of the magnetic moment $\mu_J$.


18.2 The Normal Zeeman Effect 


For all singlet states $S = 0$, and so there is no net spin angular momentum
and no net magnetic moment associated with spin. The magnetic
moment of the electron system is due to orbital motion, and the
quantum-mechanical value is

$$\mu_J = -\frac{e}{2m} A_J \qquad (18.2)$$

which comes directly by finding the resultant of the contributions of the
individual electrons given by Eq. (9.16) and then making use of the fact
that since $S = 0$, $L = J$ and $A_L = A_J$. Since the allowed values of
$(A_J)_z$ are $M_J\hbar$, we have immediately that, for the case of singlet
levels, the z component of the magnetic moment can have only one of the
values $-(e/2m)M_J\hbar$. Consequently, a singlet level of initial energy
$E_0$ in the absence of a magnetic field is split by an applied B into
2J + 1 equally spaced magnetic substates given by

$$E = E_0 + \frac{e\hbar}{2m} B M_J = E_0 + \mu_B B M_J \qquad (18.3)$$


where $\mu_B$ ($= e\hbar/2m$) is the Bohr magneton.

Fig. 18.2 Normal Zeeman pattern arising from a $^1P_1 - {}^1D_2$ transition.
Only three lines result from the nine allowed transitions. The displacement
of the $\sigma$ lines is $\pm e\hbar B/2m$ from the $\pi$ line.

Figure 18.2 shows the splitting for $^1D_2$ and $^1P_1$ levels; the energy
spacing between adjacent sublevels is $(e\hbar/2m)B$. Transitions occur
only between magnetic sublevels for which the selection rule
$\Delta M_J = 0, \pm 1$ is satisfied. There are three transitions in
Fig. 18.2 for which $\Delta M_J = 0$, but all three lead to the emission of
the same frequency $\nu_0$ as that of the line from the $^1P_1 - {}^1D_2$
transition in the absence of a field. Similarly, the three
$\Delta M_J = +1$ transitions lead to a single line of frequency
$\nu_0 - eB/4\pi m$, while the $\Delta M_J = -1$ transitions give a line
of frequency $\nu_0 + eB/4\pi m$. These are the observed frequencies, derived
on an entirely different model from that of Lorentz and Zeeman, who
nevertheless predicted the same frequencies, Eq. (6.3).
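The collapse of nine allowed transitions into three lines can be checked by direct enumeration. The following is a minimal sketch, assuming $g = 1$ for both singlet levels as in Eq. (18.3) and measuring frequency shifts in units of $\mu_B B/h$; the function name `normal_zeeman_lines` is an illustrative choice, not something taken from the text.

```python
from collections import defaultdict


def normal_zeeman_lines(j_initial, j_final):
    """Group allowed transitions by frequency shift, in units of mu_B*B/h.

    Assumes g = 1 for both levels (singlet states), so a transition
    M_initial -> M_final is shifted by (M_initial - M_final).
    """
    lines = defaultdict(list)
    for m_i in range(-j_initial, j_initial + 1):
        for m_f in range(-j_final, j_final + 1):
            if abs(m_f - m_i) <= 1:  # selection rule: Delta M_J = 0, +1, -1
                lines[m_i - m_f].append((m_i, m_f))
    return dict(lines)


# Levels with J = 2 and J = 1, as for the 1D2 and 1P1 levels of Fig. 18.2.
lines = normal_zeeman_lines(2, 1)
for shift in sorted(lines):
    print(shift, lines[shift])
```

Each of the three shifts (-1, 0, +1) collects three transitions, which is the normal triplet.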


For $\sigma$ polarization   $\Delta M_J = \pm 1$   (18.4a)

For $\pi$ polarization   $\Delta M_J = 0$ (but no $0 \to 0$ for $\Delta J = 0$)   (18.4b)

[...] electric vector is perpendicular to [...] observed at right angles
to B (Sec. 6.3).


18.3 Electron Spin and the Landé g Factor


When Goudsmit and Uhlenbeck introduced the profoundly important
concept of electron spin in 1925, they found that assigning a magnetic
moment of 1 Bohr magneton for spin led to an explanation of several
features of atomic spectra. In terms of quantum mechanics this is
equivalent to writing the spin magnetic moment $\mu_s$ as $-2(e/2m)$ times
the spin angular momentum $A_s$, while for the orbital case
$\mu_l = -(e/2m)A_l$. The question promptly arose whether one unit of spin
angular momentum involves exactly or approximately twice as great a
magnetic moment as one unit of orbital angular momentum. Let us write

$$\mu_s = -g_s \frac{e}{2m} A_s \qquad (18.5)$$


Fig. 18.3 Illustrating the approximations used in treating the weak-field
Zeeman effect: $A_J$ precesses about B (slow), while $A_L$ and $A_S$ precess
about $A_J$ (fast). (Modified from R. B. Leighton, "Principles of Modern
Physics." Copyright 1959, McGraw-Hill Book Company. Used by permission.)

where $g_s$, known as the spin g factor, is possibly exactly 2 and surely
approximately 2.

In 1934 Kinsler produced the first evidence that $g_s$ is slightly
greater than 2. From careful measurements of the Zeeman effect in
two P states of neon he found that if Eq. (18.5) is accepted,
$g_s = 2.0034 \pm 0.0032$. Subsequent measurements by a variety of
techniques have led to the value(1) $g_s = 2.002319$. For our purposes it
will suffice to take $g_s = 2$.

(1) P. Kusch, Phys. Today, 19(2):23 (1966).

We now calculate the Landé g factor for LS coupling when B is
low enough for Zeeman splitting to be small compared with the
spin-orbit fine structure. Then the magnetic energy can be considered as a
small perturbation. By virtue of the torque on the magnetic moment,
$A_J$ now precesses around B at a rate small compared with the precession
rate of spin and orbital angular momentum about $A_J$ (Fig. 18.3). Since
$\mu$ precesses about $A_J$ rapidly compared with the precession rate of
$A_J$ about B, the time average of $\mu$ is approximately equal to its
component in the direction opposite to $A_J$, which we shall call $\mu_J$.
In Fig. 18.4 we redraw the vectors $A_L$, $A_S$, and $A_J$ and the
associated magnetic moment vectors, where
$\mu_S = \Sigma\mu_s = -2(e/2m)A_S$. From the figure we have

$$\mu_J = \mu_L \cos\theta + \mu_S \cos\phi \qquad (18.6)$$

Fig. 18.4 Because each unit of spin angular momentum contributes
twice as much magnetic moment as a unit of orbital angular momentum,
$\mu$ is not parallel to $A_J$.

By the law of cosines

$$A_S^2 = A_J^2 + A_L^2 - 2A_L A_J \cos\theta \qquad A_L^2 = A_J^2 + A_S^2 - 2A_J A_S \cos\phi$$

Solving these equations for $\cos\theta$ and $\cos\phi$ and introducing the
values into Eq. (18.6), along with the values for the angular momenta and the


magnetic moments, give

$$\mu_J = \frac{e\hbar}{2m}\left[\sqrt{L(L+1)}\,\frac{J(J+1) + L(L+1) - S(S+1)}{2\sqrt{L(L+1)}\sqrt{J(J+1)}} + 2\sqrt{S(S+1)}\,\frac{J(J+1) + S(S+1) - L(L+1)}{2\sqrt{S(S+1)}\sqrt{J(J+1)}}\right]$$

$$= \frac{e\hbar}{2m}\sqrt{J(J+1)}\left[1 + \frac{J(J+1) + S(S+1) - L(L+1)}{2J(J+1)}\right]$$

$$= g\,\frac{e}{2m}\,A_J \qquad (18.7)$$


where 


$$g = 1 + \frac{J(J+1) + S(S+1) - L(L+1)}{2J(J+1)} \qquad (18.8)$$


is the Landé g factor for LS coupling. In the presence of a "weak"
magnetic field which establishes the z direction, the allowed values of
the z component of $A_J$ are $M_J\hbar$ and the corresponding possible values
of $(\mu_J)_z$ are

$$(\mu_J)_z = -g\,\frac{e}{2m}\,A_J\,\frac{M_J\hbar}{A_J} = -\frac{e\hbar}{2m}\,M_J\left[1 + \frac{J(J+1) + S(S+1) - L(L+1)}{2J(J+1)}\right] = -gM_J\mu_B \qquad (18.9)$$

where $\mu_B$ is the Bohr magneton ($e\hbar/2m$). By Eq. (18.1), in a magnetic
field each energy level is split into 2J + 1 "magnetic" sublevels with
energy

$$E = E_0 + \mu_B B M_J g \qquad (18.10)$$


Equation (18.10) reduces to Eq. (18.3) when g is set equal to unity, 
the Landé g factor for singlet states. 
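The Landé factor of Eq. (18.8) is easy to evaluate exactly with rational arithmetic. The sketch below is illustrative only; the function name `lande_g` and the choice to raise an error for $J = 0$ (where the expression is 0/0 and there is nothing to split) are assumptions, not part of the text.

```python
from fractions import Fraction


def lande_g(L, S, J):
    """Lande g factor for LS coupling, Eq. (18.8), as an exact fraction."""
    L, S, J = Fraction(L), Fraction(S), Fraction(J)
    if J == 0:
        # Assumption of this sketch: treat J = 0 as undefined rather than guess.
        raise ValueError("g is undefined for J = 0")
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))


half = Fraction(1, 2)
print(lande_g(0, half, half))            # 2S1/2
print(lande_g(1, half, half))            # 2P1/2
print(lande_g(1, half, Fraction(3, 2)))  # 2P3/2
print(lande_g(2, 0, 2))                  # 1D2, a singlet
```

For a singlet ($S = 0$, $L = J$) the bracketed term vanishes and $g = 1$, which is the reduction of Eq. (18.10) to Eq. (18.3).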


18.4 Zeeman Patterns of LS Multiplets in a Weak Field 


As examples of anomalous Zeeman patterns, consider those of the sodium
D lines, which arise from the transitions $^2S_{1/2} - {}^2P_{1/2}$ and
$^2S_{1/2} - {}^2P_{3/2}$. The splitting of the levels and the allowed
transitions are shown in Fig. 18.5 with the resulting line patterns
appearing below.

Fig. 18.5 Weak-field Zeeman splitting for the sodium D lines. The
position of the normal triplet is shown by three dots. The same scale is
used for all Zeeman separations. The relative heights of the lower lines
suggest the theoretical relative intensities.

In discussing Zeeman patterns it is convenient to specify the
displacement of each line of the Zeeman pattern from the original
(zero-field) line in terms of the normal Zeeman displacement, which is
$\mu_B B$ in energy units and $\mu_B B/h$ in frequency units. The
displacement in wavenumbers of each outer line of a normal Zeeman triplet
from the central line is sometimes called a Lorentz unit for the field B;
its value is $L = \mu_B B/hc = 0.467B$ cm$^{-1}$ when B is measured in
webers per square meter.

For the sodium D doublet the line $D_1$ ($^2S_{1/2} - {}^2P_{1/2}$) is split
into four components, two $\pi$ components with displacements of
$\pm\frac{2}{3}L$ and two $\sigma$ components with displacements of
$\pm\frac{4}{3}L$. $D_2$ ($^2S_{1/2} - {}^2P_{3/2}$) is split into six
components, two $\pi$ with displacements of $\pm\frac{1}{3}L$ and four
$\sigma$ with displacements of $\pm L$ and $\pm\frac{5}{3}L$. The selection
rule $\Delta M_J = 0$ leads to $\pi$ polarization and $\Delta M_J = \pm 1$
to $\sigma$ polarization, just as for the normal Zeeman effect,
which we now recognize as simply the special case in which $S = 0$.
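The displacements quoted for the sodium D lines follow from Eq. (18.10) applied to both levels of each transition. The following is a sketch under stated assumptions: the helper names `lande_g`, `m_values`, and `zeeman_pattern` are hypothetical, displacements are expressed in Lorentz units, and the sign convention is upper-level shift minus lower-level shift. The sketch does not handle levels with $J = 0$.

```python
from fractions import Fraction as F


def lande_g(L, S, J):
    """Lande g factor, Eq. (18.8); J must be nonzero in this sketch."""
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))


def m_values(J):
    """Allowed M_J values: -J, -J + 1, ..., J."""
    return [-J + k for k in range(int(2 * J) + 1)]


def zeeman_pattern(upper, lower):
    """Return (pi, sigma) displacement lists in Lorentz units.

    upper and lower are (L, S, J) tuples for the two levels.
    """
    g_u, g_l = lande_g(*upper), lande_g(*lower)
    pi, sigma = set(), set()
    for m_u in m_values(upper[2]):
        for m_l in m_values(lower[2]):
            shift = g_u * m_u - g_l * m_l
            if m_u == m_l:                 # Delta M_J = 0
                pi.add(shift)
            elif abs(m_u - m_l) == 1:      # Delta M_J = +1 or -1
                sigma.add(shift)
    return sorted(pi), sorted(sigma)


ground = (0, F(1, 2), F(1, 2))                            # 2S1/2
print(zeeman_pattern((1, F(1, 2), F(1, 2)), ground))      # D1
print(zeeman_pattern((1, F(1, 2), F(3, 2)), ground))      # D2
```

The first call gives two $\pi$ and two $\sigma$ components, the second two $\pi$ and four $\sigma$, matching the four- and six-line patterns described above.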

It is evident that a wide variety of patterns is possible; a few are
diagrammed in Fig. 18.6. Corresponding lines in the spectral multiplets
of a given series exhibit the same type of pattern; such lines have a
common final level, and their initial levels have the same J, S, and L
and hence the same value of g. This conclusion is in agreement with
observation; it is known as Preston's rule, discovered empirically in 1898.
Preston's rule is often of use in deciding what lines belong together in
a series. From Zeeman patterns it is possible in many cases to infer
values of both g and J for the two LSJ levels involved.

Fig. 18.6 Some Zeeman patterns for LS coupling in a weak field. Heights of
lines indicate rough relative intensities; $\pi$ lines are drawn above,
$\sigma$ below. The dots show the position of the normal triplet in the
same field.

Before the development of wave mechanics a mathematical description
of the anomalous Zeeman effect, including the g factor, was worked
out by Landé, who showed the necessity of using $J(J + 1)$ rather than
$J^2$. A few special cases may be noted which we discuss in terms of the
Landé factors $g'$ for the initial state and $g''$ for the final state. It
can happen that a spectral line is not split at all in a magnetic field. An
example is the $^5D_0 - {}^5F_1$ transition, in which $g' = 0$ for $^5F_1$
while $M_J = 0$ for $^5D_0$; this case is illustrated by the line
$\lambda$ 5713 Å in the arc spectrum of titanium. If $g' = g''$, the pattern
may be a simple triplet, but of nonnormal separation. Similarly, a
transition from a level with $J = 1$ to one with $J = 0$ leads to a triplet
line pattern with anomalous separation.

Intensity rules for Zeeman patterns can be derived from the knowledge
that the statistical weights of all $M_J$ levels are equal and from the
facts that:


1. The sum of the intensities for all transitions from any initial Zeeman level is
equal to the sum of the intensities for all transitions from any other initial level.


2. The sum of the intensities of all transitions arriving at any final Zeeman level
is the same as for any other final level.


For the normal Zeeman pattern the two $\sigma$ components are each half
as intense as the central line as seen transversely to the magnetic
field; the $\sigma$ components are twice as intense when viewed along the
field as they are when viewed transversely. This latter result is in
agreement with the classical theory of the Zeeman effect; each circular
motion is equivalent to two linear vibrations at right angles to each
other, but in the transverse direction, radiation is received from only one
of these, the one that lies in the line of sight being invisible.
Furthermore, as much light is polarized in one way as in the opposite way,
so that on the whole the emitted radiation is unpolarized; and the total
intensity is equal in all directions.


18.5 The Paschen-Back Effect


The formulas of Sec. 18.3 are valid only for weak magnetic fields.
The term "weak" is, of course, relative, since fields in excess of 1000
gauss are usually required to produce good Zeeman patterns. A
field is weak when the total spread of the Zeeman pattern of each line
is small relative to the spacing of the lines themselves. Figure 18.7,
for example, shows diagrammatically to scale the D lines of sodium
with their respective Zeeman patterns in a field of 30,000 gauss. The
Zeeman separations are seen to be small compared with the separation
between $D_1$ and $D_2$. For these lines, therefore, 3 Wb/m$^2$ is a
weak field. Consider, however, the Zeeman effect in lithium. The
zero-field separation of the two components of the first line of the
principal series is of the order of 0.3 cm$^{-1}$. A field of 3 Wb/m$^2$
would produce Zeeman separations of 1.4 cm$^{-1}$. For lithium, therefore,
3 Wb/m$^2$ is a strong field.
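The comparison between lithium and sodium can be made numerical with the Lorentz unit $\mu_B B/hc$. This is a rough sketch, not a full criterion: it uses one Lorentz unit as a stand-in for the spread of a Zeeman pattern, rounded values of the physical constants, and a sodium D-line separation of about 17.2 cm$^{-1}$, which is a commonly quoted figure assumed here rather than taken from the text. The function names are illustrative.

```python
MU_B = 9.274e-24   # Bohr magneton, J/T (rounded)
H = 6.626e-34      # Planck constant, J s (rounded)
C = 2.998e8        # speed of light, m/s (rounded)


def lorentz_unit_cm(b_tesla):
    """Normal Zeeman displacement mu_B*B/(h*c) in cm^-1 (1 T = 1 Wb/m^2)."""
    return MU_B * b_tesla / (H * C) / 100.0  # convert m^-1 to cm^-1


def field_ratio(b_tesla, fine_structure_cm):
    """Lorentz unit divided by the zero-field doublet separation.

    Values well below 1 suggest a weak field, values above 1 a strong one.
    """
    return lorentz_unit_cm(b_tesla) / fine_structure_cm


print(lorentz_unit_cm(1.0))      # close to 0.467 cm^-1 per Wb/m^2
print(field_ratio(3.0, 0.3))     # lithium, separation from the text
print(field_ratio(3.0, 17.2))    # sodium, assumed separation
```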
As the magnetic field is increased from weak to strong, the
approximations of Sec. 18.3 are no longer reasonable, since as B
increases, the precession rate due to B is no longer less than the
precession rate of $A_L$ and $A_S$ about $A_J$. At strong fields $A_L$ and
$A_S$ may precess separately about B. As we have seen in Sec. 9.8, the
orbital angular momentum vector $A_l$ for a single electron precesses about
B at the Larmor precessional rate $\omega_L = eB/2m$. Since all electronic
$A_l$'s precess at the same rate, the magnetic field does not disturb the
resultant orbital momentum, which in the absence of other influences
would itself merely precess at velocity $\omega_L$. Since the gyromagnetic
ratio for spin is twice that for orbital angular momentum, the resultant
spin momentum would precess undisturbed at velocity $2\omega_L$. Since
these rates are not equal, in general, the vector sum $A_J$ of the two
momenta is not constant. In wave mechanics a magnetic field has
no disturbing effect upon LS coupling, but it does prevent the quantizing
of $A_J$. In a weak field, J continues to be a fairly good quantum
number, but as the field becomes strong, J disappears and its selection
rules cease to hold.

Fig. 18.7 Relative separations [...] field of 3 Wb/m$^2$.

In classical theory the vector moment of force due to a uniform
magnetic field B is perpendicular to B; hence it has no tendency to
alter the component of the vector angular momentum in the direction
of B. Analogously, in wave mechanics, in the presence of a
uniform magnetic field each quantum state of an otherwise isolated
atom is characterized by a fixed component of angular momentum
parallel to the field; or, in cases of degeneracy, the quantum states
can be so defined that this is true. The principal term in the expression
for this angular momentum is $M_J\hbar$; the quantum number $M_J$
is integral or half integral according to the rule stated in Sec. 17.1,
and is subject to the selection rule stated there. There is also a much
smaller component of momentum proportional to the magnetic field
itself, but this is negligible in spectroscopy. Although the total
angular momentum may not be fixed (so that J is no longer a valid
quantum number), $M_J$ as expressed in Eq. (17.3) continues to be
significant. Since $M_J\hbar = \Sigma(A_l)_z + \Sigma(A_s)_z = (M_L + M_S)\hbar$,
we may write for the z component of the magnetic moment of the electron
system

$$\mu_z = -\frac{e}{2m}\left[\Sigma(A_l)_z + 2\Sigma(A_s)_z\right] = -\frac{e}{2m}\,\frac{\Sigma(A_l)_z + 2\Sigma(A_s)_z}{\Sigma(A_l)_z + \Sigma(A_s)_z}\,M_J\hbar = -\mu_B M_J g \qquad (18.11a)$$

where the g factor is now given by

$$g = \frac{\Sigma(A_l)_z + 2\Sigma(A_s)_z}{\Sigma(A_l)_z + \Sigma(A_s)_z} \qquad (18.11b)$$


The view was proposed by Paschen and Back in 1912 that if
the magnetic field could be made sufficiently great, the (then) unknown
cause of multiplet splitting would be overpowered and all patterns
would revert to the normal triplet. This reduction might occur
either through the coalescence of lines or through the disappearance
of certain lines. In their observations, Paschen and Back were able
to demonstrate such an approximation to a triplet in two cases. Later
they showed experimentally that in a strong field the J selection rule
[...] lines may appear, only to disappear again as the field becomes
still stronger.

This phenomenon, now called the Paschen-Back effect, [...] mechanical
theory or from [...]

In wave mechanics, the total component in the B direction is
$M_J\hbar$, and here $M_J = M_L + M_S$; furthermore,
$\Sigma(A_l)_z = M_L\hbar$ and $\Sigma(A_s)_z = M_S\hbar$. Thus, as in a weak
field, $\Sigma(A_l)_z$ and $\Sigma(A_s)_z$ are independent of B, and from
Eqs. (18.4), (18.6), and (18.11a,b) we have, as an approximation valid in a
sufficiently strong field,

$$g = \frac{M_L + 2M_S}{M_L + M_S} \qquad (18.11c)$$

$$\mu_z = -(M_L + 2M_S)\mu_B \qquad (18.11d)$$

$$E = E_0 + (M_L + 2M_S)\mu_B B \qquad (18.12)$$

[...] $\Delta M_L = 0$ or $\pm 1$, $\Delta M_S = 0$, the normal
triplet now arises just as in the case of one electron in a huge field.
[...] however, [...] strong-field pattern, of the same [...] as the
original fine structure in the [...]

$$E = E_0 + (M_L + 2M_S)\mu_B B + KM_L M_S$$

The transition from a weak [...] Fig. 18.8 for a $^2S - {}^2P$ transition
such as that which [...] D lines of sodium. The plots are constructed
relative [...] between the $^2P_{1/2}$ and $^2P_{3/2}$ levels in zero field,
so that they are valid for any case of this type; the abscissa is
proportional to field strength.

Fig. 18.8 (a) Paschen-Back effect on $^2P$ energy levels in a magnetic
field. E is energy in arbitrary units; $\Delta E$ is the
$\frac{1}{2} - \frac{3}{2}$ separation in zero field, taken as 1.5 on
the ordinate. In x, $\Delta E$ is in joules and B in webers per square
meter. (b) Calculated Paschen-Back effect on the $^2S - {}^2P$ Zeeman
pattern with a few observations by Kent on lithium.

In Fig. 18.8a are shown the calculated positions of the $^2P$ levels, labeled
with values of $M_J = M_L + M_S$, and with quantum numbers for no
field and strong field shown at the sides of the plot. The two $^2S$
levels simply separate in proportion to B and are not shown. In
Fig. 18.8b the calculated positions of the 10 lines for the transition
$^2S - {}^2P$ are similarly shown, $\pi$ and $\sigma$ lines being so
labeled. Lines which become weak and would ultimately vanish in an infinite
field are indicated by the symbol "$\to 0$." It will be noted that as the
field becomes strong, two of the $^2P$ levels coalesce, and of the 10
weak-field lines, 4 disappear while the others tend to coalesce in pairs.
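The strong-field limit can be illustrated by enumerating states labeled by $M_L$ and $M_S$. The sketch below assumes the strong-field energy has the form $E = E_0 + (M_L + 2M_S)\mu_B B$, ignores the residual spin-orbit term, and applies the selection rules $\Delta M_L = 0, \pm 1$ with $\Delta M_S = 0$. All function names are illustrative, and line positions are in units of $\mu_B B$.

```python
from collections import Counter
from fractions import Fraction

HALF = Fraction(1, 2)


def strong_field_states(L):
    """All (M_L, M_S) pairs for a doublet term (S = 1/2) with orbital number L."""
    return [(m_l, m_s) for m_l in range(-L, L + 1) for m_s in (-HALF, HALF)]


def energy(state):
    """Magnetic energy in units of mu_B*B, namely M_L + 2*M_S."""
    m_l, m_s = state
    return m_l + 2 * m_s


def strong_field_lines(L_upper, L_lower):
    """Count allowed transitions at each displacement from the zero-field line."""
    lines = Counter()
    for up in strong_field_states(L_upper):
        for lo in strong_field_states(L_lower):
            same_spin = up[1] == lo[1]           # Delta M_S = 0
            if same_spin and abs(up[0] - lo[0]) <= 1:
                lines[energy(up) - energy(lo)] += 1
    return lines


print(sorted({energy(s) for s in strong_field_states(1)}))  # distinct 2P energies
print(sorted(strong_field_lines(1, 0).items()))             # 2P to 2S lines
```

In this limit the six $^2P$ states give five distinct energies, and six transitions survive, falling in pairs on the three positions of a normal triplet.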

A corresponding theory can be developed for jj coupling. A
help in tracing the levels is provided by the rule that levels having
the same $M_J$ do not cross as the field increases in strength.

It should be added that in the discussion so far the magnetic
field has been tacitly assumed not to be so extremely strong as to bring
together levels belonging to different terms, such as a P and a D.
If this limit could be exceeded, the normal triplet should disappear
again, to reappear only in a field so huge as to swamp all other influences
except that of the central field, thus producing the case described
below.


18.6 Zeeman Effect in a Huge Field


The simplest type of Zeeman effect should be produced by a field so
strong that complications due to all other sources can be ignored. Such
a field we shall call huge.

Consider first a single electron in a central field, placed in a uniform
magnetic field B. Since we are assuming that the spin-orbit
effect is swamped by the huge field, we can use the $n$, $l$, $m_l$, $m_s$
quantum states with the axis parallel to B. When the atom is in one of
these states, $(A_l)_z$ and $(A_s)_z$ have the magnitudes

$$(A_l)_z = m_l\hbar \qquad (A_s)_z = m_s\hbar$$

Therefore,

$$\mu_z = -\frac{e\hbar}{2m}(m_l + 2m_s) = -\mu_B(m_j + m_s)$$

where $m_j = m_l + m_s$. Since $\mu_z$ does not change (appreciably) with
further increase in B, Eq. (18.1) gives for the energy

$$E = E_0 + \mu_B B(m_j + m_s) \qquad (18.13)$$

Here $m_j + m_s$ is always an integer or zero.

From Eq. (18.13) we see that the magnetic field splits the single
original level with energy $E_0$ into several magnetic levels, each
characterized by a value of $m_j + m_s$. An example is illustrated in
Fig. 18.9. Since $m_j + m_s$ may have any integral value from $l + 1$, when
$m_l = l$ and $m_s = \frac{1}{2}$, down to $-(l + 1)$, there are in general
$2l + 3$ magnetic levels in all; if, however, $l = 0$, there are only two
levels, with $m_j + m_s = \pm 1$. All the levels except the upper two and the
lower two are double. The levels are equally spaced, the difference
in energy being $\mu_B B$.
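The level count and the doubling of the inner levels can be verified by listing the values of $m_l + 2m_s$. This is a small sketch in which the function name `huge_field_levels` is an illustrative choice and $2m_s$ is handled as the integers $-1$ and $+1$ to avoid fractions.

```python
from collections import Counter


def huge_field_levels(l):
    """Map each value of m_l + 2*m_s (equal to m_j + m_s) to its number of states."""
    counts = Counter()
    for m_l in range(-l, l + 1):
        for two_m_s in (-1, 1):  # 2*m_s for m_s = -1/2 and +1/2
            counts[m_l + two_m_s] += 1
    return dict(sorted(counts.items()))


for l in (0, 1, 2):
    levels = huge_field_levels(l)
    print(l, len(levels), levels)
```

For $l = 1$ this gives five levels ($2l + 3$) with only the middle one double, and for $l = 0$ just the two levels at $\pm 1$.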

Consider now transitions between two such sets of energy levels.
As selection rules we have, besides $\Delta l = \pm 1$,

$$\Delta m_l = 0, \pm 1 \qquad \Delta m_s = 0$$

The allowed transitions, shown in Fig. 18.9, lead to the normal Zeeman
triplet with:

For $\pi$ polarization   $\Delta m_l = 0$   giving $\nu = \nu_0$

For $\sigma$ polarization   $\Delta m_l = \pm 1$   giving $\nu = \nu_0 \mp \mu_B B/h$

Fig. 18.9 Energy levels for one electron in a huge magnetic field.
Transitions giving rise to the same $\nu$ are bracketed.

Finally, it may be remarked that in a huge field the magnetic
moment of the atom, in general, becomes variable, and $E = E_0 - \mu_z B$
no longer holds. The atom then behaves rather like a set of magnets
connected by springs. To move such a compound magnet to a position
where the field B is greater by $dB$ with no change in its direction
requires work $dE = -\mu_z\,dB$. Thus the general relation between
$\mu_z$, B, and the energy E is

$$\mu_z = -\frac{dE}{dB} \qquad (18.14)$$

18.7 The Stern-Gerlach Experiment 


The space quantization of atomic angular momenta constitutes a
characteristic feature both of the older quantum theory and of wave
mechanics. An experiment which directly reveals the space quantization
itself was proposed by O. Stern in 1921 and was carried out in collaboration
with Gerlach.

A magnet tends to move so as to increase the magnetic flux through
it in the direction of its magnetic axis. In a uniform field, the result
is that the magnet experiences a torque tending to line it up with the
field. In a nonuniform field, however, the magnet experiences a
translatory force $F = \mu \cdot \nabla B$.

Suppose that a beam of atoms having magnetic moments travels
in the direction of the z axis across a magnetic field whose lines are
approximately parallel to the y axis but whose magnitude decreases
rapidly in the direction of +y. After passing through the field, let
these atoms be collected on a suitable target. Then atoms which have
a component of their magnetic moment in the direction of the field are
deflected; if the component of the moment has the same direction as
the field, they will be deflected toward -y; if it has the opposite
direction, toward +y.

According to classical theory each atom will enter the field with
its magnetic axis inclined at some angle $\theta$ to the field, and the axis
will then execute a Larmor precession about the field at the fixed angle
$\theta$. Since all values of $\theta$ occur among the atoms, their
deflections will be distributed in continuous fashion, and the atoms,
instead of forming a small spot on the target, will be drawn out into a
continuous band. According to quantum theory, on the other hand, each atom
enters the field in a certain quantum state, defined with the direction of
the field as an axis. Its magnetic moment in the direction of the field is
$gM_J$ Bohr magnetons (if the field is not too strong). The beam is broken
up, therefore, into separate beams and forms on the target a series of
distinct spots, one for each possible value of $M_J$.

The arrangement used in the experiment of Stern and Gerlach is 
shown diagrammatically in Fig. 18.10. The nonhomogeneous field was 
produced between pole pieces of which one had a sharp edge, so that 
near it the field was much stronger than elsewhere. A strap-shaped 
beam of silver atoms was formed by evaporating silver in a heated oven 
O and allowing atoms from the vapor to stream out through collimating 
slits; the beam (shown by the small rectangle in Fig. 18.10a) traveled 
closely past the sharp edge of pole piece P and was condensed on a 
plate at T. With no field, the beam formed a narrow line on the plate 
(Fig. 18.10c). When the magnetizing current was turned on, the line 
was not widened continuously but was divided into two lines, except 
at the ends, which were produced by atoms passing at some distance 
from the sharp edge. Space quantization of the silver atoms was thus 
clearly revealed. From the separation of the two lines and the gradient 
of the magnetic field strength, it was calculated that each silver atom 
had a magnetic moment in the direction of the field of 1 Bohr magneton. 

Fig. 18.10 The Stern-Gerlach experiment. (a) Approximate cross section
of poles; (b) path of ions; and (c) the trace on the target with (above) and
without (below) magnetic field.

These results, and others obtained subsequently, are in agreement
with the predictions of wave mechanics. The silver atom is normally in
a $^2S_{1/2}$ state, for which $g = 2$; thus half the atoms should have
$M_J = \frac{1}{2}$ and a moment in the direction of the field of 1 Bohr
magneton; for the other half $M_J = -\frac{1}{2}$, and the moment is $-1$.
Similar results were obtained by Leu for sodium and potassium, which are
likewise in $^2S_{1/2}$ states. Atoms of zinc and cadmium, in Leu's
experiments, were unaffected by the magnetic field. Atoms of thallium gave
a double line corresponding to a moment of $\frac{1}{3}$ magneton. Zinc and
cadmium are normally in $^1S_0$ states, which have $M = 0$ and no magnetic
moment. For thallium the normal state is, from spectroscopic evidence,
$^2P_{1/2}$; thus $M_J = \frac{1}{2}$ or $-\frac{1}{2}$ again, but
$g = \frac{2}{3}$.
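The number of beams and the size of the moment for each atom mentioned above follow from the Landé factor and the allowed $M_J$ values. The sketch below is illustrative: `beam_moments` is a hypothetical helper, moments are written as $-gM_J$ in Bohr magnetons following the sign of Eq. (18.9), and a $J = 0$ state is treated as a single undeflected beam with zero moment.

```python
from fractions import Fraction


def lande_g(L, S, J):
    """Lande g factor for LS coupling, Eq. (18.8); J must be nonzero."""
    L, S, J = Fraction(L), Fraction(S), Fraction(J)
    return 1 + (J * (J + 1) + S * (S + 1) - L * (L + 1)) / (2 * J * (J + 1))


def beam_moments(L, S, J):
    """Field-direction moments -g*M_J in Bohr magnetons, one per beam."""
    J = Fraction(J)
    if J == 0:
        return [Fraction(0)]  # no moment, so a single undeflected beam
    g = lande_g(L, S, J)
    return [-g * (-J + k) for k in range(int(2 * J) + 1)]


half = Fraction(1, 2)
print("silver   2S1/2:", beam_moments(0, half, half))
print("thallium 2P1/2:", beam_moments(1, half, half))
print("zinc     1S0  :", beam_moments(0, 0, 0))
```

Silver and thallium each give two beams, with moments of magnitude 1 and 1/3 Bohr magneton respectively, while zinc gives one.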


18.8 Isotope Structure and Hyperfine Structure 


The ordinary fine structure due to spin-orbit interaction does not
exhaust the possibilities of fine details in spectral lines. Even before
1900, Michelson and others had shown, by means of the interferometer,
that many spectral lines possess a further structure much finer still;
this came to be known as hyperfine structure.

After the discovery of isotopes, it was believed for a time that
each component of a hyperfine pattern of lines was emitted by a
different isotope. Later, hyperfine structure was discovered in the
spectra of some elements which consist of only one isotope. An
example is bismuth; the line $\lambda$ 3596 Å contains six hyperfine
components spread over a range of 0.3 Å, or 2.3 cm$^{-1}$. As suggested by
Pauli in 1924, this effect is due to the fact that the nucleus has angular
momentum and an associated magnetic moment. There is a tendency
now to limit the expression hyperfine structure to the spectrum from a
given isotope and to refer to the other effect as isotope structure. In
the spectrum from a mixture of isotopes, both types may occur
superposed.

In light atoms the isotope shift may arise from simple differences
in the nuclear motion. The simplest case is that of hydrogen, in
[...] and his collaborators to [...] can be found by substituting
$M = M_1$ and $M = 2M_1$ successively in [...]

$$R_H = \frac{M_1 R_\infty}{M_1 + m} \qquad R_D = \frac{2M_1 R_\infty}{2M_1 + m}$$

$M_1$ being the mass of a proton and $m$ the mass of an electron. The
[...] given line of [...] Since $M_1/m = 1{,}836$, it follows that the
H$\beta$ line for deuterium should lie 4861/3,672 = 1.32 Å on the violet
side of that for ordinary hydrogen. This line was observed to be faintly
visible, in the expected position, in the spectrum from a sample of common
hydrogen; it increased in strength as the relative concentration of
deuterium was increased.
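The estimate of the deuterium shift can be reproduced from the reduced-mass form of the Rydberg constant. This sketch assumes $R = R_\infty M/(M + m)$ for a nucleus of mass $M$, a proton-to-electron mass ratio of 1836, and a deuteron mass of twice the proton mass, as in the passage above; the function names are illustrative. Because it keeps the exact ratio rather than the approximation $m/2M_1$, the result differs slightly in the third decimal from 4861/3672.

```python
def reduced_rydberg_factor(nuclear_mass_ratio):
    """R / R_inf = M / (M + m), with M given in units of the electron mass m."""
    return nuclear_mass_ratio / (nuclear_mass_ratio + 1.0)


def deuterium_shift(wavelength_h, proton_to_electron=1836.0):
    """Shift (same units as wavelength_h) of a deuterium line toward the violet."""
    r_h = reduced_rydberg_factor(proton_to_electron)
    r_d = reduced_rydberg_factor(2.0 * proton_to_electron)
    # Wavelength is inversely proportional to the Rydberg constant.
    wavelength_d = wavelength_h * r_h / r_d
    return wavelength_h - wavelength_d


print(deuterium_shift(4861.0))  # shift of the H-beta line in angstroms
```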

In the spectrum of a heavy atom, isotope shifts are found, in
general, to be proportional to the differences in atomic mass. A
photograph illustrating such structure in the spectrum of tungsten
is shown in Fig. 18.11. The spectrum was formed with a Fabry-Perot
etalon; hence the same pattern appears repeated many times in
different orders. Tungsten consists chiefly of four isotopes, three
having mass numbers 182, 184, and 186 about equally abundant,
and 183 about half as abundant as the others. The three observed
lines are ascribed to the more abundant isotopes, any effect of 183
being assumed to be masked by the others.

In such heavy atoms, the isotope shift cannot be explained as
a simple difference in the effects of nuclear motion, which are far
too small. The cause lies in a departure from simple coulomb interaction
between nucleus and electrons, associated with the finite [...]
moments. [...]

[...] structure caused by the spin-orbit effect. The angular momentum
of the nucleus is associated with spin and orbital angular momenta of
the protons and neutrons of which the nucleus is composed. The
maximum possible value of the component of the nuclear angular
momentum in any direction is denoted by $I\hbar$; the square of the total
is $I(I + 1)\hbar^2$. The spin number $I$ is an integer or 0 whenever the
mass [...] in the nucleus) is even, and [...] Moreover, $I = 0$ when
[...] quantum number $M_I$ restricted to integrally spaced values such
that $|M_I| \le I$; thus the nuclear state is $(2I + 1)$-fold degenerate.
Protons and neutrons have $I = \frac{1}{2}$; some other values of $I$ are
listed in Table 18.1.


Table 18.1 Nuclear Spin and Magnetic Moment


Atomic number   Element   Mass number   Nuclear spin   Magnetic moment $\mu_I$
      Z                        A             I         (nuclear magnetons)

      0           n            1            1/2            -1.9131
      1           H            1            1/2             2.7928
                               2            1               0.8574
      2           He           3            1/2            -2.1276
                               4            0
      3           Li           6            1               0.8220
                               7            3/2             3.2564
      4           Be           9            3/2            -1.1776
      5           B           10            3               1.8007
                              11            3/2             2.6885
      7           N           14            1               0.4036
                              15            1/2            -0.2831
      9           F           19            1/2             2.6287
     11           Na          23            3/2             2.2176
     13           Al          27            5/2             3.6414
     21           Sc          45            7/2             4.7564
     41           Nb          93            9/2             6.1670
     47           Ag         107            1/2            -0.1135
      ?           ?            ?            1/2             1.6115


[...] protons of mass $M_1$ [...] the effective [...] moment of the
proton would be, in analogy with Eq. [...] electrons,
$e\hbar/2M_1 = \mu_B/1836$, with a sign corresponding to rotation of
positive charge. As we have seen in Sec. 10.5, the magnetic moment
of the proton is 2.793 of these nuclear magnetons, while the neutron has
a moment of -1.91 nuclear magnetons. In general, the moment $\mu_I$
for any nucleus represents the maximum possible value for the component
of the nuclear moment in any given direction, measured in
nuclear magnetons. In a nuclear orientation state in which the nuclear
angular momentum about a given axis is $M_I\hbar$, its component of
magnetic moment in the direction of this axis is $(M_I/I)\mu_I$. Values
of $\mu_I$ are also listed in Table 18.1.

The nuclear angular momentum may then be added vectorially
to the resultant angular momentum of the atomic electrons by a process
of IJ coupling exactly similar to the LS coupling of orbital and spin
momenta. Such coupling is necessary whenever weak noncoulomb
interaction between the nucleus and the atomic electrons occurs.
For the grand resultant momentum there are then two new quantum
numbers $F$ (the resultant of $J$ and $I$) and $M_F$, with
$|J - I| \le F \le J + I$ and $|M_F| \le F$.

Interactions between the nucleus and the electronic motions
and spins may then slightly separate levels having different $F$, forming
a hyperfine multiplet of levels. The number of these levels is $2I + 1$
if $J \ge I$ but $2J + 1$ if $J \le I$. If the interaction is magnetic in
nature, equations are obtained of the same form as Eqs. (17.10) and
(17.11) for LS multiplets except that here $L$, $S$, and $J$ are replaced,
respectively, by $J$, $I$, and $F$. As selection rules, in addition to
others, we have

$$\Delta F = 0 \text{ or } \pm 1 \qquad \Delta M_F = 0 \text{ or } \pm 1$$

As an alternative, however, the interaction may arise from an asymmetry
in the electrostatic field around the nucleus, usually of the
quadrupole type, and in this case the formulas are more complicated.

A hyperfine spectral multiplet results from transitions between
the hyperfine levels composing two ordinary J levels. Usually it
happens that the spacing of the hyperfine sublevels in one of the two
J levels is much larger than that in the other, so that the former
spacing stands out in the multiplet as observed, the finer structure due
to the other J level being frequently not resolved at all. The result
is then an easily recognizable flag type of pattern, a good example
of which is shown in Fig. 18.11.


As an illustration of the theory, consider the following doublet
in the spectrum of thallium ($_{81}$Tl):

$6s^2 7s\ ^2S_{1/2} - 6s^2 6p\ ^2P_{1/2}$   $\lambda$ = 3777 Å   $\tilde\nu$ = 26,478 cm$^{-1}$

$6s^2 7s\ ^2S_{1/2} - 6s^2 6p\ ^2P_{3/2}$   $\lambda$ = 5352 Å   $\tilde\nu$ = 18,684 cm$^{-1}$

Under high resolution each line of this doublet is seen to be made up
of three components. Their wavelengths, estimated intensities, and
relative positions on a frequency scale are shown in Fig. 18.12a.
To produce three component lines, as observed, there must be two
hyperfine sublevels in each J level. Here $2 = 2I + 1$, and so
$I = \frac{1}{2}$ for the thallium nucleus. Then for $J = \frac{1}{2}$,
$F = I \pm \frac{1}{2}$, or $F = 1$ or 0; for $J = \frac{3}{2}$, $F = 2$
or 1. The spacing of the hyperfine levels is easily determined from the
observed separations in the spectrum. The level diagram and spectral fine
structure for one line of the doublet are shown in Fig. 18.12b. The same
pattern would result if the doublet separations were interchanged, but the
Zeeman pattern for the hyperfine components unambiguously assigns the energy
differences.

(Fig. 18.12 label: $\Delta\tilde\nu = 0.40$ cm$^{-1}$.)

Hyperfine spectra are an important source of information concerning
the values of $I$ and $\mu_I$. To find $\mu_I$, however, requires a
wave-mechanical calculation, which is easiest for light atoms. The sign
of $\mu_I$ affects the order of the $F$ levels; very commonly, when
$\mu_I > 0$, levels with larger $F$ lie above those with smaller $F$, and
vice versa for $\mu_I < 0$.

[...] frequently arise [...] ascribed to slight departures from spherical
[...] electrostatic field of the nucleus, especially such a departure as
can be described by assuming an electric quadrupole moment in the nucleus.
If $I > \frac{1}{2}$, such a moment causes perturbations, large or small, in
the hyperfine pattern; it may even be the sole cause of a hyperfine
splitting.

18.9 The Stark Effect 


An applied electric field can also modify the quantum states of an
atom, separating states which were formerly degenerate. This effect
was discovered in 1913 by Stark, who observed splitting of the Balmer
lines of hydrogen. When a hydrogen atom is in a homogeneous
electric field E which establishes the z direction, a perturbation energy
$-eEz$ is added to the coulomb energy in the potential-energy term of
the Schrödinger equation. This results in a first-order change in
energy of $-eE\langle z\rangle$, where $\langle z\rangle$ depends on the electron "orbit" and its
eccentricity. The details of the calculation are complicated, but they
lead to the prediction that when E is weak enough for its effect to be
smaller than the spin-orbit interaction, each level is split into $2n - 2$
components with different energies, where n is the principal quantum
number. The electric field does not interact through the magnetic
moment, and each pair of Zeeman levels $m_j$ and $-m_j$ have the same
energy. States of $+m_j$ and $-m_j$ correspond to the same charge
distributions and therefore have the same shift in energy in the electric
field.

For weak fields the Stark splitting for hydrogen lines leads to
symmetrical patterns, and the line spacings are proportional to E.
This situation is described as the linear or first-order Stark effect.
When E exceeds $10^7$ V/m, there are shifts in the line patterns which
are proportional to $E^2$ and we speak of the second-order Stark effect.
A qualitative explanation is that under the influence of the strong




Problems 


1. Compute the displacement in Å of the two outer Zeeman components
from the mercury 1849.6 Å line ($6s^2\ ^1S_0 - 6s6p\ ^1P_1$) in a magnetic
field of 2 Wb/m².

Ans: 0.032 Å

2. (a) Calculate the magnetic moment of an isolated aluminum atom
in its ground state by use of the vector model of the atom. What is
the ratio of this value to $\mu_B$?

(b) Do the same for a fluorine atom.

3. Determine the weak-field Zeeman splitting for a line arising from
a $^3S_1 - {}^3P_1$ transition in terms of the normal Zeeman splitting.

Ans: $\pi$ lines at ±0.5; $\sigma$ lines at ±1.5, ±2

4. Find the weak-field Zeeman pattern for a ?P, — 2D; transition in 
terms of the normal Zeeman separation. Make a sketch similar to 
those of Fig. 18.5, showing the positions and polarizations (but not the 
relative intensities) of the lines. 


5. In the triplet spectrum of zinc there are sharp series lines arising
from $^3P_2 - {}^3S_1$ and diffuse series lines from $^3P_0 - {}^3D_1$. Calculate the
Zeeman pattern for both kinds of lines. To which series did the line
at the lower right of Fig. 18.1 belong?

6. (a) Show that g = 1.50 for $^3P$, $^5D$, and $^7F$ levels and, in general,
for S = L.

(b) Under what conditions is g = 0? Give an example for both
even and odd multiplicity.

(c) Show that in addition to the singlet levels, g = 1 for $^5F_2$ and
$^7I_5$ states. What is the general condition on S, L, and J if g = 1 for
LS coupling?

7. Determine the weak-field Zeeman pattern for a line arising from a
‘Py — ‘D; transition and show the displacements of all lines of the
pattern in a sketch similar to those of Fig. 18.6. (Do not concern
yourself with relative intensities.)

8. Calculate the Landé g factor for the 1° $ H, and '°G, levels and find
the Zeeman pattern for lines arising from transitions between the two
states.


9. (a) Show that any object with magnetic moment $\mu$ experiences a
force in the z direction given by $\mu_z\,\partial B_z/\partial z$ when it moves through a
nonuniform magnetic field.

(b) In a Stern-Gerlach experiment silver atoms with average kinetic
energy of $4 \times 10^{-20}$ J pass through an inhomogeneous field 0.75 m long
with a $\partial B_z/\partial z$ equal to 2500 Wb/m³ along the atom path. Find the
maximum separation of the "lips" of Fig. 18.10c at the end of the magnet.

10. A spectral line arises from a transition 3P, — 3D: for an atom for 
which J = 3. Draw a diagram showing the hyperfine splitting of these 
levels and indicate all allowed transitions by arrows. 

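11. (a) Show that the classical interaction energy of two magnetic
dipoles $\boldsymbol{\mu}_1$ and $\boldsymbol{\mu}_2$ separated by $\mathbf{r}$ is given by

$$E = \frac{\mu_0}{4\pi}\left[\frac{\boldsymbol{\mu}_1\cdot\boldsymbol{\mu}_2}{r^3} - \frac{3(\boldsymbol{\mu}_1\cdot\mathbf{r})(\boldsymbol{\mu}_2\cdot\mathbf{r})}{r^5}\right]$$

(b) Use this relation to find the maximum interaction energy of
one nuclear magneton and 1 Bohr magneton separated by 1 Bohr radius.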
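The weak-field Zeeman problems above all follow one recipe, which may be sketched in code. The sketch assumes the standard LS-coupling Landé formula $g = 1 + [J(J+1) + S(S+1) - L(L+1)]/[2J(J+1)]$ and the selection rules $\Delta M = 0$ ($\pi$ lines) and $\Delta M = \pm 1$ ($\sigma$ lines), with $M = 0 \rightarrow M = 0$ excluded when $\Delta J = 0$. The function names are hypothetical, and shifts are expressed in units of the normal Zeeman splitting.

```python
from fractions import Fraction as Fr


def lande_g(s, l, j):
    """Lande g factor for an LS-coupled level (S, L, J)."""
    s, l, j = Fr(s), Fr(l), Fr(j)
    if j == 0:
        return Fr(0)  # a J = 0 level has no magnetic splitting
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))


def m_values(j):
    """Magnetic quantum numbers -J, -J + 1, ..., +J."""
    j = Fr(j)
    return [-j + k for k in range(int(2 * j) + 1)]


def zeeman_pattern(upper, lower):
    """Return (pi_shifts, sigma_shifts) for levels given as (S, L, J) tuples."""
    g_up, g_low = lande_g(*upper), lande_g(*lower)
    j_up, j_low = Fr(upper[2]), Fr(lower[2])
    pi_set, sigma_set = set(), set()
    for m_up in m_values(j_up):
        for m_low in m_values(j_low):
            shift = m_up * g_up - m_low * g_low
            dm = m_up - m_low
            if dm == 0:
                if j_up == j_low and m_up == 0:
                    continue  # M = 0 -> M = 0 is forbidden when dJ = 0
                pi_set.add(shift)
            elif abs(dm) == 1:
                sigma_set.add(shift)
    return sorted(pi_set), sorted(sigma_set)


# 3S1 (S=1, L=0, J=1) and 3P1 (S=1, L=1, J=1)
pi_shifts, sigma_shifts = zeeman_pattern(upper=(1, 0, 1), lower=(1, 1, 1))
print("pi:", [str(x) for x in pi_shifts])
print("sigma:", [str(x) for x in sigma_shifts])
```

For a $^3S_1 - {}^3P_1$ line this gives $\pi$ components at ±1/2 and $\sigma$ components at ±3/2 and ±2, consistent with the answer quoted for Problem 3. Relative intensities are not computed.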
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chapter nineteen 


Molecules and Molecular Spectra


That atoms attract one another is clear from the facts that (1) two or
more atoms may join together to form a stable gaseous molecule and (2)
larger aggregates of atoms cling together to form liquids and solids. A
satisfactory theory of the atom should explain how and why certain combinations
of atoms form stable molecules. The ability of modern quantum theory
to supply a highly successful description of interatomic forces is of fundamental
importance no less to the chemist than to the physicist.


19.1 Interatomic Forces 


To explain the binding between atoms, we consider first the interaction 
of two neutral atoms as they approach one another from a distance. 
If the electrons of each atom formed a spherically symmetrical charge 
cloud about the nucleus, neither atom would be surrounded by an
electric field and hence there would be little force action between them.
However, the picture of electrons in orbitals suggests that although on 
the average the field due to electrons might just cancel the electric field 
of the nucleus, there should be also a rapidly fluctuating residual field. 
This field might polarize a neighboring atom and consequently exert a 
fluctuating attractive force upon it. When wave-mechanical perturba- 
tion theory is applied to the problem, an attraction of one atom for 
another, called the van der Waals attraction, is found to exist. This
attraction is important for the phenomenon of cohesion, but it does not 
play a major role in the binding of the atoms into chemical compounds. 

Chemical forces come into play when atoms approach so closely 
that their electronic charge clouds begin to overlap. New effects then 
occur. Of great importance is the electron-exchange effect, described for
one particular case in Appendix 15A and discussed qualitatively in 
Sec. 19.4. Atoms held together by means of a pair of exchanged elec- 
trons are said to share a covalent bond; binding of this type accounts for 
the compounds called homopolar by the chemist. Long before quantum 
mechanics led to an understanding of the covalent bond, classical ideas 
were invoked to explain the attraction between a strongly electropositive 
atom (such as sodium) and a strongly electronegative atom (such as 
bromine). It is well known that a compound such as NaBr dissociates 
into Na⁺ and Br⁻ ions when it is dissolved in water. This suggested
that there might be a transfer of an electron from a Na atom to a Br 
atom, thereby forming a pair of ions which might be held together by 
electrostatic attraction. Such a bond, called ionic or heteropolar, was 
the first to be understood and offers a reasonable starting point for 
discussing gaseous molecules. 

Although we find it convenient to discuss the limiting cases of 
purely ionic and purely covalent bonds, in reality all grades of transi- 
tion occur between the two types. Thus the bonds in molecules of 
CO₂ or CdS are intermediate between the two idealized cases. The
mathematical treatment is best described by saying that such chemical 
bonds are partly of one type and partly of the other. 


19.2 The Ionic Bond 


As a specific example of an ionic bond, we shall discuss potassium chloride.
When a K atom with a single electron outside of a closed electron
core interacts with a Cl atom, which requires one more electron to complete
the p⁶ subshell to give the most stable of electron configurations,
a primarily ionic bond is formed. We may think of the K atom as
giving up its outer electron which joins the Cl atom, thereby leaving
K⁺ and Cl⁻ ions, bound together by electrostatic interaction. The
best support for this picture is its success in predicting the observed
binding energy and the correct general properties of the molecule.
Clearly, a molecule formed by a positive ion and a negative ion should
be polar in nature and have a permanent electric dipole moment. Quantitative
measurements of the observed dipole moment, of the binding
energy, and of the molecular spectra are all in excellent agreement with
predictions based on the assumption that the KCl molecule is a union
of the K⁺ ion and Cl⁻ ion with their nuclei separated by a distance of
2.79 Å.

When an atom of chlorine and an atom of potassium interact to 
form a molecule of potassium chloride, an energy of 4.42 eV/molecule 
is released. This is called the binding energy of the molecule. As 
expected, 4.42 eV can also dissociate the KCl molecule into an atom of 
potassium and an atom of chlorine. For a conservative system, the 
binding energy is numerically equal to the dissociation energy. If we
select as the zero for our energy measurement the situation in which a 
Cl and a K atom are infinitely far apart, the KCl molecule in its ground 
state will have a negative energy equal to the dissociation energy. To 
obtain an estimate of this energy, imagine that we form a molecule by 
a series of idealized steps, which are not those by which an actual molecule 
is formed. But since the processes involved conserve energy, the dis- 
sociation energy is independent of the manner in which the molecule is 
formed. We begin with a K atom and a Cl atom infinitely separated. 
Let us remove an electron from the K and give it to the Cl atom, thus 
producing the corresponding ions. To remove the outer electron from 
the K atom requires its ionization energy 4.34 eV. The electron affinity 
of Cl is 3.80 eV; it represents the energy released when an electron becomes
attached to a neutral Cl atom. The process of forming the ions has
required from us a net energy of 0.54 eV. We now regain this energy
and more as we allow the K⁺ and Cl⁻ ions to approach one another under
their electrostatic attraction. The potential energy P of charges e and
−e at a distance r apart is given by

$$P = -\frac{e^2}{4\pi\epsilon_0 r} = -\frac{14.4}{r}\ \text{eV·Å} \tag{19.1}$$

Here we have evaluated the constant so that the potential energy is
in electron volts when the separation is measured in angstrom units.
At the equilibrium separation 2.79 Å, the potential energy P is −5.16 eV.

Fig. 19.1 The heavy curve is the energy of a KCl molecule as a
function of the distance r between the nuclei of the K⁺ and Cl⁻
ions. The zero-energy configuration consists of a neutral K atom
an infinite distance from a neutral Cl atom.


Clearly, the potential energy decreases without limit as r decreases
(Fig. 19.1). There can be no equilibrium separation unless some repelling
force is brought into play. The primary factor which limits the approach
of the ions is associated with the charge cloud surrounding each nucleus.
In this case both ions have a $1s^2 2s^2 2p^6 3s^2 3p^6$ core. Once the K and Cl
ions are close enough for the electron clouds to begin overlapping, the 
Pauli exclusion principle requires that one or more of the electrons move 
to a higher energy state. This brings into play a strong repulsive force 
which rises rapidly as the separation is decreased, as shown in Fig. 19.1. 
The result is that there is a minimum in the potential at r = 2.79 Å,
which represents the equilibrium separation of the ions. In terms of
quantum mechanics, as the two ions approach, the wave functions for
the individual electrons become strongly distorted and the quantum
states are no longer the simple hydrogenic ones which we have assumed.
As the two ion cores interpenetrate, the Pauli principle requires that
electrons move to states of greater energy. There is also repulsion
between the positively charged nuclei which may be lumped with the
ion-core repulsion to give the net dashed curve in Fig. 19.1. One can
estimate the repulsive energy from the fact that the observed dissociation
energy for KCl is 4.42 eV, while the ion potential energy less the
cost of forming the ion pair is 5.16 − 0.54 = 4.62 eV. This suggests
that the repulsive energy is approximately 0.20 eV at the equilibrium
separation.
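The energy bookkeeping for KCl can be retraced numerically. The sketch below is only an arithmetic check under the point-ion picture used in the text; the constant $e^2/4\pi\epsilon_0$ is computed from SI values of $e$ and $\epsilon_0$, and all variable names are illustrative.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # permittivity of free space, F/m

# e^2/(4*pi*eps0) in eV*angstrom: dividing by e converts J to eV,
# multiplying by 1e10 converts m to angstrom.
K_EV_ANGSTROM = E_CHARGE / (4.0 * math.pi * EPS0) * 1.0e10


def coulomb_energy_ev(r_angstrom):
    """Potential energy of charges +e and -e a distance r apart, Eq. (19.1)."""
    return -K_EV_ANGSTROM / r_angstrom


ionization_k = 4.34       # eV, ionization energy of K quoted in the text
affinity_cl = 3.80        # eV, electron affinity of Cl quoted in the text
r_equilibrium = 2.79      # angstrom
dissociation_obs = 4.42   # eV, observed dissociation energy

ion_pair_cost = ionization_k - affinity_cl            # energy spent forming the ions
coulomb_ev = coulomb_energy_ev(r_equilibrium)         # energy regained by attraction
point_ion_binding = -coulomb_ev - ion_pair_cost       # binding with no repulsion
repulsion = point_ion_binding - dissociation_obs      # implied repulsive energy

print(f"constant           = {K_EV_ANGSTROM:.2f} eV*A")
print(f"ion-pair cost      = {ion_pair_cost:.2f} eV")
print(f"Coulomb energy     = {coulomb_ev:.2f} eV")
print(f"point-ion binding  = {point_ion_binding:.2f} eV")
print(f"implied repulsion  = {repulsion:.2f} eV")
```

The numbers reproduce the 0.54, 5.16, 4.62, and 0.20 eV figures of the text to the precision quoted there.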


There are two other small contributions to the energy which 
roughly cancel each other. The first is the van der Waals attraction 
between the electron clouds, and the second is the small vibrational 
energy required by the uncertainty principle. The latter arises because 
the potential energy near the minimum in Fig. 19.1 is essentially the 
harmonic-oscillator potential; the zero-point energy of ½hν is a quantum
requirement for such a system. As we see in Sec. 19.8, the harmonic 
vibrations of the atoms are a major consideration in explaining the spectra 
of molecules. 


19.3 The Hydrogen Molecule Ion 


There are many diatomic molecules of elements, e.g., hydrogen, oxygen, 
nitrogen, and chlorine, in which atoms of the same type are bound 
together. To gain an understanding of such covalent bindings it is 
desirable to begin to study a system composed of two protons and a 
single electron, which is an ionized H₂ molecule. This is a three-body
system for which no general solution exists. However, since the mass 
of a proton is 1836 times that of the electron, the speeds of the protons 
are much less than that of the electron. Therefore it is a reasonable 
first approximation to assume that the protons are at rest while the 
electron is in motion. We then treat the wave equation for the electron 
for some particular separation R of the two protons by letting the x 
axis lie along the line connecting the protons and putting the origin 
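midway between them. The Schrödinger equation for the electron is

$$-\frac{\hbar^2}{2m}\nabla^2\psi - \frac{e^2}{4\pi\epsilon_0}\left\{\frac{1}{[(x - R/2)^2 + y^2 + z^2]^{1/2}} + \frac{1}{[(x + R/2)^2 + y^2 + z^2]^{1/2}}\right\}\psi = E\psi \tag{19.2}$$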


When R is large, the solution of (19.2) leads to the eigenvalues of 
the hydrogen atom, corresponding to the electron being bound to one 
of the protons; thus we have a neutral hydrogen atom and a proton off 
at distance R. 

If we repeat the problem for R a few angstrom units, we find that
because of the symmetry of the potential-energy function, any nondegenerate
eigenfunction must have either even or odd parity. Figure
19.2 shows approximate ground-state eigenfunctions for four values of
R ranging from very large to zero. The latter case corresponds to that
of the helium ion, for which we know that the ground-state energy is
4 × (−13.6) = −54.4 eV, while the first corresponds to −13.6 eV binding
energy for the electron.

Fig. 19.2 Schematic illustration of the symmetric wave function
and the antisymmetric wave function for four
values of the proton separation R. The ordinate is
the value of the wave function along the x axis. (Modified
from R. B. Leighton, "Principles of Modern Physics."
Copyright 1959. McGraw-Hill Book Company.
Used by permission.)

The equilibrium separation for the protons of the hydrogen molecule-ion
is that for which the energy of the system (not of the electron alone)
is minimum. As the protons approach one another, the energy of the
electron decreases, but the energy due to coulomb repulsion of the
protons increases. These two energy terms are plotted in Fig. 19.3,
where the zero of energy is chosen to be that with the two protons
infinitely far apart and the electron bound to one of them. The energy
of the system is the difference between the previous two; it has a minimum
of −2.65 eV at R₀ = 1.06 Å. The electron distribution along
the x axis for the equilibrium separation is shown at the top of Fig.
19.4, while below are plotted contour lines of equal ψ*ψ for values
0.9, 0.8, . . . , 0.1 times the maximum value as computed by Burrau.
The probability of finding the electron between the two protons is very
great, as one would expect; binding is achieved through the sharing of
the electron between the two nuclei.

It should be pointed out that only the even-parity wave function 
gives binding in this case; antisymmetric wave functions such as the 
dashed one in Fig. 19.2 correspond to increasing energy as R decreases 
and hence to a net repulsion between the protons. The one-electron
bond can be reasonably strong for two protons, but it is very weak if
the two nuclei do not have the same charge.

Fig. 19.3 The energy of the hydrogen molecule-ion
as a function of the interproton separation R referred
to a zero-energy configuration consisting of a proton 
and a hydrogen atom infinitely far apart. 
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Fig. 19.4 The electron distribution function for the normal hydrogen 
molecule-ion. The lower curves are contour lines for value 0.9, 0.8, . . . , 
0.1 times the maximum value. (From L. Pauling and E. B. Wilson, Jr., 
“ Introduction to Quantum Mechanics.” Copyright 1935. McGraw-Hill Book 
Company. Used by permission.) 


19.4 The Hydrogen Molecule and the Covalent Bond 


When a hydrogen molecule-ion captures an electron, a hydrogen molecule
results. In 1927 Heitler and London set up the Schrödinger equation
for this system and solved it by approximation methods similar to those
we used above. For very large separation R of the protons the ground
state corresponds to two H atoms far apart. As R goes to zero, the
ground state is that of the helium atom. For intermediate separations
the situation is more complicated. We observe that the potential-energy
function, and the resulting hamiltonian operator, are symmetrical in x
once more; as a result the eigenfunctions have either even or odd parity
with respect to x, just as we found for the H₂⁺ ion.

When detailed calculations are made, one finds that the ground-state
energy of the system varies qualitatively with R like the solid
curve of Fig. 19.3, with an equilibrium separation R₀ = 0.742 Å and a
binding energy of 4.48 eV. Thus the neutral molecule is more tightly
bound than the ion, with the protons closer together.

The probability-density function for the ground state of the neutral
molecule behaves much like that for the ion in Fig. 19.4, except that the
contours are considerably fatter near x = 0 as a result of the mutual
repulsion of the electrons. Binding arises from the sharing of a pair
of electrons by the two nuclei; this is the prototype of covalent (or homopolar)
binding. But a very important feature of this binding should
again be mentioned: the two electrons involved in a covalent bond must
have opposite spin.

The ground-state spatial wave function for the electron pair has
even parity. But the Pauli exclusion principle permits two electrons
to have this spatial wave function only if they differ in spin, since the
total wave function must be antisymmetric under any interchange of the
space and spin coordinates of two electrons. If the spins are parallel,
the second electron must go
into a higher energy state; for the H₂ molecule this yields no bonding.
Hence two H atoms can form an H₂ molecule only when the electrons
have opposite spin.

Electron-pair bonds between dissimilar atoms are roughly of the 
same strength as between similar atoms, although one-electron bonds 
are weak unless the atoms are the same. A single electron between 
dissimilar atoms is almost always found nearer the one which gives the 
electron the lowest energy. However, for the two-electron bond, the 
mutual repulsion of the electrons keeps the electrons well separated unless 
one of the atoms has a strong electron affinity. (The latter leads to 
ionic binding.) Only two electrons participate in an ordinary covalent
bond. If a third electron is present, it must have an eigenfunction
corresponding to higher energy and this results in either a much weaker
bond (as in H₂⁻) or in no binding at all (HeH). It is possible to have
more than one covalent bond between two atoms; e.g., in O₂ we have
two covalent bonds and in N₂ three. In methane, CH₄, the carbon
atom has four covalent bonds, one to each H atom.


19.5 Binding between Square Wells 


The mathematical complexities of dealing with atoms in three dimensions 
are considerable, but many of the most important features of atomic 
binding are shown by the mathematically simple problem of binding 
between one-dimensional square wells. We take two identical square
wells of depth P₀, each containing one electron, initially far apart and
study what happens as we bring them together (Fig. 19.5). On each
plot the wave function ψ for the ground state is shown.

Now ψ and −ψ are equally good eigenfunctions. When the wells
are far apart (no interaction), the energy of the system is the same
regardless of what signs are chosen for ψ at the two wells. As the wells
begin to interact, the sign of the total $\psi_T$ remains unimportant, but we
get different eigenvalues for the energy, depending on whether the sign
of ψ is the same in both wells (symmetric $\psi_T$) or opposite (antisymmetric
$\psi_T$). That this is true comes from the fact that the symmetric $\psi_T$ is
associated with lower kinetic energy. The expectation value for K,
given by Eqs. (13.14) and (13.15), depends on $|d\psi/dx|^2$, and clearly
$|d\psi/dx|$ is smaller for the symmetric case. By the time the wells meet
(Fig. 19.5c) the symmetric $\psi_T$ corresponds to the ground state for a well
of double width, while the antisymmetric $\psi_T$ corresponds to the first
excited state.

Fig. 19.5 Symmetric and antisymmetric wave functions for two square wells
(a) far apart, (b) close together, and (c) so close that the barrier disappears.

The energy of the system corresponding to the lowest symmetric
and antisymmetric states is plotted in Fig. 19.6. For the symmetric
case it decreases steadily as R decreases, until the barrier disappears,
but for the antisymmetric it rises slightly (the effective wavelength
becomes slightly smaller). As the wells approach, the ground-state
probability of finding one of the electrons between the wells increases
rapidly as though the electrons were moving back and forth between
the wells. It is often said that the electrons resonate between the
two wells. In terms of the resonance concept the attractive force
(F = −dE/dR) between the two wells is due to the resonant exchange
of electrons between them. Again the two electrons can both have the
ground-state spatial eigenfunction only if they have opposite spin.

Fig. 19.6 Energy of symmetric and antisymmetric
states as a function of separation for the two square
wells of Fig. 19.5.

The crucial feature of this model, which applies equally to bringing
two atoms together, is this: When two identical systems approach one
another and interact, each nondegenerate energy level is split into two levels.
The greater the interaction, the greater the splitting becomes. There
are many analogous situations in physics. If two identical resonant
LC circuits are brought together so that they interact through mutual
inductance, the loosely coupled circuits have two resonant frequencies,
one slightly greater and one slightly less than the resonant frequency
of either circuit when it is isolated. Similarly, when two identical
pendulums are loosely coupled, the interaction leads to two normal
modes with frequencies above and below the natural frequency of each
pendulum.

When three identical square wells (or atoms, or LC circuits, or
pendulums) interact, a single energy level (or frequency) is split into
three. In general, for N interacting particles there is an N-fold splitting
of states. When N is very large, as it is when many atoms interact
to form a solid, what was for each atom a sharp energy level may be
spread out into a broad band of energies, as we shall see in Chap. 21.
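The N-fold splitting can be illustrated with a deliberately simplified model, which is not the square-well calculation of the text. Assume N identical systems in a row, each with an isolated level `e0`, and a coupling of strength `t` between nearest neighbors only; both parameters are hypothetical. The energy matrix is then tridiagonal, and its eigenvalues are known in closed form, $E_k = e_0 - 2t\cos[k\pi/(N+1)]$ for $k = 1, \ldots, N$. The sketch lists these levels and checks them against the determinant of the matrix.

```python
import math


def chain_levels(n, e0, t):
    """Closed-form levels of n identical sites with nearest-neighbor coupling -t."""
    return sorted(e0 - 2.0 * t * math.cos(k * math.pi / (n + 1))
                  for k in range(1, n + 1))


def char_poly(n, e0, t, energy):
    """det(H - E) for the n-site tridiagonal matrix, by three-term recurrence."""
    d_prev, d = 1.0, e0 - energy
    for _ in range(n - 1):
        d_prev, d = d, (e0 - energy) * d - t * t * d_prev
    return d


E0, T = -5.0, 0.5  # illustrative values in arbitrary energy units

for n in (1, 2, 3, 10, 100):
    levels = chain_levels(n, E0, T)
    width = levels[-1] - levels[0]
    print(f"N = {n:3d}: {len(levels)} levels, spread = {width:.3f}")
```

In this model two coupled systems give levels at `e0 - t` and `e0 + t`, three give `e0` and `e0 ± √2 t`, and for large N the levels fill a band whose width approaches `4t`. The growth of the splitting with the strength of the interaction corresponds to increasing `t`.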


19.6 Molecular Spectra 


In addition to spectra emitted by atoms, there are other spectra in vast
variety emitted by molecules containing two or more atoms. Thus, in 


resolving power, one observes, at the extreme (violet) edge of the visible 


long-wavelength edge, called the head of the band, and are separated 
farther and farther toward the short-wavelength side. The lines are so 
close together as to appear, under low resolving power, like a continuous 
spectrum. These bands are ascribed to molecules of cyanogen, CN.
There are also many groups or bands of lines in the infrared which have
a molecular origin.

In general, the spectrum emitted by any kind of molecule can 
be divided into three spectral ranges which correspond to different types
of transition between molecular quantum states. (The spectrum of 
some molecules is confined to only one of these ranges.) Simple reasoning 
in terms of classical assumptions concerning molecular structure leads 
us to expect such a feature in molecular spectra, and the reasoning needs 
only to be translated into wave-mechanical terms in order to constitute 
a correct theoretical approach to the subject. 


a. Rotation Spectra Suppose a molecule were a rigid structure but
contained electric charges so disposed that the molecule possessed an
electric moment. If such a molecule were to rotate, according to classical
theory, it would emit radiation for essentially the same reason that an
electron revolving in a circle would radiate. The radiation would
consist of sine waves having the frequency of rotation. Conversely,
radiation falling upon such a molecule could set it into rotation, energy
of the radiation being at the same time absorbed.

Spectral lines corresponding to this simple picture are observed in
the far infrared and constitute rotation spectra. Typical photon energies
involved are 10⁻⁵ to 10⁻³ eV.


b. Vibration-Rotation Spectra If the molecule were not rigid but contained
atoms capable of vibration under elastic forces about equilibrium
positions, and if the binding were ionic so that some atoms contained
an excess of positive charge and others an excess of negative, then according
to classical theory, radiation would be emitted by the vibrating atoms
as they move back and forth. Unless the molecule were at the same
time rotating, the frequency emitted would be that of the atomic vibration.
If the molecule were rotating, however, the emitted line would
be divided into two lines having frequencies respectively greater or less
than the frequency of the atomic vibration, in essentially the same way
as, in the classical theory of the Zeeman effect, a magnetic field modifies
the frequencies emitted by a vibrating electron.

Furthermore, it would be anticipated that if the amplitude of vibration
became large, the atomic vibrations, although still periodic, would no
longer be simple harmonic. This is true even in the familiar example of
the vibrations of a pendulum. The radiation emitted could then be
resolved by Fourier analysis into wave trains with frequencies representing
the fundamental and the harmonic overtones of the atomic vibrations.
Each of these separate frequencies would then be split up further by
rotation of the molecule.

Many spectra corresponding roughly to this classical picture are
known in the infrared and are called vibration-rotation spectra. Photon
energies are often in the range 0.2 to 2 eV.


c. Electronic Spectra Finally, according to classical ideas, an electron 
in the molecule might vibrate by itself and so radiate. The emitted 
radiation would be affected, however, both by the vibrations of the atoms 
in the molecule and by the rotation of the molecule as a whole. It would
probably be one of the outer electrons that radiated in the optical region 
of the spectrum, and its frequency would be much affected by the instan- 
taneous position and motion of the nuclei. The rotation of the molecule 
would tend to split up the emitted lines as in the emission of the vibration- 
rotation spectrum. Molecular bands in the visible and ultraviolet, such 
as the cyanogen bands described above, correspond roughly to this 
third classical picture. 

The three types of molecular spectra thus characterized will be 
taken up in succession for a brief discussion. 
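To place the photon energies quoted above on a wavelength scale, one can use $\lambda = hc/E$. The following sketch is a unit conversion only; the value $hc \approx 1.23984$ eV·μm follows from the SI constants, and the function names are illustrative.

```python
HC_EV_UM = 1.23984  # h*c expressed in eV*micrometer


def wavelength_um(energy_ev):
    """Photon wavelength in micrometers for a photon energy in eV."""
    return HC_EV_UM / energy_ev


def wavenumber_cm(energy_ev):
    """Photon wave number in cm^-1 for a photon energy in eV."""
    return 1.0e4 / wavelength_um(energy_ev)


ranges_ev = {
    "rotation (1e-5 to 1e-3 eV)": (1.0e-5, 1.0e-3),
    "vibration-rotation (0.2 to 2 eV)": (0.2, 2.0),
}

for label, (low, high) in ranges_ev.items():
    print(f"{label}: {wavelength_um(low):.3g} um down to {wavelength_um(high):.3g} um")
```

A photon of 10⁻³ eV has a wavelength of about 1240 μm, and one of 0.2 eV about 6.2 μm (roughly 1600 cm⁻¹).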


19.7 Rotation Spectra 


As a simple model to illustrate certain features of the behavior of actual 
molecules, we may imagine a molecule to consist of several mass points 
held rigidly at fixed distances from each other. The quantum states 
for such a molecule, according to wave mechanics, would be characterized 
by fixed values for the angular momentum, in the same way as the states 
of an atom. The corresponding quantum number J, however, is confined
here to integral values (zero included). The discussion will be restricted
hereafter almost entirely to diatomic molecules. If there are only two
mass points in the molecule, the line joining them is an axis of symmetry,
and only rotation about an axis perpendicular to this line has significance;
furthermore, the moment of inertia about all such perpendicular axes 
will have the same value. 

The relation between angular momentum and energy is found to be
the same according to wave mechanics as in classical theory. For the
angular momentum A and energy E, we shall have, therefore, in terms
of the angular velocity ω and moment of inertia I,

$$A = I\omega \qquad E = \tfrac{1}{2}I\omega^2$$

Therefore

$$E = \frac{A^2}{2I}$$

Inserting here the wave-mechanical value of A², we have

$$A^2 = J(J + 1)\hbar^2$$

$$E = J(J + 1)\frac{\hbar^2}{2I} = \frac{J(J + 1)h^2}{8\pi^2 I} \tag{19.3}$$

For transitions between these levels, ΔJ = −1
corresponds to emission of energy and ΔJ = +1 to absorption. In a
transition from state J to state J − 1, the emitted frequency is

$$\nu = \frac{E_J - E_{J-1}}{h} = 2BJ \tag{19.4a}$$

$$B = \frac{h}{8\pi^2 I} \tag{19.4b}$$

since J(J + 1) − (J − 1)(J − 1 + 1) = 2J. Thus the molecule emits
a spectrum consisting of equally spaced lines, with frequencies equal to a
multiple of a fixed number, B. For an emission line, J refers to the
initial state for the molecular transition; for an absorption line, to the
final state. The corresponding type of energy-level diagram is illustrated
in Fig. 19.7.

Fig. 19.7 Energy-level diagram for a rotation spectrum.

All the hydrogen halides in the gaseous state show broad absorption
lines in the far infrared which are nearly equally spaced and whose
wavenumbers are nearly multiples of a constant quantity. These lines
are believed to result from transitions in which the rotational state of the
molecule alone changes, almost according to the simple theory just
described. Thus for HCl, Czerny found a series of absorption maxima
at wavelengths ranging from 120 to 44 $\mu$. The corresponding wave
numbers are listed as $\tilde{\nu}$ in Table 19.1. Under the heading $J$ is
given the larger of the two values of $J$ assigned to each transition.

Table 19.1 Absorption spectrum of HCl in the far infrared

     J      $\tilde{\nu}$, cm$^{-1}$      $\Delta\tilde{\nu}$, cm$^{-1}$
     4            83.03                    20.70
     5           103.73                    20.57
     6           124.30                    20.73
     7           145.03                    20.48
     8           165.51                    20.35
     9           185.86                    20.52
    10           206.38                    20.12
    11           226.50

We note that the spacing of the maxima is almost uniform but shows a
slight trend. A decrease in $\Delta\tilde{\nu}$ as $J$ increases, and hence
also in the apparent value of $B$, implying an increase in $I$, is what would
be expected if the atoms are not tightly bound together but become slightly
pulled apart by centrifugal action as the speed of rotation increases.
Substituting in (19.4a, b) $\nu = 83.03 \times 3 \times 10^{10}$ s$^{-1}$ and
$J = 4$, we find for HCl, $I = 2.7 \times 10^{-47}$ kg-m$^2$.
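
The arithmetic behind Table 19.1 and the quoted moment of inertia can be reproduced as follows. This is a sketch only: the wavenumbers are those of the table, while the rounded physical constants and the helper name `moment_of_inertia` are assumptions introduced for illustration.

```python
import math

H_J_S = 6.626e-34      # Planck constant, J s (rounded)
C_CM_PER_S = 2.998e10  # speed of light, cm/s (rounded)

# (J, wavenumber in cm^-1) as listed in Table 19.1
TABLE = [(4, 83.03), (5, 103.73), (6, 124.30), (7, 145.03),
         (8, 165.51), (9, 185.86), (10, 206.38), (11, 226.50)]

# Successive differences, i.e. the third column of the table.
differences = [round(b[1] - a[1], 2) for a, b in zip(TABLE, TABLE[1:])]


def moment_of_inertia(j: int, wavenumber_cm: float) -> float:
    """Solve nu = 2 B J with B = h / (8 pi^2 I) for I, in kg m^2."""
    nu = wavenumber_cm * C_CM_PER_S   # frequency in s^-1
    b = nu / (2 * j)                  # rotational constant in s^-1
    return H_J_S / (8 * math.pi ** 2 * b)


inertia_j4 = moment_of_inertia(4, 83.03)
inertia_j11 = moment_of_inertia(11, 226.50)
print(differences)
print(f"I from J = 4 line:  {inertia_j4:.2e} kg m^2")
print(f"I from J = 11 line: {inertia_j11:.2e} kg m^2")
```

The value of $I$ inferred from the $J = 11$ line comes out slightly larger than that from the $J = 4$ line, which is the centrifugal stretching described above.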


19.8 Vibration-Rotation Spectra 


a. Approximate Theory of a Vibrating Diatomic Molecule  A simple
model of a diatomic molecule in which atomic vibrations can occur
supposes the atoms themselves to be point masses that are held by a
force that varies with the distance between them. Let this force
correspond to a potential energy $P$, which, plotted as a function of the
distance $r$ between atomic centers, is represented by the curve marked $P$
in Fig. 19.8. The potential has been arbitrarily taken to be zero at
$r = \infty$. The force exerted by either atom on the other is proportional
to the slope of this curve. From the point $r_0$ at which $P$ has its
minimum value outward, the force is attractive; for $r < r_0$, it is
repulsive, rising rapidly to high values. Under the influence of such a
force, according to classical theory, the atoms could be at rest and in
equilibrium at the distance $r_0$. If disturbed moderately, they would
vibrate about this point; if given kinetic energy exceeding $-P_0$, however,
where $P_0$ is the value of $P$ at $r = r_0$, they would fly apart; i.e.,
dissociation of the molecule would ensue. Such a picture corresponds to the
observed properties of molecules, and it is also suggested by
wave-mechanical theory.


Fig. 19.8 The potential energy $P$ and vibrational levels of a
diatomic molecule in its electronic ground state, plotted against
internuclear separation; associated with each vibrational state are many
rotational levels. The dotted curve represents the potential function for
a simple harmonic oscillator. Also shown is an excited electronic state
with a few of its vibrational levels and some electronic-band transitions
(Sec. 19.10).

According to wave mechanics, however, there would be a set of
discrete quantum states for the positions of the atoms relative to each
other, with energies as suggested by the horizontal lines in Fig. 19.8.
The lowest of these quantum states would correspond to an energy a
little greater than $P_0$; even in their lowest state the atoms have a
certain amount of kinetic energy, analogous to the zero-point energy
$h\nu/2$ of a harmonic oscillator. Thus the interatomic distance is not
quite fixed even when the molecule is in this state, although the most
probable value of the interatomic distance will be close to $r_0$. Let us
number the quantum states in the order of increasing energy, denoting the
number of a state by $v \ge 0$. Let the corresponding energy be denoted by
$E_v$. The values of $E_v$ are all negative, zero energy belonging to a
state in which the atoms are at rest at infinity. Thus, $-E_v$ represents
the energy of dissociation of the molecule when it is in state number $v$
and not rotating. The total number of the discrete states may be finite or
infinite, depending upon the form of the potential curve.

The wave functions and energies for the first few states should
resemble those for a harmonic oscillator. For, if we expand $P$ in a
Taylor series about $r = r_0$ and note that $dP/dr = 0$ at $r = r_0$, we
obtain

$$P = P_0 + \frac{1}{2}\left(\frac{d^2P}{dr^2}\right)_{r_0}(r - r_0)^2 + \frac{1}{6}\left(\frac{d^3P}{dr^3}\right)_{r_0}(r - r_0)^3 + \cdots \tag{19.5}$$

The first two terms of this series represent a potential function of the
same type as that for a harmonic oscillator with $(d^2P/dr^2)_{r_0}$
playing the role of spring constant $\beta$. Near $r = r_0$ the potential
curve follows the harmonic-oscillator curve (dotted line) closely, and the
lowest energy states correspond to those for the harmonic oscillator with
$v$ playing the role of $n$ in Eq. (13.28). As long as the
harmonic-oscillator curve is followed, the vibrational energy levels are
given by

$$E_v = (v + \tfrac{1}{2})h\nu_0 \tag{19.5a}$$

where $\nu_0 = (d^2P/dr^2)_{r_0}^{1/2}/2\pi m_r^{1/2}$ ($m_r$ is the reduced
mass of the vibrating system), and the energy is now measured relative to
the minimum of the potential energy curve at $r_0$.
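
As a rough worked example of the relation between $\nu_0$, the reduced mass, and the curvature of the potential, the sketch below inverts it to estimate the effective spring constant of HCl. It rests on several assumptions that are not part of the derivation above: the HCl band near 3.46 $\mu$ discussed later in this section is taken as $\nu_0$, the atomic masses are approximated by the mass numbers 1 and 35, and the constants are rounded.

```python
import math

C_CM_PER_S = 2.998e10   # speed of light, cm/s (rounded)
H_J_S = 6.626e-34       # Planck constant, J s (rounded)
U_KG = 1.6605e-27       # atomic mass unit, kg (rounded)
EV_J = 1.602e-19        # joules per electron volt (rounded)

# Assumptions: masses approximated by mass numbers (H = 1, Cl = 35),
# and the 3.46-micron band treated as the harmonic frequency nu_0.
M_H, M_CL = 1.0, 35.0
WAVELENGTH_CM = 3.46e-4

reduced_mass = M_H * M_CL / (M_H + M_CL) * U_KG   # m_r in kg
nu_0 = C_CM_PER_S / WAVELENGTH_CM                 # s^-1

# nu_0 = sqrt(k / m_r) / (2 pi)  =>  k = (2 pi nu_0)^2 m_r
force_constant = (2 * math.pi * nu_0) ** 2 * reduced_mass   # N/m
zero_point_ev = 0.5 * H_J_S * nu_0 / EV_J                   # h nu_0 / 2 in eV

print(f"nu_0           = {nu_0:.3e} s^-1")
print(f"force constant = {force_constant:.0f} N/m")
print(f"zero-point     = {zero_point_ev:.3f} eV")
```

Under these assumptions the curvature $(d^2P/dr^2)_{r_0}$ comes out at a few hundred newtons per meter and the zero-point energy at somewhat under 0.2 eV.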

As $v$ increases, the potential energy departs from the parabolic
approximation, and the vibrational levels get closer together. In the
higher vibrational states the average separation of the atoms is greater
than for the lower states. As the temperature is increased, the molecule
is likely to be found in a higher vibrational level and the average
interatomic spacing increases. The wave functions and energies for the
higher states depart considerably from those for an oscillator, owing
to the influence of the remaining terms of the series. Thus the selection
rule for the harmonic oscillator, $\Delta v = \pm 1$, cannot be expected to
hold, in general, for a diatomic molecule, even $\Delta v = 0$ being
possible. The probability of a jump may be expected, however, to fall off
rather rapidly as $|\Delta v|$ increases above 1.

To obtain quantum states for the whole molecule, allowance must
then be made for rotation. If we suppose that, to a sufficient
approximation, the energies of rotation and of vibration are additive, we
may write for the total energy, by Eq. (19.3),

$$E = E_v + J(J + 1)Bh \tag{19.6a}$$

$$B = \frac{h}{8\pi^2 I} \tag{19.6b}$$


The various values of $E_v$ are called vibrational levels of the molecule;
the energies represented by both terms on the right in Eq. (19.6a) are
called vibration-rotation levels, or simply rotational levels. In actual
cases $Bh$ is usually very small relative to the difference between
successive values of $E_v$; hence, the rotational levels belonging to each
vibrational level form a closely spaced group. The general arrangement of
the rotation-vibration levels is illustrated in Fig. 19.9 in which,
however, the relative spacing of the rotational levels is enormously
exaggerated and only a few of these levels are shown.

Because of the relative smallness of $B$, all lines arising from
transitions between two given vibrational levels will lie close together;
they are said to constitute a band, because with low resolving power they
appear continuous. Bands arising from transitions in which only the
vibrational and perhaps rotational energies of the molecule change are
called vibration-rotation bands.

For a molecule composed of two point masses, the selection rule
for $J$ is found to be, as for the pure rotation lines,

$$\Delta J = \pm 1$$

For many molecules of more complicated structure, $\Delta J = 0$ is also
allowed. A few transitions allowed by the rule $\Delta J = \pm 1$ are
indicated in Fig. 19.9.
From the approximate expression for the energy in Eq. (19.6a),
we find for the frequencies emitted in a transition between levels $v'$,
$J - 1$ and $v''$, $J$, or between $v'$, $J$ and $v''$, $J - 1$,
respectively,

$$v', J - 1 \to v'', J: \qquad \nu = \nu_{v'v''} - 2BJ \qquad J = 1, 2, 3, \ldots \tag{19.7a}$$

$$v', J \to v'', J - 1: \qquad \nu = \nu_{v'v''} + 2BJ \qquad J = 1, 2, 3, \ldots \tag{19.7b}$$

$$\nu_{v'v''} = \frac{E_{v'} - E_{v''}}{h} \tag{19.7c}$$

It is assumed here that $v' > v''$ and $E_{v'} > E_{v''}$.

According to Eqs. (19.7a, b), the lines in a given vibration-rotation
band emitted by a diatomic molecule should be equally spaced in
frequency, with a separation $2B$. It is clear, however, that the value
$\nu = \nu_{v'v''}$ cannot occur, so that there is no line corresponding to
pure vibration. The nearest frequencies to this are
$\nu = \nu_{v'v''} + 2B$ and $\nu_{v'v''} - 2B$. The central line of the
band should thus appear to be missing, as is illustrated in Figs. 19.9 and
19.10.

Fig. 19.9 Energy levels for a diatomic molecule with the separations of
the rotational states enormously exaggerated relative to the vibrational
separations. A few radiative transitions for the first vibration-rotation
band are shown.
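
The structure of such a band, including the missing central line, can be generated directly from Eqs. (19.7a, b). The sketch below works in wavenumbers and uses assumed, roughly HCl-like values for the band center and for $B$; the function name `band_lines` is hypothetical.

```python
def band_lines(center: float, b: float, j_max: int) -> list[float]:
    """Lines of one vibration-rotation band, in the same units as center and b."""
    low_side = [center - 2 * b * j for j in range(1, j_max + 1)]    # Eq. (19.7a)
    high_side = [center + 2 * b * j for j in range(1, j_max + 1)]   # Eq. (19.7b)
    return sorted(low_side + high_side)


# Assumed illustrative values in cm^-1: a band centered near 3.46 microns
# and a rotational constant of about 10.4 cm^-1.
CENTER = 1.0e4 / 3.46
B = 10.4

lines = band_lines(CENTER, B, 5)
gaps = [hi - lo for lo, hi in zip(lines, lines[1:])]

for line in lines:
    print(f"{line:8.2f} cm^-1")
print("gaps:", [round(g, 2) for g in gaps])
```

All gaps equal $2B$ except the one straddling the band center, which is $4B$ because no line occurs at $\nu_{v'v''}$ itself.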


b. Relation between Rotation and Vibration-Rotation Bands  There
should exist a relation between the vibration-rotation bands and the
pure rotation spectrum of a given substance; indeed, the rotation
spectrum is simply a band arising from transitions in which
$\Delta v = 0$. If we put $\nu_{v'v''} = 0$ in Eq. (19.7b), we obtain
$\nu = 2BJ$, in agreement with (19.4a). Thus, the separations of the lines
should be the same in the rotation and in the vibration-rotation parts of
the spectrum, being equal to $2B$ in both cases. According to (19.7a, b),
the separation of the innermost two lines ($J = 1$) in a
vibration-rotation band is $4B$. In the HCl band just described, this
separation is 41.60 cm$^{-1}$. Half of this, or 20.8, agrees very well with
$\Delta\tilde{\nu}$ as shown in Table 19.1.


c. Bands of Higher Order and the Heat of Dissociation  Another
theoretical point that can be tested is the prediction that although
transitions for $\Delta v > 1$ are to be expected, the resulting bands
should be relatively weak. If the vibrational levels were equally spaced,
as for the harmonic oscillator, the vibrational frequencies of the bands
for $\Delta v \ge 1$ would be proportional to 1, 2, 3, . . . , just as they
are for the harmonic overtones of a classical vibrating system. If the HCl
band at 3.46 $\mu$ arises from a transition between the lowest two
vibrational levels ($v = 0$, $v = 1$), we expect to find bands diminishing
progressively in intensity at about 1.73, 1.15, 0.865 $\mu$, etc. Actually,
by observing the absorption through very thick layers of HCl gas, bands
have been observed at 1.76, 1.20, and 0.916 $\mu$, the last mentioned being
10,000 times weaker than the 3.46 band.

Fig. 19.10 The principal absorption band of HCl in the near infrared.
(After E. S. Imes.) The numbers on the lines give the larger $J$ for the
transition; those for which $\Delta J$ is opposite in sign to $\Delta v$
are primed. (Reprinted from G. Herzberg, "Spectra of Diatomic Molecules."
Copyright 1950. D. Van Nostrand Co., Inc.)
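
The harmonic prediction for the overtone bands is simply the fundamental wavelength divided by 2, 3, 4. A short sketch comparing it with the observed values quoted above (the names are illustrative only):

```python
FUNDAMENTAL_MICRONS = 3.46                 # HCl band quoted in the text
OBSERVED_MICRONS = [1.76, 1.20, 0.916]     # observed overtone bands quoted in the text


def harmonic_overtone(fundamental: float, n: int) -> float:
    """Wavelength of the n-th harmonic if the levels were equally spaced."""
    return fundamental / n


expected = [harmonic_overtone(FUNDAMENTAL_MICRONS, n) for n in (2, 3, 4)]

for n, (calc, obs) in enumerate(zip(expected, OBSERVED_MICRONS), start=2):
    print(f"n = {n}: harmonic {calc:.3f} micron, observed {obs:.3f} micron")
```

Each observed band lies at a slightly longer wavelength than the harmonic estimate, consistent with vibrational levels that draw closer together as $v$ increases.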

A further check is furnished by a connection with the heat of
dissociation of the molecule, which is the energy that must be added to a
molecule in its lowest quantum state in order to separate the atoms and
leave them at rest an infinite distance apart. Obviously, the energy
of the molecule in any bound state must be less than the energy of the
separated atoms, else the molecule would dissociate spontaneously.
Hence, the quantity $h\nu$ for any molecular line, representing the
difference in energy between two molecular states, must be less than the
heat of dissociation. For the HCl band at $\lambda = 0.916$ $\mu$,
$h\nu = 1.4$ eV. The heat of dissociation of 1 g molecule of HCl, for
dissociation into H$_2$ and Cl$_2$, is 92 kJ, whereas the heats of
combination of 1 g atom of H or Cl into H$_2$ or Cl$_2$ are, respectively,
211 and 120.3 kJ. Hence to dissociate HCl into H and Cl requires
92 + 211 + 120.3 kJ/g molecule, or 4.4 eV/molecule. This is more than 3
times the value of $h\nu$ as just calculated for the highest-frequency
vibration band of HCl that has been observed.
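
The unit conversions in this comparison can be verified with a few lines. The sketch below uses rounded conversion factors (about 96.485 kJ per gram molecule for each eV per molecule, and $hc \approx 1.2398$ eV-$\mu$); the helper names are assumptions made for illustration.

```python
KJ_PER_MOL_PER_EV = 96.485   # kJ per gram molecule equivalent to 1 eV per molecule
HC_EV_MICRON = 1.2398        # h * c in eV * micron


def kj_per_mol_to_ev(kj_per_mol: float) -> float:
    """Convert an energy per gram molecule (kJ) to eV per molecule."""
    return kj_per_mol / KJ_PER_MOL_PER_EV


def photon_energy_ev(wavelength_microns: float) -> float:
    """Photon energy h * nu for a given wavelength in microns."""
    return HC_EV_MICRON / wavelength_microns


dissociation_kj = 92 + 211 + 120.3          # figures quoted in the text
dissociation_ev = kj_per_mol_to_ev(dissociation_kj)
photon_ev = photon_energy_ev(0.916)

print(f"heat of dissociation: {dissociation_ev:.2f} eV per molecule")
print(f"h nu at 0.916 micron: {photon_ev:.2f} eV")
print(f"ratio: {dissociation_ev / photon_ev:.2f}")
```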


d. Effect of Thermal Agitation upon Infrared Bands  No absorption
bands are observed for which $v'' > 0$ in the spectrum of HCl. This is
explained as a consequence of the wide spacing of the vibrational levels
relative to the quantity $kT$. Molecules are seldom thrown by thermal
agitation into a vibrational level which can serve as the initial level
for an absorption band with $v'' > 0$. For $\lambda = 3.5$ $\mu$, we find
$h\nu = 0.35$ eV; whereas, at $T = 290$°K, $kT = 0.025$ eV. If $E_0$, $E_1$
are the energies of the two states involved in the production of
$\lambda = 3.5$ $\mu$, the ratio of the number of molecules in the upper of
these two states to that in the lower will be the ratio of their Boltzmann
factors, Eq. (4.20),

$$\frac{n_1}{n_0} = e^{-(E_1 - E_0)/kT} = e^{-h\nu/kT} = e^{-35/2.5} < 10^{-6}$$

since the statistical weights $g_1$ and $g_0$ are equal. This is so small
that practically all molecules of HCl are in their lowest vibrational
states ($v = 0$) at room temperature. Hence only the absorption band for
which $v'' = 0$ can be observed; and the observed rotation spectrum is
the zero-zero vibration-rotation band ($v = 0$ to $v = 0$).
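
The Boltzmann estimate above is easy to reproduce. The following sketch assumes rounded values of Boltzmann's constant and of $hc$; the function `boltzmann_ratio` and its optional statistical-weight arguments are illustrative, not part of the text.

```python
import math

K_EV_PER_K = 8.617e-5    # Boltzmann constant, eV per kelvin (rounded)
HC_EV_MICRON = 1.2398    # h * c in eV * micron (rounded)


def boltzmann_ratio(gap_ev: float, temperature_k: float,
                    g_upper: float = 1.0, g_lower: float = 1.0) -> float:
    """Population of the upper state relative to the lower state."""
    kt = K_EV_PER_K * temperature_k
    return (g_upper / g_lower) * math.exp(-gap_ev / kt)


gap_ev = HC_EV_MICRON / 3.5          # h nu for a wavelength of 3.5 microns
kt_ev = K_EV_PER_K * 290.0           # kT at 290 K
ratio = boltzmann_ratio(gap_ev, 290.0)

print(f"h nu = {gap_ev:.3f} eV, kT = {kt_ev:.4f} eV")
print(f"n1/n0 = {ratio:.2e}")
```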

Among the various rotational levels a wide distribution of the
molecules occurs even at room temperature. The experimental value
of $B$ from Table 19.1 is 20.6/2 cm$^{-1}$, making
$Bhc = 1.3 \times 10^{-3}$ eV. Thus relative to the state with $J = 0$, the
energy of any other rotational state is

$$E_J = J(J + 1) \times 1.3 \times 10^{-3} \text{ eV}$$

compared to room-temperature $kT$ of 0.025 eV. The statistical weight
of each $J$ state is $2J + 1$, corresponding to $M_J = J, \ldots, -J$. To
find the ratio of the number of HCl molecules with $J = 10$ to the number
with $J = 0$ at room temperature, we have, by Eq. (4.20),

$$\frac{n_{10}}{n_0} = \frac{21}{1}\, e^{-0.143/0.025} = 21 e^{-5.7} = 0.07$$

so that even states with $J = 10$ occur 0.07 times as often as states with
$J = 0$, a result in agreement with observed intensities.
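
The same calculation can be extended to every rotational level. A minimal sketch, using the values of $Bhc$ and $kT$ quoted above (the function and variable names are assumptions):

```python
import math

ROT_CONSTANT_EV = 1.3e-3   # B h c for HCl, as quoted in the text
KT_EV = 0.025              # room-temperature kT, as quoted in the text


def relative_population(j: int) -> float:
    """Population of level J relative to J = 0, with weight 2J + 1."""
    energy = j * (j + 1) * ROT_CONSTANT_EV
    return (2 * j + 1) * math.exp(-energy / KT_EV)


populations = {j: relative_population(j) for j in range(0, 13)}
most_populated = max(populations, key=populations.get)

for j, value in populations.items():
    print(f"J = {j:2d}: n_J / n_0 = {value:.3f}")
print("most populated level: J =", most_populated)
```

With these numbers the distribution rises to a maximum at a small nonzero $J$ before the exponential factor takes over, which is why the strongest lines of a band are not those nearest its center.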

In some other substances, such as I$_2$, an appreciable fraction of the
molecules are in higher vibrational states at room temperature, and
other bands can occur in absorption or in the rotation spectrum.


e. Infrared Bands of Other Types of Molecules  Vibration-rotation
spectra are known for many molecules. Their structure is often more
complex than that described above; in many cases transitions for
$\Delta J = 0$ are permitted, so that an additional sequence of lines
occurs and there is no gap in the center of the band. Thus the absorption
spectrum of CO$_2$ shows many vibration-rotation bands in the region from
1.46 to 15.50 $\mu$, and that of water vapor shows many from 0.69 to
6.26 $\mu$; these bands, together with the rotation lines of water vapor,
are responsible for the marked infrared absorption of the earth's
atmosphere.

In order to exhibit vibration-rotation and rotation spectra of
appreciable intensity, a molecule must possess an electric moment.
Homonuclear diatomic molecules, such as O$_2$, N$_2$, H$_2$, Cl$_2$,
possess no moments and, hence, have no spectra of these two types. Gases
composed of such molecules are transparent in the infrared.


19.9 Molecular Quantum States 


Up to this point, the electrons in the molecule have been ignored.
Actually, the wave function for a molecule must contain the coordinates
and spins of all the electrons as well as the coordinates of the nuclei.
When atoms unite into a molecule, the inner electrons in each atom
can be regarded as remaining associated with the nucleus, but the
outer electrons come to belong to the molecule as a whole rather than
to any individual nucleus.


a. Approximate Separation of Electronic and Nuclear Motions  In
many cases, especially for the diatomic molecules, the wave function
can be separated approximately into two factors, of which one has
reference to the electrons whereas the second factor represents
vibrations of the nuclei and rotation of the molecule as a whole. The
first factor then represents the electrons as being in a certain
electronic quantum state. Both the energy and the wave function of the
electronic state depend upon the relative positions of the nuclei; in
this respect the situation in a molecule is quite different from that in
an atom. The mutual electrostatic energy of the nuclei is commonly
included in the energy of the electronic state. The electronic energy
as so defined possesses a minimum value for certain relative positions
of the nuclei; if they move closer together or farther apart, the
electronic energy rises, and so does the energy of the molecule. Thus
the energy of the electronic state functions as a potential energy
tending to hold the atoms together in definite relative positions of
equilibrium. It is in this way that a potential energy such as was
sketched in Fig. 19.8 comes to exist in a diatomic molecule.

In this same approximation, the energy $E$ of the molecule can be
written as the sum of two parts, a negative part $E_e$, representing
roughly the average of the electronic energy but including also the
electrostatic energy due to the mutual repulsion of the nuclei, and a
much smaller positive part $E_n$, which is associated with vibration of
the atoms relative to each other and with rotation of the molecule
as a whole. Thus $E = E_e + E_n$. For a diatomic molecule, the
energy $E_n$ is what was denoted by $E$ in Sec. 19.8.

For a detailed discussion of electronic states we must refer the 
student to other books, but something may be said here by way 
of explanation of the notation that the student will meet. Only 
diatomic molecules will be discussed. 


b. $\Lambda S$ Coupling  As in the theory of atomic states, electronic
spin may be treated in different ways according to circumstances. If
the electronic spin-orbit effects are small, an analog of $LS$ coupling
occurs. It is possible, as a first approximation, to assign a fixed value
to the component of the orbital angular momentum of the electrons
about the nuclear line, or line joining the two nuclei, which is an axis
of symmetry. The magnitude of this momentum is denoted by
$\Lambda\hbar$, where $\Lambda$ is a positive integer or zero: the
molecular quantum number $\Lambda$ corresponds to $M_L$ in atomic theory.
In a diatomic molecule a unique choice of axis is supplied by the nuclear
line. The total orbital momentum of the electrons cannot have a fixed
value in a molecule, however; hence there can be no quantum number $L$.
Spectroscopically, $\Lambda$ plays a role analogous to $L$, since states
corresponding to different values of $\Lambda$ have different energies; in
imitation of the atomic notation for $LS$ coupling, states with
$\Lambda = 0, 1, 2, 3, \ldots$ are indicated by the letters
$\Sigma, \Pi, \Delta, \Phi, \ldots$, respectively.

The electronic spins are then coupled with the introduction of a
quantum number $S$. The value of $2S + 1$ is written as a superscript.
Thus, we obtain a set of $\Lambda S$ electronic states represented by such
symbols as

$$^1\Sigma \quad ^3\Sigma \quad ^1\Pi \quad ^3\Pi \quad \ldots$$

As with atoms, an important significance of the quantum numbers
lies in the associated selection rules. There is a strong tendency
for transitions to be limited to those for which

$$\Delta\Lambda = 0 \text{ or } \pm 1 \qquad \Delta S = 0$$

Instead of introducing next the electronic spin-orbit effects, we
turn to the consideration of the nuclear motions, the effect of which
upon the energy we are assuming to be much larger than the
electronic spin-orbit effects. Each electronic state of the molecule may
be combined with any one of the vibrational states for the nuclei. The
orbital angular momentum of the electrons is then added vectorially
to the angular momentum of rotation of the nuclei about axes
perpendicular to the nuclear line. This results in the introduction of a
rotational quantum number $K$, which may have any integral value
such that

$$K \ge \Lambda$$

It is not possible for $K$ to be less than $\Lambda$ because in a diatomic
molecule the two angular momenta are necessarily perpendicular to each
other. The general selection rule for $K$ is that

$$\Delta K = 0 \text{ or } \pm 1$$

We obtain in this way a set of electronic-vibration-rotation states for
the molecule, numbered with quantum numbers $\Lambda$, $S$, $v$, $K$.

The electronic spin-orbit effect must then be considered. We add
the total orbital angular momentum of electrons and nuclei, represented
by $K$, to the total angular momentum due to electronic spin,
represented by $S$, thus obtaining a grand resultant angular momentum,
for which we introduce the usual quantum numbers $J$ and $M_J$.
The possible values of $J$ are integrally spaced from $K + S$ down to
$|K - S|$. The effect of the electronic spin-orbit interaction is, as in
atomic $LS$ coupling, to separate states having different values of $J$.
Thus a fine structure, analogous to the $LS$ multiplet structure, is
introduced into the electronic-vibration-rotation levels. The
superscript in such a symbol as $^3\Pi$ refers to the (normal) number of
$J$ levels in this fine structure.
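
The rule that $J$ runs in integral steps from $K + S$ down to $|K - S|$ can be made concrete with a small helper. This is an illustrative sketch only; the function name `allowed_j` is hypothetical, and exact fractions are used so that half-integral $S$ is handled without rounding.

```python
from fractions import Fraction


def allowed_j(k: int, s) -> list[Fraction]:
    """Values of J from K + S down to |K - S| in integral steps."""
    s = Fraction(s)
    if k < 0 or s < 0 or (2 * s).denominator != 1:
        raise ValueError("K must be a non-negative integer and S a multiple of 1/2")
    j = k + s
    lowest = abs(k - s)
    values = []
    while j >= lowest:
        values.append(j)
        j -= 1
    return values


# Triplet state (S = 1) with K = 2: three fine-structure levels.
print([str(j) for j in allowed_j(2, 1)])
# Doublet state (S = 1/2) with K = 1: two fine-structure levels.
print([str(j) for j in allowed_j(1, Fraction(1, 2))])
# Singlet state (S = 0): J = K, no fine structure.
print([str(j) for j in allowed_j(5, 0)])
```

Whenever $K \ge S$ the list has $2S + 1$ entries, which is the "normal" number of levels indicated by the superscript.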

In singlet levels, with $S = 0$ and $J = K$, there is no fine
structure, just as in atoms. The HCl molecule, for example, is normally
in a $^1\Sigma$ electronic state; hence, the discussion of the vibrational
and rotational states of this molecule as given above was adequate.


c. $\Omega$ Coupling  When the electronic spin-orbit effects are not
smaller than the effects of molecular rotation, other modes of approach
by means of perturbation theory become appropriate. If the spin-orbit
effect is actually large, we have the analog of $jj$ coupling in atoms;
the electronic orbital and spin momenta about the nuclear line are
first added together, their magnitude being represented by a quantum
number $\Omega$. The values of $\Omega$ are integral or half integral
according as the number of electrons in the molecule is even or odd. Each
electronic state, characterized by a value of $\Omega$, is then combined
with a vibration-rotation state represented by quantum numbers $v$ and
$J$. Thus in this form of coupling, as when $S = 0$, the rotational levels
correspond to the value of $J$.
The energy of a molecular state thus depends in the case of $\Lambda S$
coupling upon five quantum numbers, $\Lambda$, $S$, $v$, $K$, $J$, or in
the case of $\Omega$ coupling upon three, $\Omega$, $v$, $J$. Its
variation with the various numbers presents several different orders of
magnitude. The variation with $K$ or $J$ is comparatively small. Spectral
lines that differ only in the $K$ values for the initial and final states
lie close together and form a band, similar to the vibration-rotation
bands that have been described. In such cases, if $J \ne K$, the energy
varies still less with $J$, the effect of this variation being only to
introduce a fine structure into the lines of the band. In the case of
$\Omega$ coupling, on the other hand, the lines of a band arise from
differences in the values of $J$. The energy varies much more rapidly
with $v$ than it does with $K$ or $J$. Each pair of values of $v$, one for
the initial and one for the final state, gives rise to a possible band.
The electronic states, finally, characterized by various values of
$\Lambda$ and $S$, or of $\Omega$, are separated in energy by differences
of the order of those between atomic $LS$ terms. These general
features of the array of molecular levels are illustrated for a simple
case in Fig. 19.11.

Fig. 19.11 The general arrangement of molecular levels (spacings
exaggerated between rotational levels and grossly contracted between
$^1\Sigma$ and $^1\Pi$ groups).


19.10 Electronic Bands 


The most general type of transition between molecular states is one
in which changes occur in the electronic state of the molecule as well
as in its nuclear vibration-rotation state. The spectra hitherto
discussed, of rotational or vibrational type, represent special cases
in which the electronic state does not change. Such spectra
constitute, however, only a small fraction of all known band spectra.
When the electronic state does change in a transition, the resulting
change in energy is usually so large that the band lies in the visible
or ultraviolet region of the spectrum. Such bands may be called
electronic bands.

In transitions characterized by a given pair of electronic states
and by given values $v'$ and $v''$, representing a fixed pair of
vibrational states, various changes of $J$ (and of $K$, if $K \ne J$) may
occur. The resulting lines form a single band. All the bands due to
transitions between a given pair of electronic states, for all possible
values of $v'$ and $v''$, are said to form a band system. Because various
electronic jumps are possible, the band spectrum of any molecule consists
of many band systems.

The lines in a given electronic band are limited by the selection
rules

$$\Delta K = 0 \text{ or } \pm 1 \qquad \Delta J = 0 \text{ or } \pm 1$$

Even aside from the fine structure that exists when the rotational
levels are numbered by a quantum number $K$ which is different from
$J$, electronic bands are commonly more complicated than the simple
type of vibration-rotation band described above because transitions
for $\Delta K = 0$ (or $\Delta J = 0$) are allowed. Lines for which
$\Delta K$ has the opposite sign to $\Delta v$ constitute the P branch of
the band; those for $\Delta K = 0$ (or $\Delta J = 0$) constitute the Q
branch; those for which $\Delta K$ (or $\Delta J$) is in the same
direction as $\Delta v$ form the R branch. The Q branch is frequently
missing, e.g., in all bands arising from a $\Sigma \to \Sigma$
electronic transition. When $K \ne J$, what is regarded as a single
band sometimes contains more than one branch of each type.

A good approximate expression for the energies of the molecular
levels is often obtained if we employ for the vibration-rotation part
an expression like that in Eq. (19.6a) but allow $B$ to vary with $v$.
We have then for the molecular energy, in cm$^{-1}$,

$$E = E_e + E_v + B_v K(K + 1) + \cdots \tag{19.8}$$

where $B_v$ is positive. Thus, for the levels belonging to two different
electronic states, we may write

$$E' = E'_e + E'_{v'} + B'_{v'} K(K + 1)$$

$$E'' = E''_e + E''_{v''} + B''_{v''} K(K + 1)$$

and we then find for the three branches of the band:

P: $v', K - 1 \to v'', K$

$$\tilde{\nu} = \tilde{\nu}_e + \tilde{\nu}_{v'v''} - (B'_{v'} + B''_{v''})K + (B'_{v'} - B''_{v''})K^2 \tag{19.9a}$$

Q: $v', K \to v'', K$

$$\tilde{\nu} = \tilde{\nu}_e + \tilde{\nu}_{v'v''} + (B'_{v'} - B''_{v''})K(K + 1) \tag{19.9b}$$

R: $v', K \to v'', K - 1$

$$\tilde{\nu} = \tilde{\nu}_e + \tilde{\nu}_{v'v''} + (B'_{v'} + B''_{v''})K + (B'_{v'} - B''_{v''})K^2 \tag{19.9c}$$

Here $K$ refers in each case to the larger of the two values of $K$
concerned in the transition. If the quantum number $K$ does not exist
for the levels in question, $K$ is to be replaced by $J$ in all equations
from (19.8) to (19.9c). The symbol $\tilde{\nu}_e = (E'_e - E''_e)/h$
represents the frequency that would arise from the electronic transition
alone; whereas $\tilde{\nu}_{v'v''} = (E'_{v'} - E''_{v''})/h$, and it is
assumed that $v' > v''$.

Since the quantities $B'_{v'}$ and $B''_{v''}$ in Eqs. (19.9a, b, c)
refer to different electronic states, they may differ considerably. We may
suppose the forces between the atoms to be quite different in the two
cases; hence, their positions of equilibrium and the values of the
moment of inertia of the molecule are also different. This is in
contrast to the case of the vibration-rotation bands, where $B_v$ varies
only a little from one level to another. The quadratic terms in (19.9)
soon make themselves felt. As $K$ increases, the trend of $\tilde{\nu}$ in
one branch is soon reversed, in the P branch if $B'_{v'} > B''_{v''}$, in
the R branch if $B'_{v'} < B''_{v''}$. It is thus a general characteristic
of electronic bands that one branch is folded back on itself and on top of
the others. At the point in the spectrum where a branch turns back, the
lines are crowded together, forming a band head. The band appears to shade
away from the head; some bands are shaded toward the red, some
toward the violet. When a Q branch is present, it may form a second
head, although such behavior is not obviously predicted by Eq. (19.9b).
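
The folding back of a branch, and the resulting band head, follow directly from Eqs. (19.9a, b, c). The sketch below evaluates the three branches for purely hypothetical constants (a band origin of 23000 cm$^{-1}$ standing for $\tilde{\nu}_e + \tilde{\nu}_{v'v''}$, with $B' = 6.5$ and $B'' = 7.5$ cm$^{-1}$); none of these numbers comes from the text, and the function name is an assumption.

```python
def branches(origin: float, b_upper: float, b_lower: float, k_max: int):
    """Wavenumbers of the P, Q, and R branches for K = 1 .. k_max."""
    total = b_upper + b_lower
    diff = b_upper - b_lower
    ks = range(1, k_max + 1)
    p = [origin - total * k + diff * k * k for k in ks]     # Eq. (19.9a)
    q = [origin + diff * k * (k + 1) for k in ks]           # Eq. (19.9b)
    r = [origin + total * k + diff * k * k for k in ks]     # Eq. (19.9c)
    return p, q, r


# Hypothetical constants in cm^-1, chosen only to illustrate the behavior.
ORIGIN, B_UPPER, B_LOWER = 23000.0, 6.5, 7.5

p_branch, q_branch, r_branch = branches(ORIGIN, B_UPPER, B_LOWER, 20)

# With B' < B'' the R branch first rises, then turns back: the turning
# point is the band head.
head_wavenumber = max(r_branch)
head_k = r_branch.index(head_wavenumber) + 1

print(f"R-branch head at K = {head_k}, {head_wavenumber:.1f} cm^-1")
print("R branch:", [round(x, 1) for x in r_branch[:10]])
```

Because every other line of the band lies at a lower wavenumber than the head, a band with these constants would be shaded toward the red. Plotting $K$ against $\tilde{\nu}$ for the three lists gives the parabolic curves of the kind of diagram described next.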

A band of this type is often represented graphically by means
of a Fortrat diagram, on which each line is represented by a point
or circle, the ordinate representing $K$ (or $J$) and the abscissa,
$\tilde{\nu}$. Such a diagram is shown in Fig. 19.12a for the CuH band
$\lambda$ 4280 Å, which has no Q branch; a spectrogram of this band is
shown in Fig. 19.12b. The band is shaded toward the red from a head which
is plainly shown in the spectrogram. The electronic transition is
$^1\Sigma \to {}^1\Sigma$.

The number of different bands in a band system, all arising from
the same electronic transition but with various values of $v'$ and $v''$,
may be very large. The change in $v$, $v' - v''$, however, although

Fig. 19.13 Three band systems of CN and C$_2$ (carbon arc in air).
(Reprinted from G. Herzberg, "Spectra of Diatomic Molecules." Copyright
1950. D. Van Nostrand Co., Inc. Used by permission.)

for a few bands. The Swan bands of C$_2$ result from a
$^3\Pi \to {}^3\Pi$ electronic transition; the violet CN bands, from
$^2\Sigma \to {}^2\Sigma$; the red CN bands, from a
$^2\Pi \to {}^2\Sigma$ transition.

Because electrons move much faster than nuclei, an electronic
transition occurs so rapidly compared with the vibrational motion of the
nuclei that the internuclear separation remains essentially constant
during the transition. This fact, known as the Franck-Condon
principle, is illustrated by the vertical lines of Fig. 19.8. Note that
the average internuclear separation is greater in the higher electronic
state.

The band spectra of molecules containing different isotopes of
the same element are slightly different, because of the difference in
nuclear mass. Extra lines due to this cause in band spectra as observed
have sometimes led to the discovery of rare isotopes, such as the oxygen
isotope of atomic weight 18, present in only about one five-hundredth
as great concentration as O$^{16}$.


19.11 The Raman Effect 


At this point a slight digression may conveniently be made to describe 
the Raman effect, which can occur in atomic spectra but is of greatest 
interest in relation to band spectra. 

When light passes through a "transparent" substance, solid,
liquid, or gaseous, a certain part of the light is scattered in all
directions (the Tyndall effect). The most familiar example is the light
from a clear sky. Rayleigh ascribed such effects to scattering by the
individual molecules or by groups of molecules much smaller in linear
dimension than the wavelength of the light. If the incident light is
monochromatic, the scattered light is ordinarily observed to be
unchanged in frequency, in accordance with Rayleigh's theory.

It was shown in 1925 by Kramers and Heisenberg, however, 
that, according to classical electromagnetic theory, if the scattering 
electrons in an atom or molecule are in motion, the scattered light 
should contain other frequencies in addition to that of the incident 
light. Failure to notice this implication of classical theory had been 
due to the common assumption that the radiating particles were at 
rest except as disturbed by radiation. In 1928, independently of
the theoretical prediction, Raman and Krishnan discovered the
phenomenon experimentally, in the course of an extensive study of
scattering by liquids and solids.

The scattering of light with a change in its frequency has been
studied extensively since then and is commonly called the Raman
effect. To observe it, the incident light should be monochromatic
and very intense. The scattered light is then seen to contain, besides
a line of the same frequency as the incident light, several weak lines
of other frequencies. If the incident frequency $\nu$ is varied, these
other lines move along the frequency axis at the same rate, maintaining
constant frequency differences from $\nu$ and not changing greatly in
intensity. In these respects the Raman lines differ sharply in behavior
from fluorescent lines, the frequencies of which are fixed by the
scattering substance and which flash out only when the incident frequency
falls upon an absorption line of the substance. In the Raman effect
it is frequency shifts in the scattered spectrum that are determined by
the nature of the scatterer rather than the frequencies themselves.

When a photon of frequency $\nu$ is scattered by an atom or molecule
whose quantum state is not altered in the process, the scattered photon
has the same frequency as had the incident photon. But it may
happen that the atom or molecule is changed in the process from a
state in which its energy is $E_1$ to a state of different energy $E_2$.
Conservation of energy then requires that the frequency $\nu'$ of the
scattered photon be modified so that $h\nu' + E_2 = h\nu + E_1$; hence,

$$\nu' = \nu + \frac{E_1 - E_2}{h}$$

In terms of such ideas the Raman effect had been predicted at a much
earlier date by Smekal.

In the expression just written, the term $(E_1 - E_2)/h$ can be
interpreted as a frequency $\nu_{12}$ that the atom might conceivably emit
or absorb in the usual way, in jumping from the first state to the
second. Thus for the Raman line we may write

$$\nu' = \nu \pm \nu_{12}$$

The difference between the frequency of each Raman line and the frequency
of the incident light is thus equal to the frequency of some conceivable
emission or absorption line of the scattering atom or molecule.

The intensity of a Raman line has nothing to do with the intensity
of the emission or absorption line that is thus correlated with it. The
selection rules for the two are quite different; transitions that are
forbidden in ordinary spectra occur freely in the Raman effect. This
is one reason for its great theoretical interest. According to wave
mechanics, a Raman jump is possible between two atomic or molecular
levels A and B only when there exists at least one third level, C, such that
ordinary radiative transitions are allowed between A and C and between
B and C. It is almost as if the atom or molecule actually jumped
first from A to C and then from C to B. The relative probabilities 
of the various processes do not correspond to this simple picture, 
however, nor can it be said that the atom or molecule exists for any 
definite time in state C, as it does in the production of fluorescence. 

In light scattered by polyatomic molecules there may be Raman 
lines correlated in this manner with energetically possible lines in 
either the rotational or the vibration-rotation or the electronic spectra 
of the molecule. It does not matter whether these spectra can actually
be observed or are prevented from direct occurrence by a selection 
rule. 

In the common type of rotation spectrum, the selection rule is
ΔJ = ±1. For the Raman lines associated with this spectrum,
therefore, the selection rule will be either ΔJ = 0 (that is, J → J + 1
for the jump from A to C and J + 1 → J from C to B) or ΔJ = ±2.
The case ΔJ = 0 involves no change in the molecular energy and hence
merely contributes to the ordinary, or Rayleigh, scattering. Thus,
effectively, for Raman lines of purely rotational origin, we must have


$$\Delta J = \pm 2$$


The incident line, as seen in the spectrum of the scattered light, should
be accompanied on each side by several lines spaced twice as far apart
on the frequency scale as are the lines of the rotational spectrum
itself. From Eq. (19.3) it is seen that the displacements of the Raman
lines from the incident line will be proportional to J(J + 1) −
(J − 2)(J − 1), or to 2(2J − 1), J referring to the upper rotational
level. Hence, since J ≥ 2, the displacements are proportional to
6, 10, 14, . . . ; thus the distance from the unmodified line to the
first Raman line is 1.5 times the mutual spacing of the Raman lines
themselves. The origin of the lines is illustrated diagrammatically
in Fig. 19.14.
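
The arithmetic behind the sequence 6, 10, 14, . . . and the factor 1.5 can be checked with a short sketch. The function name is hypothetical, and the displacements are in the arbitrary units of the proportionality stated above, not in cm⁻¹.

```python
def raman_displacement(J):
    """Relative displacement of the rotational Raman line whose upper level is J.

    Evaluates J(J + 1) - (J - 2)(J - 1), which simplifies to 2(2J - 1).
    """
    return J * (J + 1) - (J - 2) * (J - 1)

# Upper levels J = 2, 3, 4, ... give the successive Raman lines.
displacements = [raman_displacement(J) for J in range(2, 7)]
spacing = displacements[1] - displacements[0]   # mutual spacing of the lines
ratio = displacements[0] / spacing              # first displacement / spacing

print(displacements)  # [6, 10, 14, 18, 22]
print(ratio)          # 1.5
```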

Besides these rotational Raman lines, there may also appear, at 
a much greater distance and usually only on the long-wavelength 
side, a band corresponding to the ordinary vibration-rotation spectrum. 
For this band the selection rule is easily seen to be 


$$\Delta J = 0 \ \text{or} \ \pm 2$$


Lines for which ΔJ = 0, involving almost no change in rotational
energy, coalesce into an intense line in the approximate position of
the missing central line of frequency ν₀₁ or ν₁₀ in the
vibration-rotation band. On each side of this line there are much fainter
lines for ΔJ = ±2. In this band the vibrational change is from v = 0 to
v = 1, as for absorption. In case the vibrational level for v = 1 lies
so low that a considerable fraction of the molecules are maintained
in this state by thermal agitation, a similar weaker Raman band
should be observed on the short-wavelength side of the exciting line
as well, associated with the transition from v = 1 to v = 0.


Fig. 19.14 Diagram for a rotational Raman
spectrum.

Many Raman spectra have been observed. They constitute a
valuable source of information concerning molecular quantum states.
The light scattered by HCl gas when strongly illuminated by light
from a mercury arc in oe was studied by Wood and Dieke. They 
found a line at 4581.8 Å which they interpreted as the single intense
line in a vibration-rotation Raman spectrum excited by the Hg line
at 4047 Å. The frequency difference between these two lines is
2886.0 cm⁻¹, which is in excellent agreement with the frequency of the
missing central line in the 3.5-μ band from HCl, viz., 2885.4 cm⁻¹.
Much closer to the exciting line and on both sides of it there were
also a number of lines interpreted as a rotational Raman spectrum.
In the corresponding spectrum adjacent to the mercury line at 4358 Å,
the measured spacings between the lines lay mostly between 41 and
42 cm⁻¹, in good agreement with double the spacing in the ordinary
rotation spectrum of HCl, as shown in Table 19.1.
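
As a rough check on the quoted frequency difference, the two wavelengths can be converted to wavenumbers and subtracted. This is only a sketch: the helper name is hypothetical, and because the exciting line is given here to four figures only, the result can be expected to agree with 2886.0 cm⁻¹ to within a few cm⁻¹, not exactly.

```python
def wavenumber_per_cm(wavelength_angstrom):
    """Convert a wavelength in angstroms to a wavenumber in cm^-1.

    1 cm = 1e8 angstroms, so the wavenumber is 1e8 / wavelength.
    """
    return 1.0e8 / wavelength_angstrom

exciting_line = wavenumber_per_cm(4047.0)   # Hg line, rounded as in the text
raman_line = wavenumber_per_cm(4581.8)      # line observed in the scattered light

# Raman shift: difference between exciting and scattered wavenumbers.
shift = exciting_line - raman_line
print(f"Raman shift = {shift:.1f} cm^-1")
```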

Even homonuclear molecules such as O₂ and N₂ give Raman
spectra corresponding to vibration-rotation and rotational spectra, 
although the latter cannot be observed directly. Such molecules 
receive special discussion in Sec. 20.9. 


19.12 The Ammonia Inversion Spectrum 


The band spectra of polyatomic molecules are complicated. Several 
different modes of vibration of the molecule are possible, and if these 
modes involve oscillating electric moments, they give rise to observable 
vibration-rotation bands. The qualitative features to be expected 
in simple cases were worked out in 1927 by Hund. One feature of 
special interest is represented in the spectrum of gaseous ammonia. 

In the ammonia molecule, NH₃, the nuclei may be thought of
as lying at the corners of a low pyramid, N at the apex and H at the
corners of an equilateral triangular base (Fig. 19.15). It is calculated
that the N nucleus is about 0.36 Å distant from the H₃ plane, the
NH distance being about 1.01 Å and HH, 1.63 Å; the HNH angles
are about 108°. In a vibration of the molecule, the nuclei move only
short distances from their positions of equilibrium. A system of four
particles will have a total of 12 degrees of freedom, but 3 of these
belong to their common center of gravity and 3 more correspond to
rigid rotations, leaving 6 degrees of vibrational freedom. Because
of symmetry, however, degeneracies occur, so that only four distinct
vibrational frequencies are observed in the ammonia spectrum,
namely: ν̃₁ = 3336 cm⁻¹ and ν̃₂ = 3414 cm⁻¹, both near λ = 3 μ;
ν̃₃ = 949.9 cm⁻¹ (10.5 μ) and ν̃₄ = 1627.5 cm⁻¹ (6.2 μ), the latter
very intense. In ν̃₁ and ν̃₃ the nuclei move symmetrically relative to
the axis of the pyramid; for ν̃₂ and also for ν̃₄, two independent
modes exist, so that there are six modes in all.

It will be noted that for given positions of the protons, the N
nucleus may lie either on one side of the H₃ plane or on the other, as
indicated in Fig. 19.15. These arrangements are related to each
other as if by reflection in a mirror placed parallel to the H₃ plane;
one of them cannot be converted into the other by a pure rotation.
They can be interconverted by a rotation plus interchange of two of
the protons. Since all protons have the same properties, it follows
in classical mechanics that these two forms of NH₃ are physically
indistinguishable. In such situations, however, wave mechanics
introduces a nonclassical distinction.

Fig. 19.15 The ammonia molecule NH₃, with numbers between atoms
giving separations in angstrom units.

Consider the variation of the potential energy P of the molecule
as N moves along a line perpendicular to the H₃ plane, x being its
[...] resembling that of Fig. 19.56. [...] the barrier would remain
high. However, [...] move out and in, the barrier becomes low,
perhaps less than 3000 cm⁻¹. In classical mechanics the N atom [...]
be located in one or the other of the two "wells." The wave function
for a quantum state, on the other hand, is in such cases either
symmetric or antisymmetric relative to the center of symmetry and thus
represents simultaneous equal probabilities of finding N in one well
or the other. If the potential barrier is very low or very narrow, the
energies of the quantum states are widely separated, but when the
barrier is high and wide, they come almost together in pairs, a
symmetric and an antisymmetric state close together. That they do not
quite coincide is connected with the fact that, contrary to the classical
case, |ψ|² represents a small probability, different for the two
functions, that N might be found inside the barrier, perhaps even in the
same plane with H₃.

In ammonia the result of this inversion effect is that each of
the vibration-rotation states of the molecule is doubled. The lowest
two levels, one of them being the ground state, lie only about 0.66 cm⁻¹
apart. (The next lowest pair of excited states are 35.7 cm⁻¹ apart.)
The effect on the band spectra is to convert all vibration-rotation or
rotational lines into doublets, the selection rules being such that the
doublet separation Δν̃ is the sum of the inversion separations in the
initial and final states. This effect was found in 1932 by Dennison
and Hardy in the "ν₁ band"; the vibration-rotation lines were observed
as doublets spaced 1.3 cm⁻¹ apart. Soon thereafter Wright and
Randall found a split averaging 1.3 cm⁻¹ in the purely rotational
lines at 71, 83, and 100 μ, observed in absorption.

Radiative transitions between the lowest two molecular levels
themselves are possible, corresponding to a calculated frequency
ν̃ = 0.66 cm⁻¹ or a wavelength λ = 1.5 cm. This, the NH₃ inversion
line, was found by Cleeton and Williams in 1934. They observed
a clear-cut peak in the absorption spectrum of NH₃ at λ = 1.25 cm
(0.8 cm⁻¹). A predicted fine structure in the inversion line, caused by
differences in the various rotational states of the molecule, was observed
in 1946 by W. E. Good, who found 30 inversion-rotation lines
over a range of 6000 MHz, or 0.20 cm⁻¹.
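
The unit conversions in this section (wavenumber, wavelength, and frequency of the inversion line) can be sketched as follows. The helper names and the rounded value of the speed of light are choices made for this illustration, and reading the quoted range of 6000 as megahertz is an assumption that the conversion itself supports.

```python
C_CM_PER_S = 2.998e10  # speed of light in cm/s (rounded)

def wavelength_cm(wavenumber_per_cm):
    """Wavelength in cm corresponding to a wavenumber in cm^-1."""
    return 1.0 / wavenumber_per_cm

def frequency_mhz(wavenumber_per_cm):
    """Frequency in MHz corresponding to a wavenumber in cm^-1."""
    return C_CM_PER_S * wavenumber_per_cm / 1.0e6

def wavenumber_from_mhz(freq_mhz):
    """Wavenumber in cm^-1 corresponding to a frequency in MHz."""
    return freq_mhz * 1.0e6 / C_CM_PER_S

# Calculated inversion splitting: 0.66 cm^-1, i.e. a wavelength of about 1.5 cm.
print(f"{wavelength_cm(0.66):.2f} cm")
# Observed absorption peak: 0.8 cm^-1, i.e. a wavelength of 1.25 cm.
print(f"{wavelength_cm(0.8):.2f} cm, {frequency_mhz(0.8):.0f} MHz")
# Assumed reading of the fine-structure range: 6000 MHz is about 0.20 cm^-1.
print(f"{wavenumber_from_mhz(6000.0):.2f} cm^-1")
```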
Problems


1. (a) Show that if point masses m₁ and m₂ are attached to the ends
of a linear spring with spring constant k, the system vibrates at a
frequency $\nu = (1/2\pi)\sqrt{k/m_r}$, where $m_r = m_1 m_2/(m_1 + m_2)$ is the
reduced mass of the system.

(b) Show that the moment of inertia of these two masses about
an axis through the center of mass and perpendicular to the line of
centers is given by $m_r R^2$, where R is the distance between the masses.

2. From the data of Table 19.1 calculate the equilibrium separation
of the H⁺ and Cl⁻ ions, assuming that the H⁺ ions are protons and that
the chlorine ions are of mass number 35. The handbook value of the
equilibrium separation in the ground state is 1.2746 Å. Why is your
answer a little larger? Why does Δν̃ decrease as J increases?

3. From Raman spectra of hydrogen gas it is found that the fundamental
vibrational transition v = 0 to v = 1 occurs at ν̃ = 4159.2 cm⁻¹.

(a) Find the effective "spring constant" $(d^2E/dR^2)_{R_0}$ for H₂.

(b) Calculate the wavenumber of the corresponding transition for
D₂ (a hydrogen molecule composed of two deuterium atoms), for which
the experimental value is 2990.3 cm⁻¹.

Ans: 520 N/m


4. Calculate the moment of inertia and the energies of the first five 
rotational levels for the KCl molecule. (Assume the Cl atom has 
A = 35.) What are the wavelengths of rotational lines arising from 
transitions between these levels? 

5. Estimate the repulsive energy term for RbCl, given that the
equilibrium separation is 2.89 Å and the dissociation energy is 3.96 eV.
(Assume that the van der Waals attractive interaction is approximately
equal to the vibrational zero-point energy.)

6. (a) Assume that the ion-core repulsion energy has the form a/Rⁿ
for KCl, where a and n are numerical constants. From the data of
Sec. 19.3, obtain values for a and n.

(b) Using the crude values from part (a), estimate the vibrational
frequency for small displacements. (The experimental value is
8.34 × 10¹² Hz.)

7. Show that the frequency of the rotational line arising from a transi- 
tion J + 1 to J for a diatomic molecule lies between the classical rota- 
tional frequencies of the molecule in the upper and lower states, assuming 
the rotational kinetic energy of state J is given by Eq. (19.3). 

8. The force constant for the HCl bond is 470 N/m. Find the
probability that an HCl molecule is in its lowest excited vibrational state
at 290°K.

9. If the hydrogen molecule has a force constant of 573 N/m, assuming 
there were no anharmonic terms in the expansion for the potential energy, 
find the vibrational quantum number which corresponds to the dissocia- 
tion energy. 

10. Show that both the P and R branches of the Fortrat diagram are 
given by the same formula if K is allowed to take on negative values 
in Eq. (19.9). 

11. A molecule of LiH (lithium of mass 7) has an equilibrium
separation of 1.595 Å and a vibrational frequency with a wavenumber of
1406 cm⁻¹.

(a) Find the energy of the first excited rotational state and the
ratio of this energy to that required to excite the first vibrational state.

(b) Compute the force constant for the vibrational frequency of
the molecule.


12. Of HCl molecules in the ground vibrational state (v = 0) compute
the ratio of the number in the ground rotational state (J = 0) to the
number in the first excited state (J = 1) at T = 290°K. (See Sec. 19.7
for moment of inertia of HCl molecule; note that the statistical weight
of each J state is 2J + 1.)

Ans: about 0.4 


13. (a) Show that when kT = hy for vibrational energy states, roughly 
one-half of the molecules are in excited vibrational states. 

(b) Show similarly that when kT is equal to the minimum rotational 
energy, roughly half the molecules are rotating. (Do not forget the 
2J + 1 statistical weight.)

14. (a) Find the temperature at which kT is equal to the energy of
the first excited rotational state for H₂.

(b) If the force constant for H₂ is 573 N/m, find the temperature
at which kT is equal to the excitation energy for the first excited
vibrational level.

(c) What rotational level has energy nearest to that required to
raise the molecule to the v = 1 vibrational state?

(d) If J = 1 and v = 1, compute the number of rotations made
by the H₂ molecule during one vibration.

15. In 1929 Morse suggested an equation giving the energy of a diatomic
molecule as a function of the internuclear spacing R in terms of the
dissociation energy (or binding energy) $E_b$, the equilibrium separation,
and one arbitrary constant a. The Morse function may be written

$$E(R) = E_b\left(e^{-2a(R-R_0)} - 2e^{-a(R-R_0)}\right)$$

(a) Show that the Morse function gives $E(R_0) = -E_b$ and $E(\infty) = 0$.

(b) Show that the vibration frequency for small displacements is

$$\nu = \frac{a}{2\pi}\sqrt{\frac{2E_b}{m_r}}$$

where $m_r$ represents the reduced mass.

(c) Calculate a for the Morse curve for Na₂, given that $E_b$ = 0.75 eV,
R₀ = 3.07 Å, and the wavenumber for the fundamental vibration
transition is 157.8 cm⁻¹.

Ans: (c) 0.84 Å⁻¹

16. The muonic hydrogen molecule-ion (p − μ⁻ − p) is known to
exist. Assuming that the μ⁻ acts as a massive electron, estimate the
interproton separation and the binding energy of the molecule-ion.


chapter twenty


Introduction to Quantum Statistics


Just as the failure of classical mechanics made it necessary to establish
quantum mechanics for the description of nature on the atomic scale, the
failure of classical Maxwell-Boltzmann statistics to provide correct solutions
for many problems involving fundamental particles led to the development
of new systems of quantum statistics. In the present chapter we reexamine
the statistical bases underlying the Maxwell-Boltzmann statistics and
introduce modifications which lead to the Fermi-Dirac statistics, applicable to
all fundamental particles with spin one-half, and to Bose-Einstein statistics,
applicable to particles which have zero or integer spin.


20.1 Statistics of Distinguishable Objects 


Suppose we have N distinguishable objects which we wish to place in 
order along the x axis. In how many ways can this be done? Clearly
there are N possible choices for the object nearest the origin, N − 1
choices for the next position, N − 2 for the third, etc. The number
of possible orders is

$$N(N-1)(N-2)\cdots(1) = N!$$


If we have s boxes, in how many different ways can these N objects
be placed in these boxes with $n_1$ in the first box, . . . and $n_s$ in the sth
box, assuming that we are not concerned with the order of the objects
in the boxes? For the first box there are
$[N(N-1)\cdots(N-n_1+1)]/n_1!$ ways of selecting the $n_1$ objects,
where the denominator is the number of ways the $n_1$
objects could be arranged in the box. Similarly, for the second box the
$n_2$ objects could then be selected in
$[(N-n_1)(N-n_1-1)\cdots(N-n_1-n_2+1)]/n_2!$ ways, etc. For all s boxes
the total number of different ways becomes

$$\frac{N(N-1)\cdots(N-n_1+1)}{n_1!}\cdot\frac{(N-n_1)\cdots(N-n_1-n_2+1)}{n_2!}\cdots\frac{n_s(n_s-1)\cdots(1)}{n_s!} = \frac{N!}{n_1!\,n_2!\cdots n_s!} = \frac{N!}{\prod_{i=1}^{s} n_i!} \tag{20.1}$$

Consider an evacuated box of volume V which is arbitrarily divided
into s subvolumes (Fig. 20.1). Let N idealized, distinguishable point
"molecules" be injected. Here and throughout our development of
distribution laws we assume that the molecules interact only very weakly
with one another, so that only during brief collisions do they exert
substantial forces on each other. Thus we are dealing with an essentially
ideal gas. The probability that any particular molecule is in the ith
subvolume at any instant is given by $V_i/V$, that any preselected pair is
in this subvolume is $(V_i/V)^2$, and that any preselected $n_i$ molecules are in
$V_i$ is $(V_i/V)^{n_i}$. At any instant the probability that there are $n_1$ molecules
in $V_1$, $n_2$ in $V_2$, ..., and $n_s$ in $V_s$ is therefore

$$\mathcal{P}(n_1,n_2,\ldots,n_s) = \frac{N!}{n_1!\,n_2!\cdots n_s!}\left(\frac{V_1}{V}\right)^{n_1}\left(\frac{V_2}{V}\right)^{n_2}\cdots\left(\frac{V_s}{V}\right)^{n_s} = N!\prod_{i=1}^{s}\frac{1}{n_i!}\left(\frac{V_i}{V}\right)^{n_i} \tag{20.2}$$

Fig. 20.1 A box of volume V arbitrarily divided
into s subvolumes.

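If we sum $\mathcal{P}$ over all possible sets of values of $n_1$, $n_2$, etc., subject
to the condition that $\sum n_i = N$, we must, of course, get 1 as a result,
since we sum over the probabilities of all possible distributions. Indeed
by the multinomial theorem

$$\left(\frac{V_1}{V}+\frac{V_2}{V}+\cdots+\frac{V_s}{V}\right)^{N} = \sum_{\substack{\text{all possible sets}\\ \text{of values } n_1\ldots n_s}} \frac{N!}{n_1!\,n_2!\cdots n_s!}\left(\frac{V_1}{V}\right)^{n_1}\left(\frac{V_2}{V}\right)^{n_2}\cdots\left(\frac{V_s}{V}\right)^{n_s} = 1 \tag{20.3}$$

(If $n_i = 0$, it is omitted; this can be taken care of by setting 0! = 1.)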
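
The normalization stated in Eq. (20.3) can be verified numerically for a small case. The following is a minimal sketch: the three subvolumes, the choice N = 4, and the function name are all hypothetical values picked for illustration.

```python
from itertools import product
from math import factorial, isclose, prod

def multinomial_probability(counts, volumes):
    """Probability of Eq. (20.2) for occupation numbers `counts`
    in subvolumes `volumes` (any consistent volume units)."""
    N = sum(counts)
    V = sum(volumes)
    ways = factorial(N)
    for n in counts:
        ways //= factorial(n)          # N! / (n_1! n_2! ... n_s!), exact integer
    return ways * prod((v / V) ** n for v, n in zip(volumes, counts))

volumes = [1.0, 2.0, 3.0]   # hypothetical subvolumes V_1, V_2, V_3
N = 4                       # hypothetical number of molecules

# Sum over every set (n_1, n_2, n_3) with n_1 + n_2 + n_3 = N.
total = sum(
    multinomial_probability(counts, volumes)
    for counts in product(range(N + 1), repeat=len(volumes))
    if sum(counts) == N
)
print(total)  # should equal 1 up to rounding, as Eq. (20.3) requires
```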

If N distinguishable but otherwise identical monatomic molecules 
were suddenly injected into an evacuated, perfectly reflecting container 
with a known arbitrary distribution of velocities, knowledge of the posi- 
tions and collision characteristics of the molecules would permit us in 
principle to apply the laws of classical mechanics to predict the future 
motion of each molecule. However, experiments with real molecules and 
containers show that, regardless of the injection mechanism, the memory 
of the initial state is completely obliterated in a very short time. Colli- 
sions soon produce the same equilibrium state for a given N and average 
energy, quite independent of the injection distribution. This equilibrium 
state is the state of maximum probability corresponding to the number of 
molecules N, total energy E, and volume V chosen. Every measurement
of a parameter of the gas made over a finite time interval leads to a 
time-average-over-the-interval value of the parameter in agreement with 
predictions based on the laws of probability. 

To find the equilibrium distribution of the energies we imagine a
one-dimensional space in which the energy of each molecule is represented
as an arrow with its tail at the origin and its length proportional to the
energy (Fig. 20.2). We divide this energy space into a large number s
of contiguous cells of sizes $\Delta\epsilon_1$, $\Delta\epsilon_2$, . . . such that all arrows which end
in the first cell have average energy $\epsilon_1$ and so forth. Let $n_1$ arrows end
in the first cell and $n_i$ in the ith cell.

If E represents the total energy of the molecules, which are assumed
to have negligible interaction energies, it is clear that the numbers $n_i$
must satisfy the two equations

$$\sum_{i=1}^{s} n_i = N = n_1 + n_2 + \cdots + n_s \tag{20.4}$$

$$\sum_{i=1}^{s} n_i\epsilon_i = E = n_1\epsilon_1 + n_2\epsilon_2 + \cdots + n_s\epsilon_s \tag{20.5}$$

Fig. 20.2 At some instant the energy of
every molecule is represented by an arrow
in a one-dimensional energy space.


At this point in a classical treatment we introduce a result of 
quantum mechanics which proves useful. Specifically, we accept the 
facts (1) that the allowed energy states of a bound system are discrete 
and (2) that if any degeneracy exists, it can be removed by application 
of a magnetic field or in some other way. Assuming then a system of 
nondegenerate discrete energy states, we let $g_1$ represent the number of
allowed energy states in cell 1 and $g_i$ the number in the ith cell. Then
the total number of possible energies is

$$G = g_1 + g_2 + \cdots + g_s = \sum_{i=1}^{s} g_i \tag{20.6}$$


In the absence of any exclusion principle or constraint associated with
the conservation of energy, each molecule might be in any one of the G
states and the total number of ways N distinguishable molecules could
be distributed among the states is $G^N$. Each of these possibilities
specifies a particular microscopic state or microstate of the system.
However, if the molecules are all alike, we are ordinarily not concerned with
which particular molecules have energies that place them in cell i but
only with how many are in each energy cell. A statement that there
are $n_1$ molecules in cell 1, $n_2$ in cell 2, and $n_i$ in cell i serves to specify a
macroscopic state of the system. If we now postulate that all
microstates have the same a priori probability, each $g_i$ plays the role of $V_i$ in
Eqs. (20.2) and (20.3) with G corresponding to V. Then the probability
of each distinct macroscopic distribution becomes

$$\mathcal{P}(n_1,\ldots,n_s) = \frac{N!}{n_1!\,n_2!\cdots n_s!}\left(\frac{g_1}{G}\right)^{n_1}\left(\frac{g_2}{G}\right)^{n_2}\cdots\left(\frac{g_s}{G}\right)^{n_s} = \frac{N!}{G^N}\prod_{i=1}^{s}\frac{g_i^{n_i}}{n_i!} \tag{20.7}$$


In the limit of continuous energy states, Eq. (20.7) is still valid if we
take $g_i/G$ to represent the probability that an arbitrary energy arrow
terminates in the ith cell.
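
The content of Eq. (20.7) can be checked by brute force for a very small system: list every one of the $G^N$ equally likely microstates, group them by macrostate, and compare the resulting fractions with the formula. The cell sizes and molecule number below are hypothetical, chosen only to keep the enumeration tiny.

```python
from collections import Counter
from itertools import product
from math import factorial, isclose, prod

g = [1, 2]          # hypothetical: cell 1 holds 1 state, cell 2 holds 2 states
N = 3               # three distinguishable molecules
G = sum(g)          # total number of states, as in Eq. (20.6)

# Map every individual state to the cell that contains it: [0, 1, 1].
cell_of_state = [cell for cell, g_i in enumerate(g) for _ in range(g_i)]

# Enumerate all G**N microstates and tally them by macrostate (n_1, n_2).
tally = Counter()
for microstate in product(range(G), repeat=N):
    cells = [cell_of_state[state] for state in microstate]
    macrostate = tuple(cells.count(cell) for cell in range(len(g)))
    tally[macrostate] += 1

def probability_eq_20_7(n):
    """Right-hand side of Eq. (20.7): (N!/G^N) * prod(g_i^n_i / n_i!)."""
    weight = prod(g_i ** n_i / factorial(n_i) for g_i, n_i in zip(g, n))
    return factorial(N) / G ** N * weight

for macrostate, count in sorted(tally.items()):
    print(macrostate, count / G ** N, probability_eq_20_7(macrostate))
```

The two printed columns should agree for every macrostate, which is just the statement that Eq. (20.7) counts microstates correctly.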
Boltzmann proposed that the equilibrium distribution of energies
of the molecules in a gas is the one for which $\mathcal{P}(n_1, \ldots, n_s)$ is maximum,
subject to the restrictions of Eqs. (20.4) and (20.5), which respectively
require the conservation of molecules and the conservation of energy.
He elected to work with $\ln\mathcal{P}$, which obviously is maximum when $\mathcal{P}$ is
maximum. Boltzmann proved that there is a logarithmic relation
between entropy and probability; subsequently Planck proposed that
the entropy S could be written as $S = K\ln\mathcal{P}$, where K is a constant.
Thus, as required by thermodynamics, the equilibrium configuration
is that of maximum entropy. Clearly

$$\ln\mathcal{P} = \ln N! - N\ln G - \ln n_1! - \cdots - \ln n_s! + n_1\ln g_1 + \cdots + n_s\ln g_s \tag{20.8}$$

According to Stirling's theorem, when N is large,

$$\ln N! = N\ln N - N + \ln\sqrt{2\pi N} + \text{terms of the order of } \frac{1}{N} \tag{20.9}$$


If all n's are large compared to 1, we need take only the first two terms
of the Stirling formula and write

$$\ln\mathcal{P} = N\ln N - N - N\ln G + n_1\ln\frac{g_1}{n_1} + n_1 + \cdots + n_s\ln\frac{g_s}{n_s} + n_s \tag{20.8a}$$
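
How good the two-term Stirling approximation is can be seen numerically. The sketch below compares it, and the three-term form of Eq. (20.9), with the exact value of ln N! obtained from the standard-library function `math.lgamma`; the helper names and the sample values of N are illustrative choices.

```python
from math import lgamma, log, pi

def ln_factorial(n):
    """Exact ln(n!) via the log-gamma function, since n! = Gamma(n + 1)."""
    return lgamma(n + 1)

def stirling_two_terms(n):
    """First two terms of Eq. (20.9): n ln n - n."""
    return n * log(n) - n

def stirling_three_terms(n):
    """Eq. (20.9) including the ln sqrt(2 pi n) term."""
    return n * log(n) - n + 0.5 * log(2 * pi * n)

for n in (10, 100, 10_000):
    exact = ln_factorial(n)
    rel_error = (exact - stirling_two_terms(n)) / exact
    abs_error = exact - stirling_three_terms(n)
    print(f"N = {n:6d}  two-term relative error = {rel_error:.2e}  "
          f"three-term absolute error = {abs_error:.2e}")
```

The relative error of the two-term form shrinks steadily as N grows, which is why it suffices when all the n's are large.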
To find the maximum of $\ln\mathcal{P}$ subject to the restrictions imposed by
Eqs. (20.4) and (20.5) we use Lagrange's method of undetermined
multipliers. Let α and β be two as yet undetermined constants (Lagrange
multipliers). Equations (20.4) and (20.5) can be written

$$\alpha\left[\left(\sum_{i=1}^{s} n_i\right) - N\right] = 0 \qquad\text{and}\qquad \beta\left[\left(\sum_{i=1}^{s} n_i\epsilon_i\right) - E\right] = 0$$

Combining these with Eq. (20.8a) leads to the result that

$$\ln\mathcal{P} = N\ln N - N - N\ln G + \sum_{i=1}^{s} n_i\ln\frac{g_i}{n_i} + \sum_{i=1}^{s} n_i - \alpha\left(\sum_{i=1}^{s} n_i - N\right) - \beta\left(\sum_{i=1}^{s} n_i\epsilon_i - E\right) \tag{20.10}$$

if and only if Eqs. (20.4), (20.5), and (20.8a) are all satisfied.

For $\ln\mathcal{P}$ to be maximum, it is necessary that for each $n_i$

$$\frac{\partial\ln\mathcal{P}}{\partial n_i} = \ln\frac{g_i}{n_i^*} - 1 + 1 - \alpha - \beta\epsilon_i = 0 \tag{20.11}$$

where $n_i^*$ is the value of $n_i$ for which $\mathcal{P}$ is maximum. The distribution
numbers $n_1^*$, $n_2^*$, etc., are of great importance because the number of
distinct microscopic states for them is vastly greater than the number
for any other possible set of $n_i$'s. Indeed, it is overwhelmingly likely
that the gas will be found in a state characterized by $n_i$'s virtually equal
to the $n_i^*$'s satisfying Eq. (20.11). Any other state is so unlikely that it
does not persist throughout the time of a measurement. Solution of
Eq. (20.11) for $n_i^*/g_i$ yields

$$f_{MB}(\epsilon_i) = \frac{n_i^*}{g_i} = e^{-\alpha-\beta\epsilon_i} = e^{-\alpha}e^{-\beta\epsilon_i} \tag{20.12}$$


According to Eq. (4.19), the number of molecules $n_i$ in a small volume
$g_i = dp_x\,dp_y\,dp_z\,dx\,dy\,dz$ of phase space is given by

$$n_i = g_i\left[\frac{N}{V(2\pi mkT)^{3/2}}\right]e^{-\epsilon_i/kT} \tag{20.12a}$$

which is Eq. (20.12) if we identify β with 1/kT and $e^{-\alpha}$ with the constant
in the brackets. That these identifications are proper can be shown by
evaluating the constants α and β by use of Eqs. (20.4) and (20.5). Thus
the Maxwell-Boltzmann (M-B) distribution arises from the statistics
of distinguishable particles.
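
That the exponential distribution really maximizes the probability under both constraints can be illustrated numerically. In the sketch below, three equally spaced energy cells are filled according to $n_i \propto g_i e^{-\epsilon_i/kT}$, and particles are then shifted in a way that leaves both the total number and the total energy unchanged; the logarithm of the statistical weight should fall in either direction. All numerical values (cell sizes, energies in units of kT, N, and the size of the shift) are hypothetical.

```python
from math import exp, lgamma, log

def ln_weight(n, g):
    """ln of prod(g_i^n_i / n_i!), the n-dependent part of Eq. (20.7).

    lgamma(n + 1) = ln(n!) lets n_i be treated as continuous variables.
    """
    return sum(n_i * log(g_i) - lgamma(n_i + 1) for n_i, g_i in zip(n, g))

g = [50.0, 50.0, 50.0]      # hypothetical number of states per cell
eps = [0.0, 1.0, 2.0]       # cell energies in units of kT, so beta = 1
N = 1.0e6                   # hypothetical number of molecules

# Maxwell-Boltzmann occupation numbers, normalized so that they sum to N.
boltzmann = [g_i * exp(-e_i) for g_i, e_i in zip(g, eps)]
n_star = [N * b / sum(boltzmann) for b in boltzmann]

def shifted(delta):
    """Move 2*delta molecules out of the middle cell, delta to each neighbor.

    With equally spaced energies this conserves both N and E.
    """
    return [n_star[0] + delta, n_star[1] - 2 * delta, n_star[2] + delta]

best = ln_weight(n_star, g)
for delta in (-1000.0, 1000.0):
    print(delta, ln_weight(shifted(delta), g) - best)   # both differences negative
```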

In classical statistical mechanics these ideas are applied to the 
molecules of real gases as though these molecules were distinguishable. 
The resulting M-B distribution function is applicable in many situations, 
but in spite of its successes, the M-B statistics failed to yield acceptable 
solutions to many problems involving electrons in atoms and in solids. 


20.2 Indistinguishable and Exclusive Particles 


In deriving the M-B distribution function above, it was assumed that
the molecules are distinguishable. When two billiard balls collide, we
can follow each ball and state with confidence, for example, that ball A
came from the −x direction, struck ball B, and then moved off in the y
direction. However, when two electrons (or protons or identical atoms)
collide, we cannot in general follow the motions of each electron. If
electron A comes from the −x direction and after collision with B, an
electron moves off in the y direction, we cannot tell whether it is A or B.
Electrons are indistinguishable. Each electron has an associated wave
packet, and once the wave packets of the two electrons overlap, we can
no longer keep track of which electron is which. What happens while
the wave functions overlap strongly is hidden in the fog of uncertainty.
Thus billiard balls are distinguishable, but electrons, and indeed all
identical fundamental particles, are indistinguishable.

Further, electrons obey the Pauli exclusion principle (as do other 
particles of half-integer spin); on the other hand, photons, pions, and 
other particles of zero or integer spin do not. We shall call particles 
obeying an exclusion principle exclusive and the others nonexclusive. To 
see how indistinguishability and exclusiveness may influence the statistics 
of particles, consider the number of possible distinct ways in which two 
particles (n = 2) can be placed in three energy states (g = 3). 


CASE 1. DISTINGUISHABLE, NONEXCLUSIVE PARTICLES Let the two
particles be represented by x and o. The distinct ways in which these
particles can be arranged in three energy states are shown below (without
regard to order).

    State 3:   -    -    xo   -    -    o    x    o    x
    State 2:   -    xo   -    o    x    -    -    x    o
    State 1:   xo   -    -    x    o    x    o    -    -

The nine possibilities above are the $g^n$ expectations which have been
used in deriving the M-B results.


CASE 2. INDISTINGUISHABLE, NONEXCLUSIVE PARTICLES Since the
particles are indistinguishable, we use an x to represent either one of
them. There are now six distinct ways of arranging two particles in
three energy levels:

    State 3:   -    -    xx   -    x    x
    State 2:   -    xx   -    x    -    x
    State 1:   xx   -    -    x    x    -

Indistinguishable, nonexclusive particles clearly obey different statistical
rules than those of Maxwell and Boltzmann. Bose and Einstein
introduced statistics applicable for such particles, which are known as bosons.
A glance at the example above suggests that particles which obey B-E
statistics are more likely to be bunched together in the same energy
states than are boltzons; in the M-B case one-third of the possible
arrangements involve two particles in the same level, while in the B-E case it is
one-half.


CASE 3. INDISTINGUISHABLE, EXCLUSIVE PARTICLES Let us
represent each particle by an x and assume the particular exclusion rule
that only one particle can occupy any one energy state. Then the
possible arrangements are

    State 3:   -    x    x
    State 2:   x    -    x
    State 1:   x    x    -

The statistics for indistinguishable exclusive particles was developed by
Fermi and Dirac. Such particles are often called fermions. Clearly,
the introduction of exclusiveness prevents particles from bunching
together in a common energy state. Thus B-E statistics leads to more
bunching and F-D statistics to less bunching than that given by M-B
statistics.
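
The counts for the three cases (two particles, three states) can be reproduced with the standard-library `itertools` functions. This is only an illustrative enumeration; the variable names are hypothetical.

```python
from itertools import combinations, combinations_with_replacement, product

states = range(3)   # three energy states (g = 3)
n = 2               # two particles

# Case 1: distinguishable, nonexclusive. Ordered pairs (state of x, state of o).
maxwell_boltzmann = list(product(states, repeat=n))
# Case 2: indistinguishable, nonexclusive. Unordered pairs, repeats allowed.
bose_einstein = list(combinations_with_replacement(states, n))
# Case 3: indistinguishable, exclusive. Unordered pairs, no repeats.
fermi_dirac = list(combinations(states, n))

def bunched_fraction(arrangements):
    """Fraction of arrangements with both particles in the same state."""
    return sum(1 for a, b in arrangements if a == b) / len(arrangements)

print(len(maxwell_boltzmann), bunched_fraction(maxwell_boltzmann))  # 9, one-third
print(len(bose_einstein), bunched_fraction(bose_einstein))          # 6, one-half
print(len(fermi_dirac), bunched_fraction(fermi_dirac))              # 3, zero
```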

Before deriving the B-E and F-D distribution functions it is desir- 
able to introduce the following postulate which applies to all three of the 
statistics under consideration. 


The a priori probability of every physically distinct microscopic arrangement of N
particles among the allowed energy states is the same, provided the arrangement is
consistent with the principle of conservation of energy and any exclusion principle which
applies.


For the case of M-B statistics we have found that Eq. (20.7) gives the
thermodynamic probability of finding $n_1$ particles in the energy cell
characterized by energy $\epsilon_1$ and so forth. We turn now to finding the
corresponding probabilities for the B-E and F-D cases.


20.3 The Bose-Einstein Distribution 


Consider the number of ways in which n indistinguishable balls can be
apportioned among g boxes, allowing any number from 0 to n to be placed
in any one box. This problem can be solved by a simple artifice.
Imagine a linear array of g + n − 1 holes into which either black pegs (balls)
or white pegs (partitions) can be inserted. Let there be n black pegs
and g − 1 white pegs, which are just enough to provide g "boxes." Then
the possible distinguishable permutations of the pegs correspond to the
number of ways in which the n balls could be placed in g boxes. Below is
one way in which 10 balls could be put in 6 boxes, with 2 in the first, 3 in
the second, 0 in the third, 1 in the fourth, etc.

    ● ●  ○  ● ● ●  ○  ○  ●  ○  ● ● ●  ○  ●
    (2 | 3 | 0 | 1 | 3 | 1)

The number of permutations of g + n − 1 distinguishable pegs is
(g + n − 1)!, but neither the n black pegs nor the g − 1 white ones are
distinguishable, so the number of distinct arrangements is

$$\frac{(g+n-1)!}{n!\,(g-1)!}$$

Now we return to the energy diagram of Fig. 20.2 and ask for the 
most probable distribution of N indistinguishable particles among the 
energy Cells. The probability that n, particles have their energy arrow 
terminating in the first cell, nz in the second, etc., is given by 


Mow leat Sir | 1 


nıl(gı — 1)! mlg, = 1I)! 
, Z 1 oD (20.13) 
where 
~ (9 +n: — 1)! 
be wai Taibo i eee 
In O(n, .. . n) = ý [In (g; + n: — 1)! — In n:! — Ing — 1)!] 
— In Cs 


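Once again we apply Stirling's theorem and reimpose the lagrangian
conditions $\alpha[(\sum_i n_i) - N] = 0$ and $\beta[(\sum_i n_i \epsilon_i) - E] = 0$
to obtain

$$\ln \mathcal{P}(n_1, \ldots, n_s) = \sum_{i=1}^{s} \left[ (g_i + n_i - 1) \ln (g_i + n_i - 1) - (g_i + n_i - 1) - n_i \ln n_i + n_i - (g_i - 1) \ln (g_i - 1) + (g_i - 1) \right] - \ln C_B - \alpha \left[ \Big( \sum_{i=1}^{s} n_i \Big) - N \right] - \beta \left[ \Big( \sum_{i=1}^{s} n_i \epsilon_i \Big) - E \right] \tag{20.14}$$

Again for the overwhelmingly most probable distribution
$\partial(\ln \mathcal{P})/\partial n_i$ must be zero for each $n_i$, from which

$$\ln (g_i + n_i^* - 1) + 1 - 1 - \ln n_i^* - 1 + 1 - \alpha - \beta \epsilon_i = 0$$

and

$$g_i - 1 = n_i^* \left( e^{\alpha} e^{\beta \epsilon_i} - 1 \right)$$

Now if $g_i \gg 1$, we have

$$f_{BE}(\epsilon_i) = \frac{n_i^*}{g_i} = \frac{1}{e^{\alpha} e^{\beta \epsilon_i} - 1} = \frac{1}{e^{\alpha} e^{\epsilon_i/kT} - 1} \tag{20.15}$$

since again $\beta = 1/kT$.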

Comparison of experimental data with theoretical predictions reveals
that B-E statistics are obeyed by photons, pions, $\alpha$ particles, and in
general by all particles which have zero or integer spin; such particles
are called bosons. In quantum mechanics interchange of two bosons leads
to no change in the wave function; in other words, the wave function is
symmetric to the interchange of bosons.


20.4 The Fermi-Dirac Distribution Law 


The number of ways in which n indistinguishable particles can be dis- 
tributed among g energy states, subject to the exclusion condition that 
no more than one particle occupies any single state, is 


$$\frac{g(g - 1)(g - 2) \cdots (g - n + 1)}{n!} = \frac{g!}{n!\,(g - n)!}$$
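The three counting rules met so far (distinguishable particles, indistinguishable particles with unlimited occupancy, and indistinguishable particles with at most one per state) can be compared directly for a small system. This is an illustrative Python sketch, not from the original text; it uses the standard-library `itertools` generators to list the arrangements explicitly.

```python
from itertools import combinations, combinations_with_replacement, product

def count_arrangements(n, g):
    """Return (distinguishable, Bose-Einstein, Fermi-Dirac) counts for
    n particles distributed over g states."""
    states = range(g)
    distinguishable = sum(1 for _ in product(states, repeat=n))           # g**n
    bose_einstein = sum(1 for _ in combinations_with_replacement(states, n))
    fermi_dirac = sum(1 for _ in combinations(states, n))                 # g!/(n!(g-n)!)
    return distinguishable, bose_einstein, fermi_dirac

print(count_arrangements(2, 5))  # (25, 15, 10)
```

The exclusion condition always gives the smallest count, since it removes every arrangement with a doubly occupied state.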


The number of possible ways in which $n_1$ particles can be placed in
$g_1$ states, $n_2$ in $g_2$ states, etc., subject to the exclusion principle, is then
the product of $s$ terms like that above; and the probability of having this
situation is

$$\mathcal{P}(n_1, \ldots, n_s) = \frac{1}{C_F} \prod_{i=1}^{s} \frac{g_i!}{n_i!\,(g_i - n_i)!}$$

where

$$C_F = \sum_{\text{all values of } n_1 \ldots n_s} \; \prod_{i=1}^{s} \frac{g_i!}{n_i!\,(g_i - n_i)!}$$

and

$$\ln \mathcal{P}(n_1, \ldots, n_s) = \sum_{i=1}^{s} \left[ \ln g_i! - \ln n_i! - \ln (g_i - n_i)! \right] - \ln C_F$$

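Use of Stirling's theorem and the lagrangian multiplier equations leads
directly to

$$\ln \mathcal{P} = \sum_{i=1}^{s} \left[ g_i \ln g_i - g_i - (g_i - n_i) \ln (g_i - n_i) + (g_i - n_i) - n_i \ln n_i + n_i \right] - \ln C_F - \alpha \left[ \Big( \sum_{i=1}^{s} n_i \Big) - N \right] - \beta \left[ \Big( \sum_{i=1}^{s} n_i \epsilon_i \Big) - E \right]$$

from which

$$\left( \frac{\partial \ln \mathcal{P}}{\partial n_i} \right)_{n_i = n_i^*} = \ln (g_i - n_i^*) + 1 - 1 - \ln n_i^* - 1 + 1 - \alpha - \beta \epsilon_i = 0$$

or

$$g_i - n_i^* = n_i^* e^{\alpha} e^{\beta \epsilon_i}$$

and if we make use of $\beta = 1/kT$,

$$f_{FD}(\epsilon_i) = \frac{n_i^*}{g_i} = \frac{1}{e^{\alpha} e^{\epsilon_i/kT} + 1} \tag{20.16}$$

If we let $\alpha = -\epsilon_F/kT$, we obtain

$$f_{FD}(\epsilon_i) = \frac{n_i^*}{g_i} = \frac{1}{e^{(\epsilon_i - \epsilon_F)/kT} + 1} \tag{20.16a}$$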


where the quantity $\epsilon_F$ is called the Fermi energy.

Since $n_i^*/g_i$ is simply the fraction of the $g_i$ states which are filled,
it is clear that $f_{FD}(\epsilon_i)$ is the probability that a state of energy
$\epsilon_i$ is filled and that for a state with $\epsilon = \epsilon_F$ the probability
of occupancy is $\frac{1}{2}$. Throughout our discussion of solids in Chaps. 21
to 23, we shall find the Fermi energy to be a topic of major importance.
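A short numerical illustration of Eq. (20.16a) follows. It is a sketch only: the Fermi energy of 5 eV and the sample temperatures are hypothetical values chosen for illustration, and the Boltzmann constant is expressed in eV/°K for convenience.

```python
import math

K_EV = 8.617e-5  # Boltzmann constant in eV per kelvin

def fermi_dirac(eps, eps_f, temperature):
    """Eq. (20.16a): probability that a state of energy eps (eV) is filled."""
    x = (eps - eps_f) / (K_EV * temperature)
    if x > 700.0:          # avoid overflow in exp for states far above eps_f
        return 0.0
    return 1.0 / (math.exp(x) + 1.0)

EPS_F = 5.0  # eV, hypothetical Fermi energy used only for illustration

for t in (300.0, 1000.0, 5000.0):
    below = fermi_dirac(EPS_F - 0.1, EPS_F, t)
    at_ef = fermi_dirac(EPS_F, EPS_F, t)
    above = fermi_dirac(EPS_F + 0.1, EPS_F, t)
    print(f"T = {t:6.0f} K: f(eF - 0.1) = {below:.4f}, "
          f"f(eF) = {at_ef:.4f}, f(eF + 0.1) = {above:.4f}")
```

Whatever the temperature, the occupancy at $\epsilon = \epsilon_F$ is one-half; only the width of the transition region around $\epsilon_F$ changes.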

It is found that F-D statistics are followed by electrons, muons,
protons, neutrons, and, in general, by all particles of half-integer spin.
Particles which obey F-D statistics are called fermions. Exchange of
two fermions in the wave function $\psi$ for a system leads to a reversal of
the sign of $\psi$; thus the wave function for a system of fermions is
antisymmetric in the exchange of a pair of particles.


20.5 Comparison of the Distribution Laws 


In each of the distributions of Eqs. (20.12), (20.15), and (20.16) we have
the ratio $n_i^*/g_i$ given as a function of energy $\epsilon_i$. Here $n_i^*$ is the
expected number of particles, and $g_i$ the number of allowed energy states
in the $i$th energy range of Fig. 20.2. The ratio $n_i^*/g_i$ is called the
occupation index for the states of energy $\epsilon_i$. As we saw above, for F-D
statistics $f_{FD}(\epsilon_i) = n_i^*/g_i$ represents the probability that the state is
occupied. No such simple interpretation is possible for M-B and B-E
distributions because many nonexclusive particles can occupy the same
state.

The M-B distribution is plotted in Fig. 20.3 for three different
values of temperature and of $\alpha$. The M-B occupation index is a pure
exponential decreasing by a factor $1/e$ for every increase in $\epsilon$ by $kT$.
Although $f_{MB}(\epsilon)$ itself is a function of $T$ and $\alpha$ (where the latter
depends on temperature, the number of molecules, and the distribution of
possible energy levels), the ratio between $f_{MB}(\epsilon_1)$ and $f_{MB}(\epsilon_2)$ depends
only on the difference in the energies divided by $kT$ according to the
relation

$$\frac{f(\epsilon_1)}{f(\epsilon_2)} = e^{-(\epsilon_1 - \epsilon_2)/kT} = e^{-\Delta\epsilon/kT} \tag{20.17}$$


Fig. 20.3 The Maxwell-Boltzmann distribution function as a
function of $\epsilon$ (in eV) for three values of $T$ and $\alpha$: $T = 1000$°K,
$\alpha = -5$; $T = 5000$°K, $\alpha = -2.5$; $T = 10{,}000$°K, $\alpha = -1.5$.

Fig. 20.4 The Bose-Einstein occupation index for $\alpha = 0$ at three
temperatures.


The B-E distribution function is plotted in Fig. 20.4 for a gas of
photons. In this case $\alpha = 0$, because the number of photons within a
cavity is not constant; photons are being emitted and absorbed at the
walls of an isothermal cavity at all times. (The fact that $N$ is not
constant removes the restriction that $\alpha[(\sum n_i) - N] = 0$, through which
$\alpha$ was introduced. Removal of this condition is equivalent to setting
$\alpha = 0$.) For small energy $\epsilon$ the occupation index is larger for the B-E
distribution than for the M-B distribution, while at energies large
compared to $kT$ the B-E distribution approaches the M-B form.

In Fig. 20.5 the F-D distribution is shown for four values of $T$ and
$\alpha$. At $T = 0$°K the occupation index is 1 for all states with an energy
less than $\epsilon_F$ and 0 for all higher states. This comes about from the fact
that at 0°K the system is in the lowest energy state, which corresponds
to all particles in the lowest possible levels. As $T$ is raised, the
occupation index remains nearly 1 for energies well below the Fermi energy
and almost zero for energies far above. A number of particles are excited
to energy states a little above the Fermi level from states a little below
this level. As $T$ is raised to very high values, the Fermi energy decreases
somewhat (this energy is that for which the occupation index is $\frac{1}{2}$) and
the distribution $f_{FD}(\epsilon)$ gradually becomes more and more like the M-B
distribution.

Fig. 20.5 The Fermi-Dirac distribution function for four values of $T$
and $\alpha$: (1) 0°K, $\alpha = -\infty$; (2) 1000°K, $\alpha = -37$; (3) 5000°K, $\alpha = -7$;
(4) 10,000°K, $\alpha = -2.4$. At the Fermi energy $\epsilon_F$ the occupation index
is $\frac{1}{2}$.

Both the F-D and the B-E distributions approach the M-B distribution
when $e^{\alpha} e^{\epsilon/kT} \gg 1$. When this inequality is satisfied, Eq. (20.17)
is valid for both B-E and F-D distributions. In the sections which
follow we shall cite examples of the applications of M-B and B-E
statistics; numerous applications of F-D distributions appear in later
chapters.
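The approach of both quantum distributions to the M-B form can be seen numerically. The sketch below is illustrative only. It assumes the M-B occupation index is written as $1/(e^{\alpha} e^{\epsilon/kT})$, with the same sign convention for $\alpha$ as in Eqs. (20.15) and (20.16), and it uses a single value $\alpha = 0$ for all three laws purely to compare their shapes.

```python
import math

def occupation_index(x, alpha, kind):
    """n*/g as a function of x = eps/kT for 'MB', 'BE', or 'FD' statistics."""
    boltz = math.exp(alpha + x)                      # e^alpha * e^(eps/kT)
    shift = {"MB": 0.0, "BE": -1.0, "FD": 1.0}[kind]
    return 1.0 / (boltz + shift)

alpha = 0.0  # assumed common value, chosen only to compare the three shapes
for x in (0.5, 1.0, 2.0, 5.0, 10.0):
    mb = occupation_index(x, alpha, "MB")
    be = occupation_index(x, alpha, "BE")
    fd = occupation_index(x, alpha, "FD")
    print(f"eps/kT = {x:4.1f}: BE/MB = {be / mb:.5f}, FD/MB = {fd / mb:.5f}")
```

At small $\epsilon/kT$ the B-E index lies above the M-B value and the F-D index below it; both ratios tend to 1 as $e^{\alpha} e^{\epsilon/kT}$ grows.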


20.6 The Specific Heats of Gases 


In Sec. 4.3 we discussed the failure of the classical equipartition of energy
to explain the specific heats of gases. All major difficulties disappear
when the internal energy of the molecule is described in terms of the
quantum states that play such an important role in the theory of molecular
spectra. For a gas in equilibrium the occupation index of the various
levels is given by the M-B distribution law (20.12), according to which
the number of molecules in states with energy $\epsilon_i$ is given by

$$n_i = C g_i e^{-\epsilon_i/kT} \tag{20.18}$$

where $C$ is a constant such that $N = \sum n_i$. By eliminating $C$ we may write

$$n_i = \frac{N g_i e^{-\epsilon_i/kT}}{\sum_j g_j e^{-\epsilon_j/kT}} \tag{20.19}$$


These equations are applicable to the states of both atoms and
molecules. We consider first an example from the realm of atomic
physics. The sodium D lines result from transitions between the $3^2P_{3/2}$
and $3^2P_{1/2}$ levels and the $3^2S_{1/2}$ level (Sec. 17.3). Let us calculate the
fraction of the sodium atoms in a bunsen flame at 1800°C which are in the
$3^2P$ levels, remembering that the fine structure involves a very small
energy difference compared with the energy separation of $3^2P$ from $3^2S$,
which is $3.36 \times 10^{-19}$ J. The statistical weight of $3^2P$ is 6 (4 for
$P_{3/2}$ and 2 for $P_{1/2}$) while that for $3^2S_{1/2}$ is 2. If we let $n_0$ represent
the number of atoms in the ground state and $n_1$ the number in the $3^2P$
excited levels, we have for the ratio

$$\frac{n_1}{n_0} = \frac{6}{2} e^{-(\epsilon_1 - \epsilon_0)/kT} = 3 e^{-(3.36 \times 10^{-19})/(1.38 \times 10^{-23} \times 2073)} = 2.3 \times 10^{-5}$$

Thus only a small fraction of the sodium atoms at 2073°K are excited
thermally at any given time. They suffice, however, to cause a
considerable emission of sodium light.
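The arithmetic of this example is easy to reproduce. The following Python sketch uses only the numbers quoted above; the function name is illustrative.

```python
import math

K = 1.38e-23  # Boltzmann constant in J/K, the value used in the text

def population_ratio(g_upper, g_lower, delta_e, temperature):
    """Ratio n_upper/n_lower from Eq. (20.19) for two levels delta_e joules apart."""
    return (g_upper / g_lower) * math.exp(-delta_e / (K * temperature))

ratio = population_ratio(6, 2, 3.36e-19, 2073.0)
print(f"n1/n0 = {ratio:.1e}")  # of order 2e-5, as in the text
```

Because the energy difference enters through an exponential, the ratio is very sensitive to temperature: raising $T$ by a few hundred degrees changes it severalfold.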

In a gas the molecules are distributed statistically among the
quantum states according to Eq. (20.19), where $\epsilon$ refers to the internal
energy of the molecules. This internal energy may be associated with
(1) rotation, (2) vibration of the atoms, or (3) electronic excitation.
The total internal energy for 1 kmole containing $N_A$ molecules is

$$E_i = \sum_j n_j \epsilon_j = N_A \frac{\sum_j \epsilon_j g_j e^{-\epsilon_j/kT}}{\sum_j g_j e^{-\epsilon_j/kT}} \tag{20.20}$$

The translational kinetic energy $E_t$ is $N_A \times \frac{3}{2}kT$ [by Eq. (4.4)] or
$3RT/2$. The total energy of the molecules is $E = E_i + E_t$ and, since the
specific heat per kilomole $c_V$ is $(\partial E/\partial T)_V$, we have

$$c_V = \frac{3}{2} R + N_A \frac{d}{dT} \left( \frac{\sum_j \epsilon_j g_j e^{-\epsilon_j/kT}}{\sum_j g_j e^{-\epsilon_j/kT}} \right) \tag{20.21}$$


It is now possible, in principle at least, to take the values of $\epsilon_j$
derived from a study of band spectra and to calculate the specific heat
at constant volume from spectroscopic data by means of Eq. (20.21).
This has actually been done with complete success in a number of cases.
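How Eq. (20.21) might be evaluated from a table of levels is sketched below in Python. This is an illustration under stated assumptions, not the procedure used in the spectroscopic work cited: the level scheme is a hypothetical set of nondegenerate, equally spaced levels, the derivative is taken by a simple central difference, and the value of $N_A$ per kilomole is supplied here rather than taken from the text.

```python
import math

K = 1.38e-23    # J/K
N_A = 6.02e26   # molecules per kmole (assumed value)
R = N_A * K

def internal_energy(levels, temperature):
    """Eq. (20.20); levels is a list of (g_j, eps_j) pairs with eps_j in joules."""
    weights = [g * math.exp(-eps / (K * temperature)) for g, eps in levels]
    weighted = sum(w * eps for w, (_, eps) in zip(weights, levels))
    return N_A * weighted / sum(weights)

def c_v(levels, temperature, dT=0.01):
    """Eq. (20.21): 3R/2 plus a central-difference derivative of E_i."""
    dE = internal_energy(levels, temperature + dT) - internal_energy(levels, temperature - dT)
    return 1.5 * R + dE / (2.0 * dT)

# Hypothetical level scheme: equally spaced levels 1.11e-20 J apart, weight 1 each.
spacing = 1.11e-20
levels = [(1, v * spacing) for v in range(40)]
print(f"c_V / R = {c_v(levels, 288.0) / R:.3f}")
```

With real spectroscopic data the list `levels` would hold the measured rotational, vibrational, and electronic energies together with their statistical weights.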

Equation (20.21) predicts the variation of specific heat with temperature.
When $T$ is very small, $c_V = 1.5R$, the classical value for a monatomic
gas. As the temperature rises, polyatomic molecules pass by thermal
agitation into higher quantum states, beginning with the rotational
levels associated with the lowest vibrational and electronic states. In
general, the second rotational level lies from 0.0025 to 0.0075 eV above
the first, in comparison with $kT = 0.025$ eV at room temperature (17°C),
whereas the first step upward in vibration requires 0.025 to 0.25 eV and
the first electronic excitation requires at least 2 eV. It is clear,
therefore, that at ordinary temperatures the molecules are widely
distributed among the first set of rotational levels. Under such conditions
the average rotational energy per molecule approximates the classical
value, which is $kT$ for a diatomic molecule or $\frac{3}{2}kT$ for a molecule
composed of three or more noncollinear atoms. Thus we understand the
approximate success of classical theory in dealing with the simpler types
of molecules.

At higher temperatures, vibrational energy contributes appreciably
to $c_V$, eventually to the extent of $kT$ for each vibrational level; at
temperatures well above 10,000°C contributions may be expected from the
higher electronic levels as well.


Comparison with Observation: Data on the specific heat of hydrogen,
plotted as circles in Fig. 20.6, are in good agreement with the
theoretical predictions. Below 60°K, $c_V = 1.5R$, whereas from 300 to 500°K,
$c_V \approx 2.5R$ very closely, but from 600°K upward vibration plays a part.
The complete theory of a molecule composed of two similar atoms is
more complicated than as here described. In the case of hydrogen it has
been possible to verify experimentally the remarkable conclusions that
follow from the theory outlined briefly in Sec. 20.9.

In further illustration of the use of quantum theory, we consider
chlorine. Whereas the second vibrational state ($v = 1$) for gases such
as N$_2$ or H$_2$ lies far above the first ($v = 0$), for chlorine it lies only
560 cm$^{-1}$ above the first, as may be inferred from a study of the
electronic band system that extends in the absorption spectrum from 4800
to 5800 Å.

The difference in energy between these two states is thus

$$\epsilon_1 - \epsilon_0 = h\nu = 6.62 \times 10^{-34} \times 560 \times 3 \times 10^{10} = 1.11 \times 10^{-20} \text{ J}$$

At $T = 288$°K the ratio of the Boltzmann factors for the two states is
$e^{-(\epsilon_1 - \epsilon_0)/kT} = e^{-2.794} = 0.0611$. Thus, 0.061 times as many molecules
will be in the second vibrational state as in the first. The additional
energy due to this vibration in 1 kmole of Cl$_2$ is
$N_A(\epsilon_1 - \epsilon_0)e^{-(\epsilon_1 - \epsilon_0)/kT}$. The contribution of this energy to $c_V$ is

$$N_A \frac{(\epsilon_1 - \epsilon_0)^2}{kT^2} e^{-(\epsilon_1 - \epsilon_0)/kT} = R \left( \frac{\epsilon_1 - \epsilon_0}{kT} \right)^2 e^{-(\epsilon_1 - \epsilon_0)/kT} = 0.48R$$

Thus $c_V = (2.5 + 0.48)R = 2.98R$, in fair agreement with the observed
value, $3.02R$. A more exact calculation using Eq. (20.21) gives

$$c_V = 3.06R$$

Fig. 20.6 Specific heat of hydrogen at constant volume as a function of
absolute temperature.


When a similar calculation is made for HCl, whose vibration-rotation
band at 3.4 $\mu$ or 2886 cm$^{-1}$ (Sec. 19.8) indicates that

$$\epsilon_1 - \epsilon_0 = 5.73 \times 10^{-20} \text{ J}$$

the ratio of the Boltzmann factors is found to be only $5 \times 10^{-7}$. Thus
molecular vibration can make no appreciable contribution to the specific
heat of HCl. For O$_2$, $\epsilon_1 - \epsilon_0 = 1556$ cm$^{-1}$, and vibration may
contribute about $0.026R$ to $c_V$. Other variations in $c_V$ may be due to
quantum corrections on the rotational energy.
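The two-level estimates for Cl$_2$, HCl, and O$_2$ can be reproduced with a few lines of Python. The sketch below uses the constants quoted in the text and the same approximation (only the first excited vibrational state is counted); the function name is illustrative.

```python
import math

H = 6.62e-34   # Planck constant in J s, the value used in the text
C_CM = 3e10    # speed of light in cm/s
K = 1.38e-23   # Boltzmann constant in J/K

def two_level_vibration(wavenumber_cm, temperature):
    """Return (Boltzmann ratio, vibrational contribution to c_V in units of R)
    for a first vibrational step given in cm^-1."""
    x = H * C_CM * wavenumber_cm / (K * temperature)   # (eps1 - eps0)/kT
    ratio = math.exp(-x)
    return ratio, x * x * ratio

for name, wavenumber in [("Cl2", 560.0), ("HCl", 2886.0), ("O2", 1556.0)]:
    ratio, cv_over_r = two_level_vibration(wavenumber, 288.0)
    print(f"{name}: ratio = {ratio:.2e}, vibrational c_V = {cv_over_r:.3f} R")
```

The steep dependence on wavenumber explains why vibration matters for chlorine at room temperature but is negligible for HCl.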


20.7 Specific Heats of Solids 


Classical equipartition of energy led (Sec. 4.3) to the prediction that the
specific heat per kilomole of solid elements should be $3R$, a result verified
by Dulong and Petit for many elements at high temperatures. However,
the prediction failed completely in the low-temperature domain
(Fig. 4.1). A long step toward the explanation of the temperature
variation was taken by Einstein in 1907. He treated each atom as a
three-dimensional harmonic oscillator of single frequency $\nu$ behaving
independently of its neighbors. Einstein replaced the classical value $kT$ for
the mean energy of an oscillator by the value derived by Planck, Eq.
(5.18). Multiplying this mean energy $\bar{\epsilon} = h\nu/(e^{h\nu/kT} - 1)$ by Avogadro's
number $N_A$ and by 3, we have for the vibrational energy of 1 kmole

$$E_v = \frac{3 N_A h\nu}{e^{h\nu/kT} - 1}$$

and for the specific heat per kilomole

$$c_V = \frac{dE_v}{dT} = 3R \left[ \frac{e^{h\nu/kT}}{(e^{h\nu/kT} - 1)^2} \left( \frac{h\nu}{kT} \right)^2 \right] \tag{20.22}$$

At sufficiently high temperatures this expression reduces to the classical
value, $3R$; at low temperatures the exponential becomes large and $c_V$
decreases rapidly. The formula was found to fit the data fairly well,
as is illustrated by the lower curve in Fig. 20.7. At very low temperatures,
however, the roughly exponential shape of the curve does not agree
with experimental points. Careful measurements show that $c_V$ varies as
$T^3$ near 0°K.
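The limiting behavior of Eq. (20.22) can be checked numerically. In this Python sketch the quantity `theta_e` stands for $h\nu/k$, a characteristic temperature introduced here only for convenience, and the value 300°K is hypothetical. The formula is rearranged in terms of $e^{-x}$ so that it does not overflow at low temperatures.

```python
import math

def einstein_cv_over_r(temperature, theta_e):
    """Eq. (20.22) in units of R, with theta_e = h*nu/k (an assumed shorthand)."""
    x = theta_e / temperature
    # x^2 e^x / (e^x - 1)^2 rewritten as x^2 e^(-x) / (1 - e^(-x))^2
    return 3.0 * x * x * math.exp(-x) / math.expm1(-x) ** 2

THETA_E = 300.0  # K, hypothetical value for illustration
for t in (15.0, 60.0, 150.0, 300.0, 3000.0):
    print(f"T = {t:6.0f} K: c_V = {einstein_cv_over_r(t, THETA_E):.5f} R")
```

The output approaches $3R$ at high temperature and falls off roughly exponentially at low temperature, faster than the observed $T^3$ dependence.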

An improved approach was introduced in 1912 by Debye, who
took into account the fact that the displacement of any one atom is
intimately correlated with the displacement of its neighbors at any
instant. Each atom exchanges energy with its neighbors by means of
elastic waves propagated through the crystal. These traveling waves
can be treated as particles known as phonons. Phonon energies are
quantized in units of $h\nu$, just as photon energies are. Thus the
vibrational energy of the crystal can be associated with phonons traveling
through the lattice.

Any solid is capable of vibrating elastically in many modes of
different frequency. Debye calculated the number of independent modes
of vibration in essentially the same manner in which standing waves
of an electromagnetic field within an enclosure were treated in Sec. 5.3.
The principal differences are that (1) the modes of vibration in a solid
are limited in number and (2) there are two kinds of elastic
vibrations. There are transverse vibrations with two independent
polarizations, in analogy with those for light; the number $dn_t$ of
these modes per unit volume of the solid, given by Eq. (5.9), is
$dn_t = g(\nu)\,d\nu = (8\pi\nu^2\,d\nu)/v_t^3$, where $v_t$ is the speed of transverse
waves. Then there are longitudinal, or compressional, waves of a single
type traveling with speed $v_l$ and having a density
$dn_l = (4\pi\nu^2\,d\nu)/v_l^3$.

Fig. 20.7 Comparison of the specific-heat formulas of Einstein and Debye
with experimental data for aluminum. Upper curve: Debye's formula; lower
curve: Einstein's formula; circles: observed points. Axes: $c_V$ in
kcal/(kmol)(°K) against absolute temperature in °K.

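Now let us assume that the energy associated with each of these
modes of vibration has the average quantum value as stated in Eq. (5.18).
Integrating over all frequencies from the lowest, sensibly zero, to the
highest possible $\nu_m$ and multiplying by the volume $V_0$ occupied by
1 kmole, we find for the thermal energy of 1 kmole

$$E_v = 4\pi V_0 \left( \frac{2}{v_t^3} + \frac{1}{v_l^3} \right) \int_0^{\nu_m} \frac{h\nu^3\,d\nu}{e^{h\nu/kT} - 1} \tag{20.23}$$

The true value of $\nu_m$ is difficult to determine; hence Debye used an
approximate value fixed by requiring that the total number of modes equal
$3N_A$, the number of degrees of freedom of the $N_A$ atoms:

$$3N_A = 4\pi V_0 \left( \frac{2}{v_t^3} + \frac{1}{v_l^3} \right) \int_0^{\nu_m} \nu^2\,d\nu = \frac{4}{3}\pi V_0 \left( \frac{2}{v_t^3} + \frac{1}{v_l^3} \right) \nu_m^3 \tag{20.24}$$

By means of this result, Eq. (20.23) can be written

$$E_v = \frac{9N_A}{\nu_m^3} \int_0^{\nu_m} \frac{h\nu^3\,d\nu}{e^{h\nu/kT} - 1} \tag{20.25}$$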


The qualitative features of the formula are better seen if the
variable of integration is changed from $\nu$ to the dimensionless variable $x$,
where $x = h\nu/kT$. Therefore

$$\nu = \frac{kT}{h} x, \qquad d\nu = \frac{kT}{h}\,dx$$

It is convenient also to introduce a characteristic Debye temperature
$\Theta$ defined as $\Theta = h\nu_m/k$, so that $\nu_m = k\Theta/h$. With these substitutions,
Eq. (20.25) becomes

$$E_v = 9RT \left( \frac{T}{\Theta} \right)^3 \int_0^{\Theta/T} \frac{x^3\,dx}{e^x - 1} \tag{20.26}$$

and the specific heat is

$$c_V = \frac{dE_v}{dT} = 9R \left[ 4 \left( \frac{T}{\Theta} \right)^3 \int_0^{\Theta/T} \frac{x^3\,dx}{e^x - 1} - \frac{\Theta}{T} \frac{1}{e^{\Theta/T} - 1} \right] \tag{20.27}$$

At high temperatures, $c_V$ as given by Eq. (20.27) approaches the
classical value $3R$. At very low temperatures, on the other hand, the
exponentials become large, so that the last term can be neglected; the
upper limit in the integral can also be replaced by $\infty$ with little error.
It is known that

$$\int_0^{\infty} \frac{x^3\,dx}{e^x - 1} = \frac{\pi^4}{15}$$

Hence Eq. (20.27) becomes, approximately,

$$c_V = \frac{12}{5} \pi^4 R \left( \frac{T}{\Theta} \right)^3 = 234R \left( \frac{T}{\Theta} \right)^3 \tag{20.28}$$

Thus, at temperatures much below the Debye temperature $\Theta$, the
atomic heat of a simple solid is proportional to the cube of the absolute
temperature as confirmed by experiment.
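Both limits of Eq. (20.27) can be verified by evaluating the integral numerically. The Python sketch below uses a plain Simpson's rule so that it needs no external library; the Debye temperature of 300°K is a hypothetical value, and the step count is an arbitrary choice.

```python
import math

def debye_integral(u, steps=2000):
    """Simpson's-rule estimate of the integral of x^3/(e^x - 1) from 0 to u.
    steps must be even."""
    def f(x):
        return 0.0 if x == 0.0 else x ** 3 / math.expm1(x)
    h = u / steps
    total = f(0.0) + f(u)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3.0

def debye_cv_over_r(temperature, theta):
    """Eq. (20.27) in units of R."""
    u = theta / temperature
    return 9.0 * (4.0 * debye_integral(u) / u ** 3 - u / math.expm1(u))

THETA = 300.0  # K, hypothetical Debye temperature
print(debye_cv_over_r(3000.0, THETA))        # close to 3
low = debye_cv_over_r(6.0, THETA)
print(low, 234.0 * (6.0 / THETA) ** 3)       # the two values nearly agree
```

For intermediate temperatures the same function gives the full Debye curve, which is the numerical evaluation referred to in the text that follows.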

Variation of the specific heat as $T^3$ implies a finite value of the
entropy $\int (c_V\,dT)/T$ at absolute zero, an example of the Nernst heat
theorem or the third law of thermodynamics. According to classical
theory the entropy of a solid at 0°K would be negatively infinite.

For intermediate temperatures, it is necessary to evaluate the
integral in (20.26) numerically. Good agreement with observation has
been found for many substances when $\Theta$ is chosen for best fit with the
data and even when $\Theta$ is calculated from the elastic constants with the
use of Eq. (20.24). The data for many crystalline compounds can be
fitted by adding one or more terms of the Einstein type to represent,
in effect, internal vibrations of the atoms within each crystal cell.


20.8 The Photon Gas 


Photons follow the B-E distribution law, and by applying the B-E
distribution to blackbody radiation we are led directly to the Planck
radiation law. As we have seen in Sec. 20.5, $\alpha$ is zero for a gas of photons,
and so we have by Eq. (20.15)

$$\frac{n_i^*}{g_i} = \frac{1}{e^{\epsilon_i/kT} - 1} \tag{20.15a}$$


In Sec. 5.3 we have followed Rayleigh in finding the number of degrees
of freedom per unit volume for standing waves in an enclosure with
frequency between $\nu$ and $\nu + d\nu$ to be $(8\pi\nu^2\,d\nu)/c^3$, Eq. (5.9). Since
the energy of a photon is given by $\epsilon = h\nu$, we have for the number of
allowed energies per unit volume between $\epsilon_i$ and $\epsilon_i + d\epsilon_i$ the result that

$$g_i = \frac{8\pi \epsilon_i^2\,d\epsilon_i}{h^3 c^3} \tag{20.29}$$

Consequently the most probable number of photons per unit volume
with energy between $\epsilon_i$ and $\epsilon_i + d\epsilon_i$ is

$$n_i^* = \frac{8\pi \epsilon_i^2\,d\epsilon_i}{h^3 c^3 (e^{\epsilon_i/kT} - 1)} \tag{20.30}$$

Each photon has energy $h\nu$, and so the energy per unit volume
associated with frequencies between $\nu$ and $\nu + d\nu$ is given by

$$U_\nu\,d\nu = n_i^* h\nu_i = \frac{8\pi \epsilon_i^3\,d\epsilon_i}{h^3 c^3 (e^{\epsilon_i/kT} - 1)} = \frac{8\pi h \nu^3\,d\nu}{c^3 (e^{h\nu/kT} - 1)} \tag{20.31}$$

This is precisely Eq. (5.21) giving the Planck radiation law in terms
of frequency.
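As a numerical illustration of Eq. (20.31), the Python sketch below evaluates the Planck energy density and compares it with the classical expression obtained by assigning $kT$ to each of the $(8\pi\nu^2\,d\nu)/c^3$ modes. The constants are standard SI values supplied here, and the sample frequencies and temperature are arbitrary.

```python
import math

H = 6.626e-34   # Planck constant, J s
K = 1.381e-23   # Boltzmann constant, J/K
C = 2.998e8     # speed of light, m/s

def planck_energy_density(nu, temperature):
    """Eq. (20.31): energy per unit volume per unit frequency, J s / m^3."""
    return 8.0 * math.pi * H * nu ** 3 / (C ** 3 * math.expm1(H * nu / (K * temperature)))

def classical_energy_density(nu, temperature):
    """kT per mode times the mode density 8*pi*nu^2/c^3 (classical limit)."""
    return 8.0 * math.pi * nu ** 2 * K * temperature / C ** 3

T = 1000.0
for nu in (1e9, 1e12, 1e13, 1e14):
    ratio = planck_energy_density(nu, T) / classical_energy_density(nu, T)
    print(f"nu = {nu:.0e} Hz: Planck/classical = {ratio:.3e}")
```

The ratio is essentially 1 when $h\nu \ll kT$ and falls rapidly once $h\nu$ exceeds $kT$, which is the quantum suppression of the high-frequency modes.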


20.9 Homonuclear Molecules


Molecules that contain at least two similar nuclei have certain peculiar
and sometimes astonishing properties. Thus, it was early observed
that in certain bands from such molecules alternate lines were weaker
or perhaps missing entirely. After the advent of wave mechanics, it
was shown by Hund that this phenomenon could be explained as an
effect of nuclear spin. The effect is essentially statistical in nature,
and in discussing it no account need be taken of the nuclear magnetic
moments or of other minor forms of nuclear interaction, which serve
only to produce, as in atomic spectra, a minute hyperfine structure.
Diatomic molecules only will be considered here.

a. Ortho and Para States The wave function $\psi$ for a molecule must
contain as variables the coordinates of the nuclei and, in general,
an allowance for nuclear spin. The spin numbers $I$ and $M_I$ were
described in Sec. 18.8. All the known facts are consistent with the
assumption that $\psi$ is always symmetric in the coordinates and spins of
any two similar nuclei for which $I$ is integral or zero (the B-E type),
whereas if $I$ is half-integral (F-D type) the wave function is
antisymmetric. That is, interchange of the spatial coordinates and of the
values of $M_I$ for any two similar nuclei leaves $\psi$ unchanged if $I$ is
integral or zero but reverses the sign of $\psi$, as in the case of electrons,
if $I$ is half-integral.

As with electrons, so long as spin energies can be ignored, spatial
symmetry and spin symmetry can be treated separately. States for
a pair of similar nuclei that are symmetric in the nuclear spins have
been called ortho states, those antisymmetric in the spins, para states.
(Ortho means right or proper, i.e., ordinary; para, alongside of, i.e., a
variant, the less common variety.) Since $M_I$ has $2I + 1$ possible
different values, there are, for two similar nuclei, $(2I + 1)^2$ possible
spin states in all. Of these, the states in which $M_I$ has the same value
for both nuclei are symmetric in the nuclear spins; there are $2I + 1$
such states. Then, each combination of two different values of $M_I$
yields both a symmetric and an antisymmetric state, in analogy with
the $\alpha\beta$ combination for two electrons as described in Appendix 15A.
There are $I(2I + 1)$ such possible combinations. The number $N_p$
of antisymmetric, or para, states will thus have the value $I(2I + 1)$,
whereas the total number of symmetric, or ortho, states is

$$N_o = 2I + 1 + I(2I + 1) = (2I + 1)(I + 1)$$

Thus we have the statistical ratio

$$\frac{N_p}{N_o} = \frac{I}{I + 1} \tag{20.32}$$

The ortho states have always the greater statistical weight; if $I = 0$,
the para states are missing altogether.
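The counting behind Eq. (20.32) can be carried out explicitly by listing the possible pairs of $M_I$ values. The following Python sketch is illustrative; it uses exact fractions so that half-integral spins are handled without rounding.

```python
from fractions import Fraction
from itertools import product

def count_spin_states(spin):
    """Return (N_ortho, N_para) for two similar nuclei of spin I."""
    I = Fraction(spin)
    m_values = [-I + k for k in range(int(2 * I + 1))]     # M_I = -I, ..., +I
    same = sum(1 for a, b in product(m_values, repeat=2) if a == b)
    distinct_pairs = sum(1 for a, b in product(m_values, repeat=2) if a < b)
    # Equal M_I values give one symmetric state each; every unordered pair of
    # different values gives one symmetric and one antisymmetric state.
    return same + distinct_pairs, distinct_pairs

for spin in (0, Fraction(1, 2), 1, Fraction(3, 2)):
    n_ortho, n_para = count_spin_states(spin)
    print(f"I = {spin}: N_o = {n_ortho}, N_p = {n_para}")
```

For $I = \frac{1}{2}$ this gives 3 ortho states and 1 para state, and for $I = 1$ it gives 6 and 3, in agreement with $N_p/N_o = I/(I + 1)$.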

The spatial symmetry of $\psi$ relative to nuclear exchange, i.e., for
exchange of only the spatial coordinates of the particles, must then be
considered. Sometimes this is referred to simply as the symmetry of
$\psi$. A wave function will have the required grand symmetry when $I$ is
integral provided the spin factor by itself is symmetric and the spatial
factor is also symmetric, i.e., remains unaltered when the coordinates of
the nuclei are interchanged; or, both factors may be antisymmetric.
Thus, for integral or zero $I$, the ortho states have spatially symmetric
wave functions, whereas the para states have spatially antisymmetric
functions. Conversely, for half-integral $I$, the ortho states are those
with spatially antisymmetric wave functions, while the para states are
symmetrical in the spatial coordinates.

No general rules can be given for predicting with certainty the
spatial symmetry of $\psi$. Usually a good approximate form for $\psi$, with
nuclear spin omitted, is

$$\psi = \psi_e \psi_v \psi_r$$

where $\psi_e$, $\psi_v$, and $\psi_r$ stand for electronic, vibrational, and rotational
factors corresponding to the division of the molecular energy into
three parts as described in Secs. 19.8 and 19.9. The vibrational factor
$\psi_v$ is always symmetric for nuclear exchange, essentially because a
vibration of similar nuclei looks the same seen from either end. The
rotational factor $\psi_r$ is even for even values of $J$ and odd for odd $J$.
But $\psi_e$, although often symmetric in states with $\Lambda = 0$, may be either
symmetric or antisymmetric for nuclear exchange. Thus the ortho
states are the even-$J$ rotational states in some cases but the odd-$J$
states in others.

An important selection rule, $\Delta I = 0$, exists in this connection,
paralleling that for the different $S$ states of atoms. If the nuclei had
no spin-dependent form of interaction in addition to their coulomb
fields, the ortho and para states would never combine at all; a molecule
in one type of state would remain in states of that type forever.
Conversion from ortho to para or vice versa can take place, as a result of
slight noncoulomb nuclear forces, but this process is very slow,
requiring perhaps months or years.


b. Effects on Band Spectra The selection rule $\Delta I = 0$ effectively
limits spectral transitions to those between two ortho states or between
two para states. Since a certain type of spatial symmetry accompanies
each type of spin state, dependent upon the parity of $J$, it follows
also that the spatial symmetry of $\psi$ for nuclear exchange cannot change
in a radiative transition. These restrictions result in two major
peculiarities in the bands from homonuclear diatomic molecules.

In the first place, since in a purely rotational or vibration-rotational
transition the electronic state of the molecule does not change,
to preserve the symmetry of $\psi$ the spatial symmetry (for exchange) of
the rotational factor $\psi_r$ cannot change. Thus transitions of this kind
can occur only between two even values of $J$ or between two odd
values. But for such bands there is also a spectral selection rule that
$\Delta J = \pm 1$. It follows that so far as dipole radiation is concerned,
homonuclear diatomic molecules cannot have any purely rotational or
vibration-rotational spectra at all! As was noted at the end of Sec. 19.8,
the absence of such spectra may also be regarded as due to the absence
of a molecular electric moment in such molecules. In Raman spectra,
on the other hand, frequency shifts corresponding to such transitions
may occur, since for Raman lines the usual selection rule is that

$$\Delta J = \pm 2$$


The second major consequence of exchange symmetry concerns
the relative intensity of the lines composing a given band. Ordinarily,
some of the molecules will be in ortho states while others are in para
states, and in one of these states transitions will occur only between
even values of $J$ while in the other they occur only between odd values.
The relative intensity of the even-$J$ and odd-$J$ lines will thus depend
upon the relative population of the ortho and para states. If a
substance has been for a long time at a given temperature, it may be
assumed that all fundamental states will be populated in proportion
to a Boltzmann factor $e^{-\epsilon/kT}$. Thus the intensity of a spectral line
as observed will be proportional to (1) the Boltzmann factor for the
initial state, (2) the transition probability, and (3) the nuclear
statistical weight of the initial state, which is proportional, for successive
values of $J$, alternately to $I$ or to $I + 1$. Since the first two factors
do not usually vary rapidly from line to line, the observed lines
alternate in intensity. If quantitative allowance can be made from theory
for the first two factors, the magnitude of $I$ for the nucleus can be
inferred with certainty from measurements of the relative intensities
of the lines. Furthermore, if $I = 0$, there are no para states and no
para lines at all, so that either the even-$J$ or the odd-$J$ lines are missing
altogether.

Many examples of these phenomena are known, and they all
agree with the theoretical predictions provided the right value of $I$
is assumed. Such observations are regarded as a reliable source of
information concerning $I$. Thus, O$_2$ has no rotation or
vibration-rotation bands, and in its Raman spectra the odd-$J$ lines are missing.
It is concluded that $I = 0$ for the O nucleus.

Perhaps the most famous case is that of N$_2$. A photograph
showing alternating intensities in a Raman spectrum from nitrogen
is shown in Fig. 20.8. The spectrum is a rotational one excited by
the mercury line at 2536.5 Å, photographed by F. Rasetti. Careful
measurements had also been made much earlier by Ornstein and van
Wijk on four electronic bands of nitrogen (at 3984, 4278, 3884, and
4237 Å). After correcting for the Boltzmann factor and the transition
probabilities, they found a ratio of statistical weights for even-$J$ states
to odd-$J$ states very close to $\frac{1}{2}$. Equating $\frac{1}{2}$ to $I/(I + 1)$, we find
$I = 1$ for the nitrogen nucleus.

Fig. 20.8 Alternating intensities in a rotational Raman spectrum of N$_2$. (From
G. Herzberg, "Molecular Spectra and Molecular Structure," vol. 1, "Spectra of
Diatomic Molecules," 2d ed., D. Van Nostrand Company, Inc., 1950.)

These observations were published in 1928, and at that time a
value $I = 1$ for nitrogen was "extraordinarily surprising." It was
supposed then that the nitrogen nucleus must consist of 14 protons
and 7 electrons and that an odd number of fundamental particles
should require a half-integral value of $I$. The difficulty was resolved
after the discovery of the neutron had led to the view that the
nitrogen nucleus is composed of 7 protons and 7 neutrons.


c. Effects on Other Physical Properties The different statistical
weights of the ortho and para states must be taken account of in dealing
with any physical property, such as specific heat, that depends upon
the distribution of the molecules among their quantum states. The
question may also be raised as to the possibility of a mass separation
of molecules in the two states.

These features have been studied extensively in hydrogen. This
gas is a mixture of orthohydrogen with molecules rotating in odd-$J$
states and parahydrogen in even-$J$ states, the latter including the state
of no rotation with $J = 0$. Here $I = \frac{1}{2}$ for the protons, and the
electronic state is symmetric for nuclear exchange; hence odd $J$'s must
go with the ortho spin. If all fundamental $J$, $M_I$ states occurred with
equal frequency, the ratio of para to ortho would be $I/(I + 1) = \frac{1}{3}$.
At room temperature, it is calculated that ordinary hydrogen should
be about 25.1 percent para. In ordinary experimentation there is
not time for appreciable change to occur in the para-ortho ratio;
nevertheless, the difference in statistical weights should have some
effect on the observed specific heat at low temperatures because of
the unequal representation of the even-$J$ and odd-$J$ rotational states.
A more remarkable fact is that it was found possible to accelerate
the rate of interconversion between the two forms by adsorbing the
gas onto charcoal, so that at 20°K it becomes almost pure
parahydrogen, in the state $J = 0$, within a few hours. In this way the
special properties of parahydrogen could be studied, especially its
specific heat as a function of temperature, and agreement with
theoretical predictions was found. The properties of pure orthohydrogen
were then inferred by comparison with those of the normal mixture
(Fig. 20.9). Orthodeuterium (H$^2$) has also been prepared; here
$I = 1$, $N_p/N_o = \frac{1}{2}$, and it is the ortho states that have even values
of $J$.

Fig. 20.9 Experimental values of the specific heat at constant
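The equilibrium para fraction quoted above can be estimated with a short calculation. The Python sketch below is illustrative and rests on assumptions not stated in the text: the molecule is treated as a rigid rotor with levels $\epsilon_J = J(J + 1)k\Theta_r$, and the rotational temperature $\Theta_r = 85.4$°K is an assumed value for H$_2$. Even-$J$ states carry nuclear weight 1 (para) and odd-$J$ states weight 3 (ortho), as follows from $I = \frac{1}{2}$.

```python
import math

THETA_ROT = 85.4  # K, assumed rigid-rotor rotational temperature of H2

def para_fraction_h2(temperature, j_max=20):
    """Equilibrium fraction of parahydrogen at the given temperature."""
    para = 0.0
    ortho = 0.0
    for J in range(j_max + 1):
        term = (2 * J + 1) * math.exp(-J * (J + 1) * THETA_ROT / temperature)
        if J % 2 == 0:
            para += term          # nuclear spin weight 1
        else:
            ortho += 3.0 * term   # nuclear spin weight 3
    return para / (para + ortho)

for t in (20.0, 77.0, 293.0):
    print(f"T = {t:5.0f} K: para fraction = {para_fraction_h2(t):.3f}")
```

Under these assumptions the para fraction is close to one-quarter at room temperature and close to unity at 20°K, consistent with the figures given in the text.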


Problems 


1. (a) In how many ways can four persons occupy seven seats in a 
room? 

(b) A bridal party consists of six persons. In how many ways 
can a photographer arrange them in a straight line if the bride and groom 
always occupy the middle two places? 

Ans: (a) 840; (b) 48 
2. (a) In how many ways can a committee of 5 persons be selected 
from a department of 11 members? 

(b) A city council has 15 members, of which 9 belong to party A
and 6 to party B. In how many ways can a 7-man committee be selected
if the majority party must have 4 representatives?

Ans: (a) 462; (b) 2520
3. A manufacturer of blocks for children makes initially identical 
wooden cubes and then paints each face one of six bright colors. Cal- 
culate how many distinguishable blocks can be produced in this way. 
Ans: 30 
4. Find the number of ways in which two particles can be distributed 
in five states if (a) the particles are distinguishable, (b) the particles are 
indistinguishable and obey Bose-Einstein statistics, and (c) the particles 
are indistinguishable and only one particle can occupy any one state. 
Show the possibilities in a diagram for each case. 


Ans: 25, 15, 10 
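The three counts in Problem 4 can be reproduced by brute-force enumeration. The sketch below assumes the five states are simply labeled 0 to 4 and uses the standard `itertools` module; it is meant as a check on the counting, not as a substitute for the requested diagrams.

```python
from itertools import combinations, combinations_with_replacement, product

states = range(5)

# (a) Distinguishable particles: ordered pairs (state of particle 1, state of particle 2).
distinguishable = list(product(states, repeat=2))

# (b) Indistinguishable, Bose-Einstein: unordered pairs, double occupancy allowed.
bose_einstein = list(combinations_with_replacement(states, 2))

# (c) Indistinguishable, at most one particle per state: unordered pairs of different states.
exclusive = list(combinations(states, 2))

print(len(distinguishable), len(bose_einstein), len(exclusive))  # 25 15 10
```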


5. Show that the Einstein relation for the heat capacity per kilomole 
of a solid reduces to the classical value of 3R when kT ≫ hν.


6. Assume that there exists an infinity of energy levels equally spaced 
1 eV apart starting at 0. Assume further that there are three particles 
which share a total energy of 6 eV. Finally, assume that all possible 
arrangements are equally likely and that the system passes through all
possible arrangements, spending an equal time in each. 

(a) If the particles are indistinguishable and exclusive (one per 
level), show all possible arrangements of particles in levels. What is the
probability of finding a particle in level 0? in level 2? in level 4? in level 6?

(b) If the particles are indistinguishable (but not exclusive), show 
all possible arrangements. From your sketch determine (i) the probability
of finding all three particles in the same level, (ii) the probability
of finding two particles in the same level, and (iii) the relative popula- 
tions of the various levels. 

(c) Repeat (b) for the case of distinguishable particles. 
7. (a) Calculate the probability that an allowed state is occupied if it lies 
above the Fermi level by kT, by 2kT, by 4kT, by 10kT, and by 100kT. 

(b) Prove that for a system obeying Fermi-Dirac statistics the 
probability that a level lying AE below the Fermi level is not occupied is 
the same as the probability that a level AE above the Fermi level is 
occupied. 
8. The dark absorption lines in the solar spectrum which were called 
C and F by Fraunhofer correspond to the first two members of the Balmer 
series of hydrogen. If the surface temperature of the sun is 6000°K, 
find the ratio of the number of hydrogen atoms in states with n = 2 to 
the number in the ground state for hydrogen at the solar surface. 
9. The Debye temperature for carbon crystallized as diamond is
2230°K. Calculate the heat capacity per kilomole for diamond at 20°K.
Compute the highest lattice frequency ν_m involved in the Debye theory.

Ans: 1.4 J/kmole-°K, 4.7 × 10¹³ Hz
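The answers to Problem 9 can be checked numerically. The sketch below assumes the low-temperature Debye T³ law, C = (12π⁴/5)R(T/Θ)³, and the relation kΘ = hν_m for the cutoff frequency; the function names and rounded constants are illustrative choices made here.

```python
import math

R = 8314.0      # gas constant, J/(kmole K)
K_B = 1.381e-23 # Boltzmann constant, J/K
H = 6.626e-34   # Planck constant, J s

def debye_low_t_heat_capacity(t, theta):
    """Debye T^3 law, valid for t much less than theta."""
    return (12 * math.pi**4 / 5) * R * (t / theta) ** 3

def debye_cutoff_frequency(theta):
    """Highest lattice frequency from k * theta = h * nu_m."""
    return K_B * theta / H

C = debye_low_t_heat_capacity(20.0, 2230.0)
nu_m = debye_cutoff_frequency(2230.0)
print(f"C = {C:.2f} J/kmole-K, nu_m = {nu_m:.2e} Hz")
```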

10. At low temperature the specific heat of graphite is proportional to
T² and is thus an exception to the Debye T³ law. This has been attributed
to the fact that graphite has a layer structure with feeble forces
between layers. Assume a two-dimensional plane lattice for graphite,
show that the distribution of low-frequency modes goes as ν dν, and
derive the T² law.


chapter twenty-one 


Solids—Insulators and Metals = 


The conditions of temperature and pressure at the surface of the earth
are such that a large number of materials exist in the solid state, in which they
maintain both shape and volume. Ionic and covalent binding are dominant
for some solids, but a different variation of the coulomb interaction leads to
metallic crystals with high strength, ductility, and conductivities, both
electrical and thermal.


21.1 Crystals 


Some scientists prefer to reserve the term 
crystalline solid, Of a piece of glass, a bar 


By means of x-rays
it is possible to determine the relative positions of various kinds of
atoms in a material. Suppose for a moment that we could actually 
see the individual atoms in a piece of glass. If we locate a silicon atom 
and examine its neighbors, we find that they are not distributed in a 
random fashion. There is a definite geometrical order. However, if we 
look at other silicon atoms, we find that their neighbors form a somewhat 
different pattern. For glass there is no orderly arrangement which persists
throughout the material; there is a short-range order but not long-range
order. Such a material which retains its shape is sometimes called
an amorphous solid or a supercooled liquid. Crystalline solids exhibit
long-range order; in a large single crystal the ordering extends uninterrupted
over distances of many million atomic diameters.


For discussing crystals it is convenient to introduce the concept 
of a space lattice, a network of straight lines constructed in such a way 
that it divides space into identical volumes with no space excluded. The 
lattice points are at the intersections of these lines. Associated with each
lattice point there is either a single atom or an identical complex of atoms. 
For every crystal there is thus a network of lattice points for each of which 
there is either a single atom or a group of atoms called the basis of the 
crystal. 

In 1848 Bravais showed that there are precisely 14 possible space
lattices (Fig. 21.1); of these we shall be concerned primarily with the
three cubic systems, which fortunately are the simplest ones. For any
type of lattice there exist three fundamental translation vectors a, b, and c;
these are the shortest vectors which satisfy the condition that if we go
from any initial point in a crystal an integer number n₁ of a distances,
n₂ of b distances, and n₃ of c distances, we arrive at a point which has
identically the same environment as the initial point. Thus any two lattice
points can be connected by a translation vector T = n₁a + n₂b + n₃c. The
simplicity of the cubic systems is associated with the fact that for them
a, b, and c are equal in magnitude and mutually perpendicular.

Once a, b, and c are established, it is simple to use them to specify
(1) the locations of any points within a crystal cell, (2) the direction
of any vector, and (3) the orientation of any plane in the crystal. Thus
(1 ½ ½) represents the point which is 1a, ½b, and ½c from the origin. For
the body-centered cell of Fig. 21.1, atoms are located at (000), (100),
(010), (001), (110), (101), (011), (111), and (½ ½ ½).
To specify the direction of a vector we translate the vector, placing
its tail at the origin, and give the coordinates of the first lattice point
reached by the line determined by the vector. Thus the y axis lies in
the [010] direction, while the [210] direction is in the plane formed by
a and b, and corresponds to two a’s and one b. The body atom in a
body-centered cube is in the [111] direction from the cube corner.

Fig. 21.1 The 14 Bravais, or space, lattices. The cells outlined are the
conventional unit cells, which can be used to build an entire lattice by
suitable translations.

Planes in a crystal are usually specified in terms of Miller indices.
To find the Miller indices of a plane we proceed as follows: (1) we deter- 
mine the intercepts of the plane along the axes in terms of a, b, and c; 
(2) we take the reciprocals of these intercepts in order; (3) we find the 
least common denominator of the reciprocals and multiply each by this 
lcd. For example, for the plane with intercepts 4a, 6b, and 2c, the
appropriate reciprocals are ¼, ⅙, and ½. The lcd is 12, and the Miller
indices are written (326). When an intercept is at infinity, the corresponding
index is zero; thus a (100) plane is perpendicular to the x
axis, while a (001) plane is at right angles to the z axis. When one
of the indices is negative, it is customary to place a minus sign above
the index. Thus, if the intercepts are −4a, 2b, −2c, the Miller indices
are (1̄22̄). A plane with intercepts ½a, ½b, ∞ has Miller indices (220)
according to the scheme above. Some scientists prefer to express the
Miller indices in terms of the smallest three integers having the same
ratio and would refer to this as a (110) plane. Since the (220) plane
is parallel to all (110) planes, no confusion related to the orientation of
the plane is introduced; retaining the (220) designation calls attention
to the fact that the specific plane in question has intercepts ½a and ½b.
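The three-step recipe for Miller indices can be written out as a short routine. The sketch below is one possible rendering, assuming intercepts are supplied in units of a, b, and c, with `math.inf` standing for an intercept at infinity; the function name `miller_indices` is introduced here for illustration, and `math.lcm` requires Python 3.9 or later. Following the scheme in the text, the result is not reduced to the smallest integers.

```python
from fractions import Fraction
from math import inf, lcm

def miller_indices(intercepts):
    """Miller indices from axis intercepts given in units of a, b, c.

    An intercept of math.inf means the plane is parallel to that axis.
    """
    # Step 2: reciprocals of the intercepts, in order (1/inf counts as 0).
    recips = [Fraction(0) if x == inf else 1 / Fraction(x) for x in intercepts]
    # Step 3: multiply each reciprocal by the least common denominator.
    lcd = lcm(*(r.denominator for r in recips))
    return tuple(int(r * lcd) for r in recips)

print(miller_indices([4, 6, 2]))                               # (3, 2, 6)
print(miller_indices([-4, 2, -2]))                             # (-1, 2, -2)
print(miller_indices([Fraction(1, 2), Fraction(1, 2), inf]))   # (2, 2, 0)
```

Negative entries in the returned tuple correspond to indices written with a bar above them.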


21.2 Binding in Solids 


The individual atoms of a crystal are held in position by bonds which 
are fundamentally electric in origin but, except for the ionic bonds, 
arise through essentially quantum-mechanical considerations. In dis- 
cussing interatomic bonds, we shall treat bonding in terms of five idealized 
types: ionic, hydrogen, covalent, metallic, and molecular. It should be 
emphasized at the start that this is an arbitrary choice; many bonds 
partake of two or more of these idealized forms. 


a. Ionic Bond The ionic bond arises from the transfer of one or more
electrons from an electropositive element to an electronegative element,
thereby creating a positive and a negative ion. The binding energy
then arises from the electrostatic attraction of oppositely charged ions.
It typically amounts to several electron volts per atom. Ionic crystals
are usually electric insulators which form conducting solutions when
dissolved in ionizing solvents. They are often transparent, sometimes
colored by characteristic absorption of one or both ions. Many are
hard and brittle with high melting temperatures. At sufficiently high
temperature they conduct by diffusion of the ions.

b. Hydrogen Bond The hydrogen bond is a special type of ionic bond
in which a hydrogen nucleus (H+ or D+) holds together two electronegative
atoms with a binding energy of the order of 0.1 eV. Unlike
most ions, H+ can have only two near-neighbor negative ions, because
the proton has a radius of the order of only 10⁻¹⁵ m (all other ions have
radii thousands of times this great). In the crude approximation in
which each ion is regarded as a sphere and the solid is composed of 
spheres in contact, the proton can have only two negative ions touching 
it, while a typical positive ion has six or eight nearest neighbors. The 
hydrogen bond is important in ice, in hydrogen fluoride, and in many 
organic molecules, particularly the proteins. 


c. Covalent Bond The covalent bond in its idealized sense involves 
the sharing between two identical atoms of a pair of electrons of opposite
spin. In a diamond crystal each carbon atom forms covalent bonds 
with four nearest neighbors. Covalent binding energies of a few electron 
volts per atom are common. The crystals are insulators or semicon- 
ductors, having high melting points. They are extremely hard and 
break by cleavage. Group IV elements in the periodic table come 
closest to forming ideal covalent bonds; III-V and II-VI compounds 
have bonds which are a mixture of covalent and ionic. 


d. Metallic Bond The metallic bond arises from the sharing of outer 
electrons by all the atoms of a crystal. In copper, for example, each 
atom contributes roughly one electron to the crystal as a whole. These 
free electrons, leaving the metal atoms in place as ions, serve as a pervasive 
“glue,” moving among the ions and binding them together. Because 
of the free electrons, metallic crystals are opaque and strongly reflecting; 
they are excellent conductors with resistance decreasing as the tempera- 
ture is lowered. Excellent thermal conductivity comes also from the 
free electrons. Typical metals have moderately high melting points; 
they are mechanically tough and ductile rather than brittle. The 
binding energy is usually a few electron volts per atom. 


e. Molecular Bond Even an atom such as argon, with a spherically
symmetric electron distribution, has a small instantaneous electric
dipole moment as a result of motions of the electrons. This produces
at the location of a neighboring molecule a small electric field which
polarizes the latter. A weak attractive interaction results. The
potential energy associated with the dipole moments is proportional
to the square of these moments and to the inverse sixth power of the
separation. Because the dipole moments are very small and ion-core
repulsion prevents the molecules from coming very close, molecular
binding is weak, of the order of a few hundredths of an electron volt
per atom. This small binding leads to very low melting and boiling
points. The crystals are electrically insulating, insoluble, soft,
and readily deformable.


21.3 The Ionic Crystal 


Probably the simplest of all solids to understand are those composed of 
positive and negative ions, such as lithium fluoride and sodium chloride. 
The primary interaction between the ions is electrostatic and involves 
spherical charge distributions. As a specific example, we shall consider 
potassium chloride, since we can use information given in the description 
of the KCl molecule. In the molecule each ion is bound to a single ion
of the opposite sign, whereas in the solid each ion has a number of nearest 
neighbors and it is no longer meaningful to pair each positive ion with a 
single negative ion. 

The crystal lattice for potassium chloride is face-centered cubic 
with one K+ and one Cl- ion associated with each lattice point (Fig. 21.2).


Fig. 21.2 The KCl crystal involves
interleaved face-centered arrays of
K+ and Cl- ions.

Alternatively we may think of it as the interleaving of face-centered
arrays of K+ ions and of Cl- ions. We begin by calculating the binding
energy of KCl, which we define as the energy released when K and Cl
atoms combine to form the solid. As we saw in Sec. 19.2, it requires
4.34 eV to remove the valence electron from a K atom to produce a
K+ ion. Similarly, 3.80 eV is released when an electron joins a Cl atom
to form a Cl- ion. Thus the process of forming an ion pair from neutral
atoms requires 0.54 eV. The energy released when these ions are 
assembled to form a crystal may be calculated as follows. Consider the 
central Cl- ion of Fig. 21.2. If R represents the distance to the nearest
K+ ion in the crystal, there is an electrostatic-interaction energy
of −e²/4πε₀R for each nearest neighbor, of which there are six.
The next nearest neighbors are Cl- ions at distance √2 R; there are
12 of them. Next there are eight K+ ions at a distance of √3 R. Then
come six Cl- ions at a distance of 2R, etc. The total lattice energy E_L
is obtained by summing the potential-energy contribution from all the
ions in the crystal and is given by

$$E_L = -\frac{e^2}{4\pi\epsilon_0 R}\left(6 - \frac{12}{\sqrt{2}} + \frac{8}{\sqrt{3}} - \frac{6}{2} + \cdots\right) = -\frac{\alpha e^2}{4\pi\epsilon_0 R} \qquad (21.1)$$

where α is the value to which the series in the parentheses converges.
(This series converges very slowly; however, schemes have been developed
which give much more rapid convergence.) For the ionic face-centered
crystal α = 1.747558; α is known as the Madelung constant in honor of the
man who computed it in 1912.
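The slow convergence of the series, and the gain from a better summation scheme, can be illustrated numerically. The sketch below uses Evjen's method of fractional surface charges, which is one well-known rapidly converging scheme and is chosen here for illustration; the text does not say which scheme was used for the quoted value. The function names are hypothetical.

```python
import math

def madelung_partial_sum():
    """First four terms of the series in Eq. (21.1)."""
    return 6 - 12 / math.sqrt(2) + 8 / math.sqrt(3) - 6 / 2

def madelung_evjen(n):
    """Madelung constant for the rock-salt arrangement, summed over a cube of
    half-width n (in units of R) with Evjen's weights on the cube surface."""
    total = 0.0
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            for k in range(-n, n + 1):
                if i == j == k == 0:
                    continue
                # An ion on a face counts 1/2, on an edge 1/4, at a corner 1/8.
                on_surface = sum(abs(c) == n for c in (i, j, k))
                weight = 0.5 ** on_surface
                # Ions with odd i+j+k carry the opposite charge and add +1/r.
                sign = 1.0 if (i + j + k) % 2 else -1.0
                total += sign * weight / math.sqrt(i * i + j * j + k * k)
    return total

print(f"four-term partial sum: {madelung_partial_sum():.3f}")
for n in (1, 2, 3, 4):
    print(f"Evjen cube, half-width {n}: {madelung_evjen(n):.4f}")
```

The four-term partial sum is not even positive, whereas the weighted cubes, which are electrically neutral, settle near the quoted value of α after only a few shells.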

The appropriate Madelung constant can be calculated for any ionic 
crystal; e.g., its value is 1.762760 for the CsCl structure (Fig. 21.3), which 
is a simple cubic array of Cs+ ions interleaved with a cubic array of Cl- 
ions. (Note that if all the atoms were the same in Fig. 21.3, the crystal 
would be body-centered.) 

When the ions are sufficiently close for their cores to begin to 
overlap, the Pauli exclusion principle again leads to a repulsive force 
which rises rapidly as R decreases. We may write the repulsive-energy 
term in the form E_R = A/Rⁿ, where A is an appropriate constant and n
is typically of the order of 10. We can then write the approximate binding
energy per ion pair as a function of R for the crystal by combining
the cost of producing an ion pair, the net lattice attractive energy, and the
repulsive energy term to obtain the relation

$$E(R) = (4.34 - 3.80) - 1.748\,\frac{14.40}{R} + \frac{A}{R^n} \quad \text{eV} \qquad (21.2)$$

Here R is in angstroms, and 14.40 eV·Å is the value of e²/4πε₀.

Fig. 21.3 The CsCl crystal
involves interleaved simple
cubic arrays of Cs+ and Cl-
ions.


The sums of these terms for the solid behave very much as they did 
for the molecule and, except for scale factors, the curves of Fig. 19.1 are 
roughly applicable. For the KCl crystal the equilibrium separation is
3.14 Å, and the binding energy is −6.67 eV/ion pair. It should be noted
that in this approximate treatment the vibrational energies of the ions 
have been neglected. 
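The quoted binding energy can be reproduced from the three contributions described above. The sketch below assumes the exponent n = 10 (the text says only "of the order of 10") and fixes the constant A by requiring dE/dR = 0 at the quoted equilibrium separation; both choices, and the function names, are illustrative assumptions rather than the procedure of the text.

```python
ALPHA = 1.747558              # Madelung constant quoted in the text
COULOMB = 14.40               # e^2 / (4 pi eps0) in eV * angstrom
ION_PAIR_COST = 4.34 - 3.80   # eV needed to form a K+ / Cl- pair from atoms

def repulsive_constant(r0, n):
    """A such that dE/dR = 0 at R = r0: A = alpha * 14.40 * r0**(n - 1) / n."""
    return ALPHA * COULOMB * r0 ** (n - 1) / n

def binding_energy(r, a, n):
    """E(R) in eV per ion pair, with R in angstroms."""
    return ION_PAIR_COST - ALPHA * COULOMB / r + a / r ** n

n = 10      # assumed exponent
r0 = 3.14   # equilibrium separation from the text, angstroms
A = repulsive_constant(r0, n)
E0 = binding_energy(r0, A, n)
print(f"E(R0) = {E0:.2f} eV per ion pair")
```

With these assumptions the repulsive term at equilibrium is one-tenth of the Coulomb term, and the result agrees with the value quoted in the text to two decimal places.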

Because the exclusion-principle repulsion is small until the ion 
cores begin to overlap and then rises extremely rapidly, the radii of ions 
in alkali halide crystals are reasonably well defined and one may approx- 
imate each ion by a sphere of fixed radius. For example, Pauling has 
shown that the grating spaces of many alkali halide crystals can be 
estimated by treating the ions as spheres in contact with the following 
assignments to radii in angstrom units: 

Ion          Li+    Na+    K+     Rb+    F-     Cl-    Br-    I-
Radius, Å    0.60   0.95   1.33   1.48   1.36   1.81   1.95   2.16

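The spheres-in-contact estimate amounts to adding two radii. In the sketch below the pairing of each tabulated value with its ion follows the standard Pauling assignments, which is an assumption made here where the table is hard to read; the function name is illustrative.

```python
# Pauling ionic radii in angstroms.
RADII = {"Li+": 0.60, "Na+": 0.95, "K+": 1.33, "Rb+": 1.48,
         "F-": 1.36, "Cl-": 1.81, "Br-": 1.95, "I-": 2.16}

def nearest_neighbor_distance(cation, anion):
    """Estimated cation-anion separation for spheres in contact, in angstroms."""
    return RADII[cation] + RADII[anion]

print(f"KCl: {nearest_neighbor_distance('K+', 'Cl-'):.2f} A")
print(f"NaCl: {nearest_neighbor_distance('Na+', 'Cl-'):.2f} A")
```

For KCl the sum reproduces the 3.14 Å equilibrium separation quoted earlier in this section.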
Evidence supporting the ionic nature of KCl comes from measuring
the dielectric constant as a function of frequency. For static and low- 
frequency electric fields the dielectric constant of ionic crystals is sub- 
stantially larger than it is for optical-frequency fields. This arises from 
the fact that the ions are too massive to respond significantly to high 
frequencies. For low-frequency fields the ions have time to move; ionic 
polarization results when the applied frequency is less than the natural 
frequency of ion vibration. This frequency is typically of the order of
10¹³ Hz. As examples, the ratios of the static dielectric constant to that
at optical frequencies for some familiar ionic crystals are LiF, 9.27/1.92;
NaCl, 5.62/2.25; KCl, 4.68/2.13.


21.4 Covalent Binding 


The covalent bond in solids arises, just as it did for molecules, from the 
sharing of a pair of electrons of antiparallel spin between neighboring 
atoms. The prototype of covalent bonding is that of the group IV ele- 
ments: carbon, germanium, silicon, and gray tin. These all crystallize 
in the diamond lattice (Fig. 21.4). Each atom has four nearest neighbors 
with each of which it shares a covalent bond (Fig. 21.5a). The space 
lattice is face-centered cubic with a basis of two atoms associated with 
each lattice point, one at (000) and the other at (¼ ¼ ¼). The ground-state
configuration for group IV atoms is s²p². However, the p level lies
very near to the s, and in the diamond lattice the configuration may
be better described as sp³. Actually the four bonds in the diamond
structure are of equal strength, and the angles between bonds are the
same. This is perhaps best explained by assuming that the proximity
of other atoms modifies the character of the s and p orbitals in such a way
that their energies become degenerate. Then any linear combination
of the wave functions is also a possible orbital; by a technique known as
sp³ hybridization it is possible to make combinations of the wave functions
which result in four orbitals, each of which is directed toward a
corner of a tetrahedron occupied by a nearest neighbor. An example
of such an orbital is shown in Fig. 21.5b.

Fig. 21.4 If the sites marked by
black and gray circles are occupied
by the same kinds of atoms,
the resulting crystal cell is typical
of diamond, silicon, and germanium;
if the sites are occupied
by different kinds of atoms, e.g.,
zinc and sulfur, the structure is
that of zinc blende.

Fig. 21.5 (a) Covalent bonds typical of atoms which crystallize
in the diamond structure. (b) A hybridized sp³ orbital.

The diamondlike covalent bond is a strong one and highly direc- 
tional. This leads to hard but brittle materials since the effective 
sharing of electrons can occur only in a few directions. It leads to a 
relatively loosely packed crystal in the sense that if a model is assembled 
from touching spheres, only 34 percent of the volume of the model is 
occupied by the spheres. In crystals with covalent binding the dielectric 
constant is almost independent of frequency from zero frequency through 
the optical region. For germanium, for example, the dielectric constant 
is approximately 16 while for silicon it is about 12 over this entire range. 
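The 34 percent figure for the diamond structure follows from simple geometry. The sketch below assumes a conventional cubic cell of edge a containing eight atoms (four face-centered lattice points, each with a two-atom basis) and touching spheres whose diameter equals the nearest-neighbor distance from (000) to (¼ ¼ ¼); the function name is illustrative.

```python
import math

def diamond_packing_fraction():
    """Fraction of the cubic cell filled by touching spheres in the diamond structure."""
    a = 1.0                                   # cube edge, arbitrary units
    atoms_per_cell = 8                        # 4 fcc lattice points x 2-atom basis
    nearest_neighbor = math.sqrt(3) * a / 4   # distance from (000) to (1/4 1/4 1/4)
    radius = nearest_neighbor / 2             # spheres in contact
    return atoms_per_cell * (4 / 3) * math.pi * radius**3 / a**3

print(f"{diamond_packing_fraction():.3f}")  # 0.340
```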

The diamond structure consists of two face-centered cubic lattices, 
the second displaced by one-quarter of the (111) diagonal of the first. 
Many semiconductor compounds crystallize in the zinc blende structure 
in which Zn atoms occupy the lattice points of one lattice and S atoms 
the other (see Fig. 21.4). Thus each Zn atom has four S neighbors at 
the corners of a regular tetrahedron and vice versa. In this case there 
are double bonds between atoms which are partially covalent and 
partially ionic. Among the many compounds which crystallize in the 
zinc blende structure are SiC (IV, IV); AlP, InAs, and InSb (III, V);
ZnSe, ZnS, and CdS (II, VI); CuF, CuCl, and AgI (I, VII). 


21.5 Metallic Binding 


In metals the atoms no longer hold their outermost electrons, which
then form a Fermi gas of electrons, held within the metal but otherwise
essentially free. It is these electrons which are responsible for the
excellent electrical and thermal conductivity of metals. As an example,
we discuss sodium, which crystallizes in the body-centered cubic lattice
with a nearest-neighbor distance of 3.71 Å and a lattice constant of
4.28 Å. The probability of finding the 3s electron of sodium more
than 4 Bohr radii from an isolated atom is substantial, as Fig. 15.4 
shows. But an electron that far away from its atom in the solid would 
be closer to some neighboring atom. As a result of the interaction due 
to near neighbors, the valence electron of sodium becomes almost free 
within a crystal; to a reasonably good approximation each atom of 
sodium contributes one electron to the crystal as a whole. A sodium 
crystal is composed of positive ion cores and free electrons. The energy 
is minimized for this arrangement, since the electron can be in the 
immediate vicinity of positive ions at all times and therefore have low 
potential energy, while at the same time its wave function is spread out, 
and this corresponds to relatively low kinetic energy. 

As we saw in Sec. 19.5, the effect of bringing one atom close to 
another is to split what was a single sharp energy level into a pair of 
levels. If there are many interacting atoms, each outer level is spread 
into a continuous band of levels, some of which lie lower and some higher 
than the original level. When sodium atoms form a crystal, the sharp 
atomic levels are spread into bands, as shown in Fig. 21.6. At the 
equilibrium separation the 3s level is spread out into a broad band with
the lowest energy states lying lower than the atomic 3s level. Meantime,
the 3p band overlaps the 3s, the 3d and 4s bands overlap 3p, and
so forth. As a result, a free electron in the Na crystal can have any
energy from that at the bottom of the 3s band upward. An extended
range of levels from which electrons can gain energy from an applied
electric field and move through the crystal is known as a conduction
band.

Fig. 21.6 As sodium atoms are brought together to form a metal,
the higher-lying sharp atomic energy levels are spread into bands
which overlap strongly at the interatomic spacing of solid Na. (After
Slater.)

All the alkali metals crystallize in the body-centered cubic lattice
in which each ion has eight nearest neighbors. The Pauling radius of
sodium ions is 0.95 Å, but the equilibrium separation of nearest neighbors
is 3.71 Å. This has led Hume-Rothery to describe metallic sodium as
“small Na+ ions swimming in regular array in the electron cloud of the
valency electrons.” In part it is because the ions occupy so small a
fraction of the volume of the crystal that the electron-gas model (Sec.
21.8) is particularly appropriate for the alkalis.
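The numbers quoted for sodium can be tied together with the geometry of the body-centered cube. The sketch below takes the lattice constant and the Pauling radius from the text, assumes hard-sphere ion cores, and uses variable names chosen here for illustration.

```python
import math

a = 4.28       # lattice constant of bcc sodium, angstroms (from the text)
r_ion = 0.95   # Pauling radius of the sodium ion, angstroms (from the text)

# In a bcc cell the nearest neighbor of a corner ion is the body-center ion,
# half a body diagonal away.
nearest_neighbor = math.sqrt(3) * a / 2

# Two ions per conventional bcc cell (8 corners x 1/8 + 1 center).
ion_volume_fraction = 2 * (4 / 3) * math.pi * r_ion**3 / a**3

print(f"nearest-neighbor distance = {nearest_neighbor:.2f} A")
print(f"ion cores fill about {100 * ion_volume_fraction:.0f}% of the volume")
```

On this estimate the ion cores fill roughly a tenth of the crystal volume, which is the sense in which the ions occupy a small fraction of the crystal.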

Most other metals form face-centered cubic or hexagonal close- 
packed (Fig. 21.7a) crystals. These arrays give 12 nearest neighbors 
per atom, the largest possible coordination number of identical atoms.
The face-centered cubic and hexagonal close-packed structures are inti- 
mately related, as one can see by study of Fig. 21.7b. If one starts with 
a layer A of identical spheres having six touching neighbors and a second 
layer B is added so that the spheres fit into the depressions, each sphere 
in the second layer is in contact with nine others, three below and six 
in the second layer. When a third layer is added, there is a choice 
which of two sets of depressions the spheres occupy. If they lie at the 
A sites directly above those of the first layer and the crystal is built 
up of layers ABABA ···, hexagonal close packing results; if they
lie at the C sites and the crystal is built of layers ABCABC ···, a
face-centered cubic crystal is modeled. These two arrays are the closest
possible packings for equal spheres; in each case the spheres occupy 
74 percent of the total volume. For metals like copper and silver, a 
face-centered cubic array of touching spheres is a fair model for the ion 
cores. 
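Both figures quoted for close packing, 12 nearest neighbors and 74 percent filling, can be verified for the face-centered cubic case by direct counting. The sketch below represents fcc sites as integer triples with an even coordinate sum, in units of half the cube edge; this representation and the function names are choices made here for illustration.

```python
import math
from itertools import product

def fcc_points(n):
    """fcc sites as integer triples (i, j, k) with i + j + k even, in units of a/2."""
    rng = range(-n, n + 1)
    return [p for p in product(rng, repeat=3) if sum(p) % 2 == 0]

def coordination_number():
    """Number of sites at the smallest nonzero distance from the origin."""
    d2 = sorted(i * i + j * j + k * k
                for i, j, k in fcc_points(2) if (i, j, k) != (0, 0, 0))
    return d2.count(d2[0])

def fcc_packing_fraction():
    """Touching spheres in fcc: 4 per cube of edge a, radius a * sqrt(2) / 4."""
    a = 1.0
    radius = math.sqrt(2) * a / 4
    return 4 * (4 / 3) * math.pi * radius**3 / a**3

print(coordination_number())             # 12
print(f"{fcc_packing_fraction():.3f}")   # 0.740
```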

The metallic bond arises from the interactions of the ubiquitous 
free electrons and the ion cores. It is directionally insensitive, being 
almost equal in all directions. Further, as long as the ion cores are not 
too different in size, the electron glue serves equally well for various 
kinds of ion cores, which suggests why metals can be alloyed or joined 
together. The close packing favored by the metallic bond leads to 
structures which are dense and strong. The high ductility of metals 
can be related to the fact that it is relatively easy to make one set of
planes slip relative to another. Close-packed planes can slip past each
other with a minimum of interference and reach another stable position
after a minimum displacement. Thanks to the nondirectional nature
of the bond and its sameness for all the ion cores, the atoms of a layer
displaced from one equilibrium position to the next can join up with
a new set of neighbors as perfectly as before, thereby restoring the original
structure of the crystal.

Fig. 21.7 (a) The hexagonal close-packed structure.
(b) When close-packed layers are built up in the order
ABABA ···, hexagonal close packing results; stacking
in the order ABCABC ··· leads to a face-centered cubic
array.


21.6 Metals and Insulators


Metallic binding is the usual form for atoms with few valence electrons, 
which are ordinarily weakly held. In metals these electrons are free 
in the sense that they can move through the crystal with relative ease. 
Even at 0°K there are electrons in the conduction band of a metal (Fig.
21.8a); indeed, we may define a metal as a solid in which there are electrons
free to move under the influence of an applied electric field at 0°K.
In some substances, e.g., bismuth, the number of free carriers is very 
small compared to the number of atoms; such materials are often called 
semimetals. 

In contrast with the metallic case, at 0°K electrons transferred in 
ionic bonds or shared in covalent bonds are not free to gain energy from 
an applied field and migrate through the crystal. Although the sharp 
atomic levels are smeared out in the solid, all the allowed energy states 
up to a forbidden energy region are occupied (at least at 0°K) and there 
are no more electrons to be accounted for. The highest filled band, 
which includes electrons shared in covalent bonds or electrons transferred 
in ionic bonds, is known as the valence band. Even in excellent crystalline 
insulators there are conduction bands above the valence band into which 
electrons can be excited if they are given enough energy, but there is a
forbidden energy region between the bands (Fig. 21.8b and c). When the
forbidden energy gap is very small, thermal excitation may excite electrons
from the valence band to the conduction band, so that the material
may be a conductor at elevated temperatures although a good insulator
at 0°K. A material with a small gap between the valence and conduction
bands is known as a semiconductor.

Fig. 21.8 Comparison of energy bands of (a) a metal, (b) a semiconductor, and
(c) an insulator. At 0°K electrons occupy all states indicated in black; energy
states in gray are allowed but unoccupied at 0°K. Low-lying occupied states
(such as K and L levels) are not shown. (From A. S. Smith and J. N. Cooper,
“Elements of Physics,” 7th ed. Copyright 1964. McGraw-Hill Book Company.
Used by permission.)


21.7 Metallic Conduction 


Although metallic binding is a quantum-mechanical phenomenon, as 
early as 1900 Drude proposed that in metals there are one or more free 
electrons per atom plus a crystal of positive ions. On this basis he was 
able to explain electrical and thermal conductivity semiquantitatively. 
In 1909 Lorentz applied Maxwell-Boltzmann statistics to this electron 
gas and obtained reasonable behavior so far as the optical properties of 
metals are concerned. However, his theory led to unsatisfactory pre- 
dictions for the temperature coefficient of resistance, for specific heats, and 
for the paramagnetic susceptibility. 

A primitive model of metallic conduction begins with the assumption
that the conduction electrons behave in an identical manner on the
average. When an electric field E is applied, the electrons are accelerated
by the driving force −eE. The fact that an equilibrium current is
quickly achieved can be explained by assuming that there arises from
collisions with the ion cores a net damping force which equals the driving
force. Let this damping force be represented by −mv_D/τ, where m is the
electron mass, v_D is the average drift velocity, and τ is a constant we shall
call the relaxation time. For the steady state the driving and damping
forces are equal in magnitude, and so we have

$$v_D = \frac{e\tau E}{m} \qquad (21.3)$$

If E is in the x direction, we may write J_x = nev_D, where J_x is the
current density and n is the number of conduction electrons per unit
volume. If we substitute v_D from Eq. (21.3), we have

$$J_x = \frac{ne^2\tau}{m}\,E_x \qquad (21.4)$$

Thus the assumption of a damping force proportional to the average
drift velocity leads directly to Ohm’s law, which is J = σE in its microscopic
form. We see that σ = ne²τ/m for this model, in which we
attribute the joule heating to the work done against the damping force.

Suppose that a steady electric field E_x has established a steady
current density J_x corresponding to an average drift velocity v_D(0). At
t = 0 let us switch off the field. Then the electrons are decelerated
according to d(mv_D)/dt = −mv_D/τ, from which

$$v_D(t) = v_D(0)\,e^{-t/\tau} \qquad (21.5)$$


The constant τ is called the relaxation time by virtue of its dimension
of time and its role in Eq. (21.5); τ is very small for metals, since in
a noninductive circuit equilibrium is achieved very quickly. For example,
if we take the experimental value σ = 6 × 10⁷ mhos/m for copper
and assume that each atom contributes one free electron, we find that
τ = 2 × 10⁻¹⁴ s.
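
The estimate of τ for copper can be checked numerically from σ = ne²τ/m. The following is a minimal sketch, not a definitive calculation: the density of copper (8940 kg/m³) is the value quoted in Problem 12, while the atomic weight 63.55 and the function names are assumptions introduced here for illustration.

```python
# Rough check of the relaxation time for copper from sigma = n e^2 tau / m.
E_CHARGE = 1.602e-19       # electron charge, C
M_ELECTRON = 9.109e-31     # electron mass, kg
AVOGADRO_KMOL = 6.022e26   # atoms per kilomole

def number_density(density_kg_m3, atomic_weight, electrons_per_atom=1):
    """Conduction electrons per cubic meter."""
    return electrons_per_atom * AVOGADRO_KMOL * density_kg_m3 / atomic_weight

def relaxation_time(sigma, n):
    """Solve sigma = n e^2 tau / m for tau (seconds)."""
    return sigma * M_ELECTRON / (n * E_CHARGE**2)

# Assumed inputs: density from Problem 12, atomic weight 63.55 (not in the text).
n_cu = number_density(8940.0, 63.55)
tau_cu = relaxation_time(6e7, n_cu)
print(f"n = {n_cu:.3g} m^-3, tau = {tau_cu:.2g} s")
```

The result is a few times 10⁻¹⁴ s, consistent with the order of magnitude quoted above.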

In discussing electrical conductivity it is convenient to introduce
the mobility of charge carriers. The mobility μ is the magnitude of the
ratio of drift velocity to applied electric field. Thus


$$ \mu = \left|\frac{v_D}{E}\right| = \frac{e\tau}{m} \tag{21.6} $$


The mobility of charge carriers can be directly measured as we shall see 
in Sec. 22.1. 


For any material the resistivity ρ is the reciprocal of the
conductivity, and by Eq. (21.4)


$$ \rho = \frac{1}{\sigma} = \frac{m}{ne^2\tau} \tag{21.7} $$


In terms of the particle model of an electron, τ is due to collisions with
ion cores; from the wave point of view we think of the electron wave
packet scattered by phonons or by imperfections and impurities as the
electrons pass through the crystal. In terms of either model one can
express the resistivity for a typical metal approximately as the sum of
two terms


$$ \rho = \rho_i + \rho_L \tag{21.8} $$


where ρ_i is the contribution due to interactions with crystal imperfections
and impurities and ρ_L is due to the phonons (lattice vibrations associated
with the thermal motions of the atoms). Equation (21.8) is known as
Matthiessen’s rule and expresses the experimental fact that contributions
to the resistivity from several sources are additive. (For example, for
very thin films another term ρ_s must be added for the contribution of
surface scattering.) The first term ρ_i is essentially independent of
temperature T, while ρ_L is temperature-dependent.

For many metals ρ varies with temperature roughly as curve A
of Fig. 21.9. Near T = 0, ρ_L is negligible. As T is raised, the periodicity
of the lattice is disturbed by phonons passing through the crystal.
Conduction electrons interact with these phonons, which serve to remove
energy from the electrons. For T small compared with the Debye
temperature Θ, ρ_L varies roughly as the fifth power of absolute
temperature; at temperatures high compared with Θ, the lattice contribution
is essentially proportional to T. For reasonably pure metals ρ_i is small
compared with ρ_L at room temperature, and consequently ρ is roughly
a linear function of T. For many alloys ρ_i is very great and completely
dominant over ρ_L.

For more than 20 metals the resistivity falls suddenly to zero at
some critical temperature, as shown by curve B of Fig. 21.9. This
phenomenon of superconductivity was discovered in 1911 by H. Kamerlingh
Onnes in experiments with mercury wires. For mercury the
critical temperature T_c is 4.15°K; technetium has the highest critical
temperature (11.2°K) of any element. However, many alloys have
higher critical temperatures. For Nb₃Sn, widely used for winding
superconductive magnet coils, T_c is slightly over 18°K.


Fig. 21.9 The resistivity of a metal as a function of temperature.
In the insert, curve B is for a metal which becomes superconducting
and curve A for one which does not.


21.8 The Sommerfeld Free-electron Model


Although the considerations of the preceding section predict many
reasonable results, they lead to entirely unsatisfactory predictions of 
such quantities as metallic specific heats. In 1928 Sommerfeld applied 
quantum theory and Fermi-Dirac statistics to the electron-gas model 
with great success. In this model one assumes that the sole effect of 
the ion cores is to neutralize the electric charge on the electrons them- 
selves so that they behave as though they were noninteracting. In 
spite of the artificiality of this assumption, the Sommerfeld theory 
predicts correctly many features of metallic behavior. 

Following Sommerfeld, let us think of the electrons as noninter- 
acting particles in a box, a problem which is solved in Sec. 13.7 for a 
cubical box of side L with the result that the allowed energies are, by 
Eq. (13.262), 


$$ E = \frac{p^2}{2m} = \frac{h^2}{8mL^2}\left(n_x^2 + n_y^2 + n_z^2\right) = \frac{h^2 n^2}{8mL^2} \tag{21.9} $$


where n_x, n_y, and n_z are positive integers.


We wish to calculate the density of states g(E) for the box, where
g(E) dE is the number of allowed electron states per unit volume with
energy between E and E + dE. The number of values of n_x, n_y, and n_z
for which n lies between n and n + dn is (1/2)πn² dn, as one can see by
referring to Fig. 5.4. Since there are two spin states for each value of
n_x, n_y, n_z, we have for the total number of states per unit volume
with n between n and n + dn


$$ g(n)\,dn = \frac{\pi n^2\,dn}{L^3} \tag{21.10} $$


and, by use of Eq. (21.9),


$$ g(E)\,dE = \frac{\pi}{L^3}\,\frac{8mL^2E}{h^2}\,\frac{\sqrt{8mL^2}}{2h\sqrt{E}}\,dE = \frac{2^{7/2}\pi m^{3/2}}{h^3}\sqrt{E}\,dE \tag{21.11} $$


The number of occupied states per unit volume with energy between E and
E + dE is the product of g(E) and the Fermi-Dirac distribution function
f(E):


$$ N(E)\,dE = g(E)f(E)\,dE = \frac{2^{7/2}\pi m^{3/2}}{h^3}\,\frac{\sqrt{E}\,dE}{e^{(E-E_F)/kT}+1} \tag{21.12} $$


where E_F is the Fermi energy.

To calculate the Fermi energy E_F for a metal in the Sommerfeld
theory, we consider the metal at 0°K when the electrons occupy the
lowest possible energy states and f(E) is 1 for E < E_F and 0 for E > E_F.
Thus all states below E_F are filled and all above E_F are empty at 0°K;
N(E) at 0°K is shown by the light solid line of Fig. 21.10. Now if n
represents the number of free electrons per unit volume in the solid,
it is clear that


$$ n = \int_0^{E_F} N(E)\,dE = \frac{2^{7/2}\pi m^{3/2}}{h^3}\int_0^{E_F}\sqrt{E}\,dE = \frac{2^{9/2}\pi m^{3/2}}{3h^3}\,E_F^{3/2} $$


from which


$$ E_F = \frac{h^2}{8m}\left(\frac{3n}{\pi}\right)^{2/3} = 3.65 \times 10^{-19}\,n^{2/3}\ \text{eV} \tag{21.13} $$


Fig. 21.10 The heavy dashed curve shows g(E), the number of
allowed energy states per unit volume per unit energy range at energy
E. The light solid line designates the number N(E) = g(E)f(E) of
occupied energy states per unit volume per unit energy range at
0°K; light dashed lines indicate N(E) at higher temperatures.

For sodium we can readily find n if we assume one free electron
per atom. The density is 971 kg/m³, and the atomic weight is 22.99,
so that n = 6.02 × 10²⁶ × 971/22.99 m⁻³ and E_F = 3.16 eV. Similar
calculations for other monovalent elements yield the results:


Metal     |  Li  |  K   |  Rb  |  Cs  |  Cu  |  Ag
E_F, eV   | 4.72 | 2.14 | 1.82 | 1.53 |      |
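
The sodium figure can be reproduced directly from Eq. (21.13). The sketch below is illustrative only; the constants are rounded values and the function names are assumptions introduced here, not part of the text.

```python
import math

# Rounded physical constants (SI)
H_PLANCK = 6.626e-34       # J s
M_ELECTRON = 9.109e-31     # kg
EV = 1.602e-19             # joules per electron volt
AVOGADRO_KMOL = 6.022e26   # atoms per kilomole

def fermi_energy_ev(n):
    """Eq. (21.13): E_F = (h^2 / 8m) (3n / pi)^(2/3), returned in eV."""
    return (H_PLANCK**2 / (8 * M_ELECTRON)) * (3 * n / math.pi) ** (2 / 3) / EV

def fermi_speed(e_f_ev):
    """Speed of an electron whose kinetic energy equals E_F."""
    return math.sqrt(2 * e_f_ev * EV / M_ELECTRON)

# Sodium: one free electron per atom, density 971 kg/m^3, atomic weight 22.99
n_na = AVOGADRO_KMOL * 971.0 / 22.99
e_f_na = fermi_energy_ev(n_na)
v_f_na = fermi_speed(e_f_na)
print(f"Na: n = {n_na:.3g} m^-3, E_F = {e_f_na:.2f} eV, v_F = {v_f_na:.3g} m/s")
```

The same two functions apply to any metal once n is known, which is how the tabulated values for the other monovalent elements would be obtained.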

Even at 0°K many sodium conduction electrons have a kinetic
energy of more than 3 eV, since the potential energy is zero in this model.
For Na, E_F corresponds to an electron speed in excess of 1.05 × 10⁶ m/s.


The average kinetic energy of the free electrons at 0°K is given by


$$ \bar{K} = \frac{\int_0^{E_F} E\,N(E)\,dE}{\int_0^{E_F} N(E)\,dE} = \frac{\int_0^{E_F} E^{3/2}\,dE}{\int_0^{E_F} E^{1/2}\,dE} = \frac{3}{5}E_F \tag{21.14} $$


When the temperature is raised, some electrons with energy near
E_F are raised to higher energy states; the N(E) curve then is shown by
the dashed curves (b) and (c) of Fig. 21.10. At 290°K kT is only 0.025 eV.
For an energy 1.1 kT greater than E_F, f(E) is 1/4; for 1.1 kT less than
E_F, f(E) = 3/4. [E_F continues to represent the value of E for which
f(E) = 1/2.] Since only a small fraction of the free electrons gain energy
as T is increased, and since those which do gain pick up only about kT,
it is clear why the free electrons make only a small contribution to the
specific heat at room temperature.
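
The quoted values of the Fermi factor follow from f(E) = 1/(e^{(E−E_F)/kT} + 1). A short sketch, with the function name and the rounded Boltzmann constant chosen here for illustration:

```python
import math

K_BOLTZ_EV = 8.617e-5  # Boltzmann constant, eV per kelvin (rounded)

def fermi_dirac(x):
    """Fermi-Dirac occupation as a function of x = (E - E_F)/kT."""
    return 1.0 / (math.exp(x) + 1.0)

kt_290 = K_BOLTZ_EV * 290.0   # kT at 290 K, in eV
f_above = fermi_dirac(1.1)    # 1.1 kT above E_F
f_below = fermi_dirac(-1.1)   # 1.1 kT below E_F
f_at = fermi_dirac(0.0)       # exactly at E_F
print(f"kT = {kt_290:.3f} eV, f = {f_above:.3f}, {f_at:.3f}, {f_below:.3f}")
```

The occupation changes from about 3/4 to about 1/4 over an energy interval of only 2.2 kT, which is roughly 0.055 eV at 290°K and very narrow compared with a Fermi energy of several electron volts.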

We are now in a position to contrast the Sommerfeld model with
that used by Lorentz, who assumed the average kinetic energy of the
electrons was given by K = (3/2)kT. At 300°K this is less than 0.04 eV,
while the Sommerfeld theory predicts a kinetic energy of several electron
volts. On the other hand, the older theory predicts a specific heat of
(3/2)R per kilomole for the electrons, while the newer model calls for two
orders of magnitude less.

Consider a copper conductor bearing a current. The average drift
velocity is typically of the order of millimeters per second, while the
average speed of the electrons is about 1.2 × 10⁶ m/s. In the absence of
an applied field, the average velocity is zero even though the average
speed is great. A finite drift velocity in a metal is analogous to a wind
in the atmosphere; air molecules and electrons in metals are in rapid
motion at all times. A wind and a current result when a small average
drift velocity is superimposed on the random velocity distribution in each
case. The mean free path λ of an electron in copper is approximately
τ times the Fermi velocity v_F (the velocity corresponding to the Fermi
kinetic energy K_F = E_F, since P = 0). Using the value of τ from
Sec. 21.7, we have


λ = 2 × 10⁻¹⁴ s × 1.6 × 10⁶ m/s = 3 × 10⁻⁸ m = 300 Å


Thus a typical electron may travel 100 atom layers between collisions. 
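
The mean-free-path estimate can be sketched as follows. The Fermi energy of copper (7.02 eV) is taken from the answer to Problem 12 and τ from Sec. 21.7; the variable names are assumptions made here for illustration.

```python
import math

M_ELECTRON = 9.109e-31   # kg
EV = 1.602e-19           # joules per electron volt

tau_cu = 2e-14           # relaxation time for copper, s (Sec. 21.7)
e_f_cu_ev = 7.02         # Fermi energy of copper, eV (answer to Problem 12)

# Fermi velocity from K_F = (1/2) m v_F^2
v_f_cu = math.sqrt(2 * e_f_cu_ev * EV / M_ELECTRON)

# Mean free path as relaxation time times Fermi velocity
mean_free_path = tau_cu * v_f_cu
print(f"v_F = {v_f_cu:.2g} m/s, mean free path = {mean_free_path * 1e10:.0f} angstroms")
```

The result is about 3 × 10⁻⁸ m, in agreement with the rounded figure of 300 Å.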

For a metal, or any other kind of Fermi gas, the Fermi energy
depends somewhat on the temperature (see Fig. 20.5). However, as long
as the Fermi energy at 0°K is large compared to kT, the change is very
small; E_F(T) is given by


$$ E_F(T) = E_F(0^\circ\text{K})\left[1 - \frac{\pi^2}{12}\left(\frac{kT}{E_F(0^\circ\text{K})}\right)^2 + \cdots\right] \tag{21.15} $$


Since E_F is several electron volts for metals, one must go to very high
temperatures before the Fermi energy differs significantly from its 0°K
value.
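
To see how small the temperature correction is, one can evaluate the leading term E_F(T) ≈ E_F(0)[1 − (π²/12)(kT/E_F(0))²] for sodium at room temperature. This is a sketch under that truncated form; higher-order terms are ignored and the function name is hypothetical.

```python
import math

K_BOLTZ_EV = 8.617e-5  # Boltzmann constant, eV per kelvin (rounded)

def fermi_energy_at_temperature(e_f0_ev, temperature_k):
    """Leading-order temperature dependence of the Fermi energy (in eV)."""
    ratio = K_BOLTZ_EV * temperature_k / e_f0_ev
    return e_f0_ev * (1.0 - (math.pi**2 / 12.0) * ratio**2)

e_f0_na = 3.16                                   # sodium at 0 K, eV
e_f_300 = fermi_energy_at_temperature(e_f0_na, 300.0)
fractional_drop = (e_f0_na - e_f_300) / e_f0_na
print(f"E_F(300 K) = {e_f_300:.5f} eV, fractional decrease = {fractional_drop:.1e}")
```

The fractional decrease is of the order of a few parts in 10⁵, which is why the 0°K value is used throughout.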

Throughout this discussion of the free-electron model, we have 
assumed the potential energy to be zero; thus we have placed the energy 
zero at the bottom of the conduction band (Fig. 21.11). For sodium the 
Fermi energy lies at about 3.2 eV. It requires about 2.27 eV to take an 
electron at the top of the conduction band and move it an infinite distance 
from the crystal. This energy may be supplied by photons (photoelectric 
effect), by heat (thermionic emission), or by bombarding the crystal with 
electrons (secondary emission) or other particles. Raising the
temperature from 0°K to room temperature decreases the work function φ
very little, since kT is small compared to φ. However, at sufficiently
high temperatures a significant number of electrons have enough energy
to escape.

Fig. 21.11 In the Sommerfeld free-electron model
the potential energy is taken as 0 at the bottom
of the conduction band and electrons have kinetic
energies ranging from 0 to E_F at 0°K. To escape
from the metal an electron at the Fermi level needs
an additional energy φ, the work function of the
metal.


21.9 Thermionic Emission 


As the temperature of a metal is increased, electrons near the Fermi
surface are excited to higher energy states; eventually some have enough
energy to surmount the surface barrier and escape. From Fig. 21.11 the
energy required is E_F + φ, but for an electron to escape it must not only
have at least this energy but must also arrive at the surface with its
momentum p suitably directed. If we consider escape through a surface
defined by the plane x = 0, the x component of the momentum must be
equal to or greater than the critical value


$$ p_{xc} = \sqrt{2m(E_F + \phi)} \tag{21.16} $$


since the kinetic energy associated with the y and z components of the
momentum does not aid in the escape.

Let us now assume that all electrons which reach the surface with
p_x > p_xc are thermionically emitted. Then the emission current density
J_x has the magnitude ev_x times the number of electrons per unit volume
with p_x > p_xc. To calculate J_x we convert the energy distribution of
Eq. (21.12) to a momentum distribution by using E = p²/2m, thereby
obtaining for the number of electrons per unit volume with momentum
magnitude between p and p + dp,


$$ N(p)\,dp = \frac{8\pi p^2\,dp}{h^3\left[e^{(E-E_F)/kT}+1\right]} \tag{21.17} $$


To find the number of electrons per unit volume with momentum
components between p_x and p_x + dp_x, p_y and p_y + dp_y, and p_z and
p_z + dp_z, we note from Fig. 21.12 that, because the momentum
distribution is spherically symmetrical,


$$ N(p_x,p_y,p_z)\,dp_x\,dp_y\,dp_z = \frac{dp_x\,dp_y\,dp_z}{4\pi p^2\,dp}\,N(p)\,dp = \frac{2\,dp_x\,dp_y\,dp_z}{h^3\left[e^{(E-E_F)/kT}+1\right]} \tag{21.18} $$


Only electrons with momenta to the right of the plane p_x = p_xc in
Fig. 21.12 can escape. We now find the emission current density by
evaluating


$$ J_x = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\int_{p_{xc}}^{\infty} \frac{e p_x}{m}\,\frac{2\,dp_x\,dp_y\,dp_z}{h^3\left[e^{(E-E_F)/kT}+1\right]} \tag{21.19} $$


Fig. 21.12 Only electrons with p_x > p_xc can escape
from the metal. Because the momentum distribution
is spherically symmetric, the number of electrons with a
momentum terminating in dp_x dp_y dp_z is N(p) dp multiplied
by the ratio of the volume dp_x dp_y dp_z to the volume
of the spherical shell 4πp² dp.

Since only electrons with energy greater than E_F + φ can be emitted
and φ is many times kT, we may neglect the unity in the Fermi factor and
write


$$ J_x = \frac{2e}{mh^3}\,e^{E_F/kT}\int_{p_{xc}}^{\infty} p_x e^{-p_x^2/2mkT}\,dp_x \int_{-\infty}^{\infty} e^{-p_y^2/2mkT}\,dp_y \int_{-\infty}^{\infty} e^{-p_z^2/2mkT}\,dp_z \tag{21.19a} $$


These three integrals are standard ones evaluated in Appendix 4A; upon
substituting the appropriate values, using p_xc = √(2m(E_F + φ)), one
obtains


$$ J_x = A_0 T^2 e^{-\phi/kT} \tag{21.20} $$


This is the Richardson-Dushman equation for thermionic emission, in
which A_0 is a universal constant with value 1.2 × 10⁶ A/m²-°K².

Some representative thermionic-emission data are given in Table
21.1. The functional variation of J with T is well described by Eq.
(21.20), but experimental values of A_0 are considerably less than the
value computed from the atomic constants for most metals. Of the many
reasons for this discrepancy we mention two: (1) we have assumed that all
electrons approaching the surface with p_x > p_xc escape, but we know
(Sec. 12.2) that some electrons are always reflected at a step barrier; (2)
the work function φ varies with crystallographic direction and for the
polycrystalline wire ordinarily studied, the most copious emission comes
from that fraction of the surface with favorable crystal orientation.
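
Carrying out the three integrals of Eq. (21.19a) gives A_0 = 4πmek²/h³, which can be evaluated from the atomic constants. The sketch below computes this theoretical constant and the corresponding ideal emission current density; the temperature of 2500°K and the function name are illustrative assumptions, and the work function 4.52 eV is the tungsten entry of Table 21.1. Since measured values of A_0 are smaller, the current density printed is an upper estimate rather than a prediction for a real filament.

```python
import math

# Rounded physical constants (SI)
E_CHARGE = 1.602e-19     # C
M_ELECTRON = 9.109e-31   # kg
K_BOLTZ = 1.381e-23      # J/K
H_PLANCK = 6.626e-34     # J s

# Theoretical Richardson-Dushman constant, A/(m^2 K^2)
A0 = 4 * math.pi * M_ELECTRON * E_CHARGE * K_BOLTZ**2 / H_PLANCK**3

def emission_current_density(temperature_k, work_function_ev):
    """Ideal J = A0 T^2 exp(-phi/kT), in A/m^2."""
    exponent = work_function_ev * E_CHARGE / (K_BOLTZ * temperature_k)
    return A0 * temperature_k**2 * math.exp(-exponent)

j_w = emission_current_density(2500.0, 4.52)  # assumed operating temperature
print(f"A0 = {A0:.3g} A/m^2-K^2, J(2500 K, 4.52 eV) = {j_w:.3g} A/m^2")
```

Because φ/kT is about 20 at this temperature, the exponential factor dominates and a modest change in φ or T alters J by orders of magnitude.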


Table 21.1 Thermionic-emission constants 


Emitter   | A_0, A/m²-°K² | φ, eV
Cr        |               | 4.60
Cs        |               | 1.8
Mo        |               | 4.3
Ni        |               | 4.61
Ta        |               | 4.19
W         |               | 4.52
Ba on W   |               | 1.56
Cs on W   |               | 1.36


21.10 Metals in Contact


When two dissimilar conductors are placed in contact, a contact potential 
difference appears between them. Typical values range from a fraction 
of a volt to several volts. Consider two metals, initially far apart, 
which have work functions φ_1 and φ_2 and Fermi kinetic energies shown
in Fig. 21.13a. When they are brought together, electrons can go to
lower energy states by transferring from metal 1 to metal 2. Adding
electrons to a material raises its Fermi level; removing them lowers the
Fermi level. Electrons continue to move preferentially from metal 1
until the Fermi levels coincide (Fig. 21.13b), at which point equilibrium
is achieved with equal numbers of electrons crossing the surface in each
direction per unit time. (Equal Fermi levels correspond to equal chemical
potentials from the thermodynamical point of view.)

The transfer of electrons leaves metal 1 positively charged and
metal 2 negative, thus establishing an electric-dipole layer at the
surface. The added energy an electron needs to cross this layer is the
contact potential energy; from Fig. 21.13b this energy is φ_2 − φ_1, and the
corresponding contact potential difference ΔV is (φ_2 − φ_1)/e. An
electron crossing from metal 1 to 2 gains kinetic energy at the expense of
potential energy and vice versa.

Fig. 21.13 Two metals with different work functions (a) far apart and (b) in contact.
When any two materials are in contact and in equilibrium, their Fermi levels coincide.

One way to measure the contact potential difference is to arrange 
two insulated parallel plates, one of each metal, connected to the termi- 
nals of an electrometer, with a bit of radioactive material so placed as 
to ionize the air between the plates. A zero electrometer reading is 
taken with the terminals connected together; then the terminals are 
disconnected, and a second reading is taken after the accumulation of 
charge on the plates has destroyed the electrostatic field between them 
and so stopped the flow of ions. The difference of the two readings is 
the contact potential difference, the plate showing a negative charge 
in the second reading being the one that is positive when the plates are 
in contact. The difference of surface potential thus defined depends 
greatly, however, upon the state of the surfaces. The potential differ- 
ence can be regarded as the algebraic sum of a potential jump between 
the metals themselves and other jumps, perhaps sometimes voltaic in 
origin, between each metal and whatever layer of oxide or other con- 
taminating substance is present on its surface. 


Problems 


1. Find the Miller indices for planes with each of the following sets 
of intercepts: (a) 6a, 2b, 3c; (b) a, 2b, œ; (c) 2a, —b, 2c. 

2. Imagine a crystal formed of identical hard spheres in contact with
nearest neighbors. Show that the ratio of the volume of the spheres
to the volume of the crystal is 0.74 for face-centered cubic and hexagonal
close-packed, 0.68 for body-centered cubic, 0.52 for simple cubic, and
0.34 for the diamond packing.

3. Show that the number of atoms per unit area of any crystal plane
is proportional to the spacing between adjacent planes.

4. Most of the alkali halides crystallize in the NaCl structure. Some
of them have the following internuclear separations of nearest neighbors:


Halide                      | LiF  | NaF  | KF   | LiCl | NaCl | KCl
Internuclear separation, Å  | 2.01 | 2.31 | 2.67 | 2.57 | 2.81 | 3.14


Assuming a hard-sphere model for the ions and an ionic radius of 1.33 Å
for the F⁻ ion, compute radii for the other ions of these crystals. Why
does the hard-sphere model give reasonably consistent values when we
know the solids are compressible and that the ions are far from hard?

5. Find the radius of the largest sphere which can fit into the largest
interstice for a simple cubic structure composed of identical hard spheres
in contact. Do the same for face-centered and body-centered crystals.

6. Show that in a cubic crystal the distance between adjacent planes
with Miller indices (hkl) is given by d_hkl = a/(h² + k² + l²)^{1/2}, where a
is the lattice constant.

7. Show that in the diamond structure, with atoms at the corners of
a regular tetrahedron, the tetrahedral bonds make an angle of 109.5° with
each other.

8. At about 1180°K iron transforms into a face-centered cubic structure
from the body-centered cubic form normal at room temperature. Assuming
there is no change in density, calculate the ratio of the nearest-neighbor
distance in the face-centered modification to that in the
body-centered form.

Ans: 1.029 


9. Calculate a rough approximation to the Madelung constant for
KCl by considering only the first cube of ions surrounding a central K⁺
ion and assuming that for an ion on any face one-half the charge belongs
to the cube, for an ion on an edge one-fourth, and for a corner ion
one-eighth.

Ans: 1.45

10. Show that the Madelung constant for a one-dimensional infinite
line of alternating positive and negative ions is 2 ln 2, or 1.386.

11. If the cohesive energy of NaCl is 6.61 eV/ion pair and the
interatomic spacing is 2.82 Å, find the electrostatic-attractive energy and
the ion-core repulsive energy per ion pair and calculate an approximate
value of n.

12. From the fact that the density of copper is 8940 kg/m³, calculate
the number of Cu atoms per cubic meter, the Fermi kinetic energy on
the basis of the Sommerfeld model (one free electron per atom), the
average kinetic energy of the free electrons, and the Fermi velocity
(speed of electron having the Fermi kinetic energy).

Ans: 8.47 × 10²⁸ m⁻³; 7.02 eV; 4.2 eV; 1.6 × 10⁶ m/s

13. A piece of No. 12 copper wire carrying its rated maximum current
of 25 A is at 50°C. At that temperature the resistance of a piece 1 m
long is 0.0058 Ω; its cross-sectional area is 3.3 × 10⁻⁶ m². Assuming
that there is one conduction electron per atom, find (a) the conductivity
σ, (b) the average drift velocity of the electrons, (c) the mobility μ, and
(d) the mean free path. (Use the results of the preceding problem.)


14. The Fermi temperature for an electron gas is defined as the ratio
of the Fermi energy E_F to the Boltzmann constant k. Calculate the Fermi
temperature of the electrons for each of the metals for which an
approximate Fermi energy is tabulated in Sec. 21.8.

15. Compute the Fermi kinetic energy for aluminum assuming that
there are three free electrons per atom. (The density of Al is 2750 kg/m³.)
Over what temperature range is this energy greater than 10kT?

16. The potential energy of a pair of atoms in a solid can be written
P(x) ≈ Ax² − Bx³ − Cx⁴, where the atomic separation is R_0 + x and
the zero of potential energy is taken to be the equilibrium configuration
at 0°K. Show that ⟨x⟩ = 3BkT/4A² when |Bx³ + Cx⁴| is small compared
to Ax² and to kT. Hint: Take


$$ \langle x \rangle = \frac{\int_{-\infty}^{\infty} x\,e^{-P(x)/kT}\,dx}{\int_{-\infty}^{\infty} e^{-P(x)/kT}\,dx} $$


expand small terms in series, and integrate.


17. Show that the velocity distribution of electrons perpendicular to the
surface of a thermionic emitter is maxwellian in form, i.e., is proportional
to the number which would escape if electrons in the metal obeyed
Maxwell-Boltzmann statistics and had no barrier to overcome. Show
that for the escaping electrons in the vacuum the average kinetic energy
associated with the x component of the momentum is kT while for the y
and z components it is (1/2)kT.

18. Show that as long as the Fermi kinetic energy K_F of the free
electrons in a metal is very large compared to kT, the molar heat capacity
of these electrons is


$$ (C_v)_{el} = \frac{\pi^2 RkT}{2K_F} $$


where R is the general gas constant and it is assumed that there is one
free electron per atom.


chapter twenty-two 


The Band Model for Metals 


In many respects the Sommerfeld model of a metal as a Fermi gas of
free electrons provides amazingly good predictions of the properties of the
alkali metals and reasonable ones for copper, silver, and gold. It leads
directly to acceptable values for the specific heats of these metals and to the
Richardson-Dushman equation for thermionic emission. But the
interactions between electrons and the ion cores are of transcendent
importance for a quantitative understanding of properties of other metals
and of alloys. A dramatic example of the failure of the free-electron model
to account for the properties of certain metals is found in the Hall effect,
the subject with which we begin this chapter. When the electron-lattice
interaction is taken into account, we are led to several ideas which have
proved to be of great utility.


22.1 The Hall Effect 


Half a century before Sommerfeld’s free-electron theory and more than
15 years before the electron was discovered, Hall performed experiments
designed to tell whether metallic conduction is due to positive or negative
carriers. In 1879 he passed a current through a thin sheet of metal
with a pair of contacts C and D attached at opposite sides of the central
equipotential surface (Fig. 22.1). When a magnetic field B_z is applied
normal to the sheet, the equipotentials are shifted and a potential
difference develops between C and D. Suppose for the moment that the
carriers are positive and move in the x direction. The Lorentz force
q(v × B) is then in the −y direction, so that the carriers are initially
deflected downward. This makes the bottom of the sheet (and D) more
positive, while the top (and C) becomes more negative. The surface
charge builds up until the resulting electric field E_y exerts an upward
force equal on the average to the downward Lorentz force. Thus at
equilibrium


$$ q\mathbf{E} + q(\mathbf{v} \times \mathbf{B}) = 0 \tag{22.1a} $$

or

$$ E_y = v_x B_z \tag{22.1b} $$


where v_x is the average speed of the carrier (and hence is to be identified
with the drift velocity rather than the Fermi velocity).

Now let us suppose that the carriers are negative. For a conventional
current in the x direction, negative carriers move in the −x direction,
but the Lorentz force is still downward. In this case negative
charge is deflected downward, D becomes negative relative to C, and E_y
is negative. Thus Hall reasoned that if the charge carriers in a metal are
positive, D should become positive relative to C when the magnetic field
is applied; if the carriers are negative, C should become positive relative
to D.

Fig. 22.1 Positive carriers moving in the x direction are deflected downward by a
magnetic induction out of the plane of the paper. When equilibrium is established,
there is an induced electric intensity in the y direction. If the current is due to
negative carriers, the carriers are also deflected downward (dashed lines at right)
and E_y is in the −y direction.

We now define the Hall coefficient to be


$$ R_H = \frac{E_y}{J_x B_z} \tag{22.2} $$


where J_x is the current density; R_H is positive for positive carriers,
negative for negative carriers. The current density J_x is given by qnv_x,
where n is the number of carriers per unit volume. If we substitute this
in Eq. (22.2) and make use of (22.1b), we have


$$ R_H = \frac{1}{nq} \tag{22.2a} $$


Thus a measurement of the potential difference between C and D in
the presence of a known field B_z not only tells us the sign of the carrier
but also gives the number of carriers per unit volume, since |q| = e,
the fundamental unit of charge.

The Hall coefficients of several metals are listed in Table 22.1.
Also tabulated is the number of free electrons per atom computed from
R_H and the density of the metal. For the alkalis, and other valence-1
metals, Hall measurements are reasonably compatible with the idea of one
conduction electron per atom. However, the Hall coefficient is positive
for beryllium, zinc, cadmium, lead, molybdenum, and several other
metals. More than half a century had to elapse before an explanation
was available in terms of the band theory of metals (Sec. 22.7).


Table 22.1 Hall coefficients and corresponding
number of free electrons per atom at
room temperature


Metal | R_H, m³/C       | Free Electrons per Atom
Na    | −2.5 × 10⁻¹⁰    | 0.99
K     | −4.2 × 10⁻¹⁰    | 1.1
Cu    | −0.55 × 10⁻¹⁰   | 1.3
Ag    | −0.84 × 10⁻¹⁰   | 1.3
Al    | −0.30 × 10⁻¹⁰   | 3.5
Be    | +2.4 × 10⁻¹⁰    | −2.2
Zn    | +0.33 × 10⁻¹⁰   | −2.9
Cd    | +0.60 × 10⁻¹⁰   | −2.5
Sb    | 230 × 10⁻¹⁰     | −0.008
Bi    | −1500 × 10⁻¹⁰   | 3 × 10⁻⁴
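
The last column of Table 22.1 follows from Eq. (22.2a) together with the atomic density. The sketch below reproduces the entries for sodium and copper, taking the tabulated coefficients to be in units of 10⁻¹⁰ m³/C. The densities come from Sec. 21.8 and Problem 12; the atomic weight of copper (63.55) and the function names are assumptions introduced here. Following the table's convention, a positive Hall coefficient is reported as a negative number of free electrons per atom.

```python
E_CHARGE = 1.602e-19       # fundamental unit of charge, C
AVOGADRO_KMOL = 6.022e26   # atoms per kilomole

def carrier_density(hall_coefficient):
    """Carriers per cubic meter from |R_H| = 1/(n e), Eq. (22.2a)."""
    return 1.0 / (abs(hall_coefficient) * E_CHARGE)

def free_electrons_per_atom(hall_coefficient, density_kg_m3, atomic_weight):
    """Signed carriers per atom; negative when the carriers are positive."""
    atoms_per_m3 = AVOGADRO_KMOL * density_kg_m3 / atomic_weight
    sign = 1.0 if hall_coefficient < 0 else -1.0
    return sign * carrier_density(hall_coefficient) / atoms_per_m3

per_atom_na = free_electrons_per_atom(-2.5e-10, 971.0, 22.99)
per_atom_cu = free_electrons_per_atom(-0.55e-10, 8940.0, 63.55)
print(f"Na: {per_atom_na:.2f} per atom, Cu: {per_atom_cu:.2f} per atom")
```

Both results agree with the tabulated values to within the rounding of the Hall coefficients.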

From Hall measurements we can obtain not only the sign of the
carrier and the number per unit volume but also the mobility. One
way to determine μ is to measure the Hall angle θ through which the
equipotential lines are rotated (Fig. 22.1) by the magnetic field; this
angle is given by tan θ = E_y/E_x. If we make use of Eq. (22.2) and
recall that J_x = nev_x = neμE_x = σE_x, we obtain


$$ \tan\theta = \frac{E_y}{E_x} = R_H\sigma B_z = \mu B_z \tag{22.3} $$


The Hall angle is positive for positive carriers and negative for electrons. 
From measured mobilities one can calculate relaxation times and mean 
free paths. 

The Hall effect was first used to study metals, but it is an even 
more valuable tool for the study of semiconductors. Since the number 
of carriers per unit volume for semiconductors is small compared with 
the number for metals, Hall coefficients and Hall angles are correspond- 
ingly larger. 

Throughout this discussion of the Hall effect it has been assumed 
that only one kind of carrier is present, although for the metals with 
positive Hall coefficients and for many semiconductors this is not the 
case. When more than one kind of carrier is present, a more elaborate 
theory is required and Eqs. (22.2a) and (22.3) are no longer applicable. 


22.2 Electrons in a Periodic Lattice 


To gain a further insight into the nature of metals and to understand how
some can have positive Hall coefficients, we must take into account the
periodic fields which arise from the arrangement of ion cores in a lattice.
While free electrons in a metal tend to smooth out the fluctuations in
potential due to the ions, the assumption that the potential is constant
throughout the metal is a gross oversimplification. Even though the
free-electron model successfully predicts many metallic properties, the
lattice periodicity has far-reaching consequences.

The solutions of the Schrödinger equation for a free electron in one
dimension are of the form (Sec. 12.2)


$$ \psi = Ce^{ikx} \tag{22.4} $$


where k is the wavenumber given by k = 2π/λ = p/ħ. However, if
the electron is subject to a potential P(x) which has the spatial periodicity
a of the lattice, Bloch has shown that the solutions are of the form


$$ \psi = u_k(x)\,e^{ikx} \tag{22.4a} $$

where u_k(x) has the periodicity of the lattice, so that

$$ u_k(x) = u_k(x + a) = u_k(x + na) \qquad n \text{ an integer} \tag{22.5} $$


The Bloch function of Eq. (22.4a) differs only by a periodic function
from the free-electron function of Eq. (22.4). The effect of the lattice
periodicity is to modulate the free-electron solution, since


$$ \psi(x + a) = u_k(x + a)\,e^{ik(x+a)} = \psi(x)\,e^{ika} \tag{22.6} $$


The specific form of u_k(x) depends on the particular P(x) involved
and on the value of the wavenumber k. For example, Kronig and
Penney have solved the Schrödinger equation for a one-dimensional
array of rectangular potential wells (Fig. 22.2). They found that for
an electron in this periodic field, certain bands of energy are allowed
and certain other bands forbidden. Without carrying through the
calculations, we can see how such a result develops. For widely spaced
wells of considerable depth, an electron is virtually bound in a well,
and the eigenenergies are those for a single well. For this case we found
sharp energy levels (Secs. 13.3 and 13.4). As the wells are brought
closer together, the wave function penetrates the barrier more and
more. For two wells this gave rise (Sec. 19.5) to the splitting of a single
energy state into two; for a large number of equally spaced wells it
leads to the spreading of a single level into a continuous band, which
becomes wider and wider as the spacing of the wells is reduced. Finally,
when the barrier thickness is reduced to zero, we have a free electron
in an infinitely wide well; all kinetic energies are allowed. In Fig. 22.3
the solid line shows the energy as a function of k for an electron in a
periodic potential such as that of Fig. 22.2. The dashed curve shows
E as a function of k in the free-electron limit.

Fig. 22.2 The potential energy for a one-dimensional array of identical square wells
a distance a apart.

Fig. 22.3 The allowed energies for a one-dimensional array of square wells
(Fig. 22.2) as a function of the wave vector k. The discontinuities occur at
values of k for which Bragg’s law is satisfied.

Another approach to the idea of forbidden energy bands in a crystal
comes from applying Bragg’s law to the wave properties of the electron.
Consider an electron of wavenumber $k = 2\pi/\lambda$ moving along a
one-dimensional lattice of spacing a. For this wave the Bragg angle
$\theta$ is 90°, and Bragg’s law may be written

$$n\lambda = \frac{2\pi n}{k} = 2a \sin 90^\circ \tag{22.7a}$$

or

$$k = \frac{n\pi}{a} \tag{22.7b}$$

Thus an electron with wavenumber $k = n\pi/a$ experiences strong Bragg
reflection as it moves through the one-dimensional crystal. An electron
with wavenumber $k = n\pi/a$ does not propagate as a traveling wave
moving progressively in one direction but is represented by a set of
standing waves arising from Bragg reflections. If we increase k gradually
starting at k = 0, we find that electrons can pass quite freely through a
periodic one-dimensional crystal until k approaches $\pi/a$, at which point
strong Bragg reflection occurs. As we increase k further, traveling
waves are allowed again, until at $2\pi/a$ they experience second-order
Bragg reflection.
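
As a numerical illustration of Eq. (22.7b), the short sketch below lists the wavenumbers $k = n\pi/a$ at which Bragg reflection occurs and the corresponding free-electron energies $\hbar^2 k^2/2m$. The lattice spacing of 3 Å and the function names are assumptions chosen for the example, not values taken from the text.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
M_E = 9.1093837015e-31   # free-electron mass, kg
EV = 1.602176634e-19     # joules per electron volt


def bragg_wavenumbers(a, n_max):
    """Wavenumbers k = n*pi/a (n = 1..n_max) satisfying Eq. (22.7b)."""
    return [n * math.pi / a for n in range(1, n_max + 1)]


def free_electron_energy_ev(k):
    """Free-electron energy hbar^2 k^2 / 2m, expressed in eV."""
    return (HBAR * k) ** 2 / (2.0 * M_E) / EV


a = 3.0e-10  # hypothetical lattice spacing in metres (assumption)
for n, k in enumerate(bragg_wavenumbers(a, 3), start=1):
    print(f"n = {n}: k = {k:.3e} 1/m, free-electron E = {free_electron_energy_ev(k):.2f} eV")
```

Because the free-electron energy grows as $k^2$, the second-order reflection at $2\pi/a$ sits at four times the energy of the first.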


22.3 Effective Mass 


For a free electron the kinetic energy is $K = p^2/2m$, and since
$p = h/\lambda = \hbar k$, the E-vs.-k curve is the parabola
$E = \hbar^2 k^2/2m$. Obviously the latter
equation does not apply to the heavy curve of Fig. 22.3, which may be
discussed in terms of an effective mass m*. The concept of effective mass
is widely used in solid-state physics. When a free electron is subjected
to an electric field E it experiences an acceleration $a = -eE/m$. However,
when the electric field is applied to a crystal, few electrons, if any,
have the acceleration $-eE/m$. For example, a copper K electron is so
tightly bound to its atom that it is not accelerated at all; its “effective
mass” is infinite. For an electron which is not bound to any single atom,
Newton’s second law gives


$$ma = -eE + \text{forces due to neighboring ion cores and electrons}$$

These latter forces we do not know quantitatively, but we can transfer
our ignorance to the left side of the equation by writing

$$m^* a = -eE \tag{22.8}$$

Let us estimate m* for an electron in a one-dimensional lattice, for
which the E-vs.-k curve is shown in Fig. 22.3. To do this, we invoke
wave properties of the electron, which we assume moves with a group
velocity $v_g$ given by [see Eq. (11.5)]

$$v_g = \frac{d\omega}{dk} = 2\pi\frac{d\nu}{dk} = \frac{2\pi}{h}\frac{dE}{dk} = \frac{1}{\hbar}\frac{dE}{dk} \tag{22.9}$$


where we have made use of the fact that $E = h\nu$. In a time $\delta t$ the
electric field E does work on the electron, increasing its energy by

$$\delta E = -eE v_g\,\delta t \tag{22.10}$$

while by Eq. (22.9), $\delta E = \hbar v_g\,\delta k$. Therefore,

$$\hbar v_g\,\delta k = -eE v_g\,\delta t$$

or

$$-eE = \hbar\frac{dk}{dt} \tag{22.11}$$

Thus the external applied force on the electron is equal to $\hbar\,dk/dt$ when
we are describing the electron in terms of the wave formulation. The
acceleration of the electron is, by Eqs. (22.9) and (22.11),

$$a = \frac{dv_g}{dt} = \frac{1}{\hbar}\frac{d^2E}{dk\,dt} = \frac{1}{\hbar}\frac{d^2E}{dk^2}\frac{dk}{dt} = -\frac{1}{\hbar^2}\frac{d^2E}{dk^2}\,eE$$


Comparison with Eq. (22.8) leads immediately to

$$m^* = \frac{\hbar^2}{d^2E/dk^2} \tag{22.12}$$

For a three-dimensional crystal the effective mass is represented by a
tensor of the form

$$m^*_{ij} = \frac{\hbar^2}{\partial^2 E/\partial k_i\,\partial k_j} \tag{22.12a}$$
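
Equation (22.12) can be explored numerically. The sketch below estimates $d^2E/dk^2$ by a central difference and evaluates m* across the first zone. The cosine-shaped band $E(k) = A(1 - \cos ka)$, with A chosen so that m* = m at k = 0, and the 3 Å spacing are illustrative assumptions, not the band calculated in the text; they are picked only because they show the qualitative behavior of a positive mass near k = 0, a divergence at the inflection point, and a negative mass near the zone boundary.

```python
import math

HBAR = 1.054571817e-34  # reduced Planck constant, J s
M_E = 9.1093837015e-31  # free-electron mass, kg


def effective_mass(energy, k, dk):
    """m* = hbar^2 / (d^2E/dk^2), Eq. (22.12), via a central difference."""
    d2e_dk2 = (energy(k + dk) - 2.0 * energy(k) + energy(k - dk)) / dk**2
    return HBAR**2 / d2e_dk2


A_LATTICE = 3.0e-10                          # hypothetical spacing, m (assumption)
AMPLITUDE = HBAR**2 / (M_E * A_LATTICE**2)   # makes m* = m at k = 0


def model_band(k):
    """Assumed illustrative band E(k) = A (1 - cos ka); not from the text."""
    return AMPLITUDE * (1.0 - math.cos(k * A_LATTICE))


DK = 1.0e-4 * math.pi / A_LATTICE
# k = 0.5 pi/a is skipped: d^2E/dk^2 vanishes there and m* diverges.
for fraction in (0.0, 0.25, 0.75, 1.0):
    k = fraction * math.pi / A_LATTICE
    ratio = effective_mass(model_band, k, DK) / M_E
    print(f"k = {fraction:.2f} pi/a: m*/m = {ratio:+.3f}")
```

For this assumed band $m^*/m = 1/\cos ka$, so the mass changes sign at $k = \pi/2a$ and equals $-m$ at the zone boundary.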

We now consider the behavior of the effective mass for the range
$0 < k < \pi/a$ in Fig. 22.4a. For small k the curve corresponds closely
to the free-electron curve, so that m* = m. At point A the slope, and
hence $v_g$, reaches its maximum value; $d^2E/dk^2$ is zero, and m* becomes
infinite. Between A and B, $d^2E/dk^2$ is negative, corresponding to negative
values of m*. (As we shall see in Sec. 22.7, it is desirable to discuss
this region of k space in terms of holes with positive charge and positive
mass.) As k approaches $\pi/a$, $v_g$ approaches 0 and m* is $-m$; one
interpretation is that when k is essentially $\pi/a$, applying an accelerating
field leads to Bragg reflection. Thus the force on the electron in one
direction leads to the electron’s gaining momentum in the opposite
direction; by virtue of its interaction with the crystal, the electron
behaves as though it had negative mass.

Fig. 22.4 (a) E, (b) dE/dk, (c) $d^2E/dk^2$, and (d) m* plotted as functions
of k for a periodic potential function like that of Fig. 22.2.


22.4 Brillouin Zones


Consider a two-dimensional square lattice such as that of Fig. 22.5. An
electron moving in the [10] direction experiences first-order Bragg
reflection from the (10) planes if its k vector has magnitude $\pi/a$. However,
an electron moving at an angle $\theta$ with the (10) planes undergoes first-order
Bragg reflection from these planes when k reaches the critical value $k_c$,
given by $2\pi/k_c = \lambda = 2a \sin\theta$, or

$$k_c = \frac{\pi}{a \sin\theta} \tag{22.13}$$

Thus the critical k depends on the propagation direction of the electron.
For example, an electron moving in the [31] direction has
$\sin\theta = 3/\sqrt{10}$, so that $k_{c[31]} = \sqrt{10}\,\pi/3a$. When $\theta$ is
less than 45°, Bragg reflection occurs from the (01) planes for a smaller
$k_c$ than that given by Eq. (22.13), viz., for
$k_c = \pi/(a \sin\phi) = \pi/(a \cos\theta)$, where $\phi = 90^\circ - \theta$ is
the angle the electron makes with the (01) planes.
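
The direction dependence of the critical wavenumber can be sketched in a few lines. The function below (its name and signature are assumptions made for illustration) takes a direction [h l] in the square lattice, forms $\sin\theta$ and $\cos\theta$ relative to the (10) planes, and returns the smaller of the two critical values $\pi/(a\sin\theta)$ and $\pi/(a\cos\theta)$ discussed above.

```python
import math


def critical_k(direction, a):
    """Smallest |k| along `direction` = (h, l) giving first-order Bragg
    reflection from the (10) or (01) planes of a square lattice."""
    h, l = direction
    norm = math.hypot(h, l)
    sin_theta = abs(h) / norm   # theta is measured from the (10) planes
    cos_theta = abs(l) / norm   # equals sin(phi) for the (01) planes
    # Reflection happens first from whichever set of planes needs the smaller k.
    return math.pi / (a * max(sin_theta, cos_theta))


a = 1.0  # lattice spacing in arbitrary units, so k is in units of 1/a
for direction in [(1, 0), (3, 1), (1, 1), (1, 3)]:
    print(direction, critical_k(direction, a) / math.pi, "pi/a")
```

For the [31] direction the result is $\sqrt{10}/3 \approx 1.054$ in units of $\pi/a$, the value quoted in the text; the [13] direction gives the same number because there the (01) planes reflect first.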

In a two-dimensional k space let us represent k for each of many
electrons in our lattice by an arrow with its tail at the origin (Fig. 22.6).
Let us further draw the square bounded by $k_x = \pm\pi/a$, $k_y = \pm\pi/a$ and
thereby outline what is known as the first Brillouin zone for our
two-dimensional crystal. Clearly, the condition for Bragg reflection for
the [10] direction is that the appropriate arrow terminate on the zone
boundary; similarly, a $k_{[31]}$ vector of length $\sqrt{10}\,\pi/3a$ ($k_5$ in
Fig. 22.6) just reaches the boundary. In general, any k vector which
terminates on the boundary of the first Brillouin zone satisfies the
condition for Bragg reflection.

Fig. 22.5 A two-dimensional square lattice with a vector in the [31]
direction which is perpendicular to the (31) planes.

Fig. 22.6 First, second, and third Brillouin zones for a two-
dimensional square lattice. Any k vector which terminates on a
zone boundary satisfies Bragg's law for reflection from some set
of lattice planes.

An electron with a k vector such as $k_1$ or $k_2$ has too long a
wavelength to suffer Bragg reflection from any set of planes. It can move
almost freely through the crystal. An electron associated with $k_3$ has
too short a wavelength for first-order Bragg reflection from (10) planes
and too long for reflection from (11) or (01) planes. However, $k_4$
terminates on the boundary of the second Brillouin zone, satisfying the
Bragg condition for the (11) planes, for which the spacing is

$$a/\sqrt{1^2 + 1^2} = a/\sqrt{2}$$

in the square lattice. This corresponds to a spacing $\sqrt{2}/a$ in k space,
since distances there are reciprocal to distances in the crystal; for this
reason k space is sometimes called a reciprocal space.


Fig. 22.7 The first Brillouin zone in k space for (a) simple cubic,
(b) face-centered cubic, and (c) body-centered cubic crystal.

For two- and three-dimensional lattices, just as for the one-dimensional
case, there are discontinuities in the allowed energies for electrons
for each k vector that satisfies the condition for Bragg reflection from
some set of lattice planes. In the positive quadrant of Fig. 22.6, this
means reflection of any order from (10), (01), (11), (21), (12), (31),
(13), etc., planes. [It is common to refer to second- and third-order
reflection from (10) planes as first-order from (20) and (30) planes; in
this nomenclature one always deals with first-order reflection.] If we
draw all these planes in k space, they serve as portions of the boundaries
of the various Brillouin zones. If we imagine a short k vector in any
direction starting from the origin and growing steadily longer in time,
this vector terminates in the first Brillouin zone until it reaches a plane.
When it crosses this plane, it is in the second Brillouin zone until it
reaches the next plane, and so forth. In Fig. 22.6 a k vector growing
in the [11] direction would cross the (10), (01), and (11) planes at the
point $k_x = \pi/a$, $k_y = \pi/a$. Having crossed three sets of planes at this
point, it finds itself in the fourth Brillouin zone. Figure 22.6 shows the
first three Brillouin zones for the square lattice; all the zones have the
same area. We shall be concerned primarily with the first two Brillouin
zones.
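
The plane-counting rule just described lends itself to a small sketch: the zone number of a point in k space is one more than the number of Bragg planes lying between it and the origin. The code assumes the standard form of the Bragg condition in reciprocal space, $\mathbf{k}\cdot\mathbf{G} = |\mathbf{G}|^2/2$ with $\mathbf{G} = (2\pi/a)(m, n)$, which is not written out in the text; the function name and the cutoff `g_range` are likewise illustrative assumptions.

```python
import math


def brillouin_zone_index(kx, ky, a, g_range=4):
    """Zone number of (kx, ky) for a square lattice of spacing a.

    Counts Bragg planes k.G = |G|^2 / 2 crossed on the way out from the
    origin. `g_range` limits the reciprocal vectors searched, so it must
    be large enough for the k point considered.
    """
    g0 = 2.0 * math.pi / a
    crossed = 0
    for m in range(-g_range, g_range + 1):
        for n in range(-g_range, g_range + 1):
            if m == 0 and n == 0:
                continue
            gx, gy = m * g0, n * g0
            if kx * gx + ky * gy > 0.5 * (gx * gx + gy * gy):
                crossed += 1
    return 1 + crossed


a = 1.0
for kx, ky in [(0.5, 0.0), (1.2, 0.0), (1.4, 0.7), (1.01, 1.01)]:
    zone = brillouin_zone_index(kx * math.pi / a, ky * math.pi / a, a)
    print(f"k = ({kx}, {ky}) pi/a lies in zone {zone}")
```

A point just beyond the corner $k_x = k_y = \pi/a$ has crossed the (10), (01), and (11) planes and so is reported in the fourth zone, in agreement with the text.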

Application of these ideas to three dimensions leads to the first 
Brillouin zones for simple, face-centered, and body-centered cubic lat- 
tices shown in Fig. 22.7a, b, and c, respectively. The first Brillouin 
zone is the region in k space bounded by planes such that any electron with 
a k vector terminating inside the zone has too long a wavelength to suffer 
Bragg reflection from any set of lattice planes, while one with a k terminating 
on the zone boundary has a wavelength satisfying the Bragg condition. An
important conclusion brought out by the Brillouin zones is that the
magnitude of k for which Bragg reflection occurs is not the same in all
directions. Thus the energy at which forbidden energy gaps occur is
a function of the direction in which an electron moves through the
crystal.

22.5 The Fermi Surface


It is possible to show the allowed energy values for electrons in a crystal 
in k space, since we have, through Fig. 22.3 (and the equations from which 
it was calculated), a relationship between k and the kinetic energy. 
For k vectors which are small enough so that they do not come close to 
Brillouin zone boundaries, the allowed energies are very similar to those 
for free electrons. If we join all points in k space which correspond to 
the same energy, we obtain a set of energy contours (Fig. 22.8). (Once 
again we discuss a two-dimensional cubic array, but the ideas are readily 
extended to three dimensions.) For small k’s, the constant-energy 
contours are circles in two dimensions (spheres in three). However, as 
k approaches a zone boundary, the corresponding energy increases very 
slowly (Fig. 22.3); consequently, the energy contours bulge toward the 
boundaries. Once the k vectors reach the zone boundary in the [10] 
and [01] directions, the corresponding energy contours terminate on 
the zone boundaries, which they approach at right angles. In Fig. 
22.8 the largest value of k and the largest energy allowed in the first 
zone are associated with the [11] direction and the corners of the lattice. 

In a crystal at 0°K electrons are all in the lowest possible energy
states; they fill all states with energy less than or equal to the Fermi
energy $E_F(0)$, while all levels with $E > E_F(0)$ are empty. In terms
of the Brillouin zone of Fig. 22.8, this corresponds to occupation of
all k states which lie inside the equienergy contour corresponding to
$E_F(0)$. This latter contour in k space is known as the Fermi surface.
In Fig. 22.8 the boundary of the shaded region pictures the Fermi
surface in the case in which electrons occupy somewhat less than half
the states in the Brillouin zone. For small k’s, where the free-electron
approximation is reasonable, the Fermi surface is spherical. As soon
as any k vectors approach the Brillouin zone boundaries, the shape of
the Fermi surface is modified.

Fig. 22.8 Equal-energy surfaces in k space. At 0°K all electrons are in
the lowest possible energy states; all levels with energy less than the
Fermi energy $E_F$ are filled. The energy surface associated with $E_F$ is
called the Fermi surface.

Also shown in Fig. 22.8 are a few energy contours in the second
Brillouin zone. Because of the energy gap at the zone boundary, these
contours are not continuous with contours in the first zone. In some
cases the lowest allowed energies in the second zone lie below the highest
energies in the first zone; in other cases the first zone is completely filled
before any states in the second zone are occupied. When the lowest
energy in the second zone lies below the highest in the first zone, the
two zones are said to overlap. In Fig. 22.9 are shown E-vs.-k curves
which lead to overlapping and to separate zone patterns. To see what
this implies for the Brillouin zones, imagine a crystal at 0°K from which
all conduction electrons have been removed so that the first Brillouin
zone is empty. Now adding a group of electrons would lead to a Fermi
surface like that in Fig. 22.10a. If we add more, the Fermi surface
becomes that in Fig. 22.10b. When we have introduced enough electrons
to fill the first Brillouin zone, the resulting Fermi surface depends
on whether the zones are overlapping or not (Fig. 22.10c). If they
overlap, some of the electrons occupy states in the second zone, because
they lie below the highest levels of the first zone.

Fig. 22.9 E-vs.-k curves for (a) nonoverlapping and (b) overlapping Brillouin
zones for a two-dimensional square lattice. The E-vs.-k curves differ in different
directions, with the maximum variation between the $k_{[10]}$ and $k_{[11]}$ curves. In
(a) the crosshatched range of energies is forbidden for every k direction, while in
(b) every energy is allowed for at least some k direction.

Fig. 22.10 When just enough electrons are present to fill the
first Brillouin zone, in the case of overlapping zones some of the
electrons spill over into an overlapping zone where lower energy
states are available.


22.6 Density of States 


We have approached Brillouin zones through the Kronig-Penney model
of a periodic array of square wells. In each allowed state of a square well
there can be at most two electrons with antiparallel spins. As we bring
square wells together, each energy level spreads into a band of energies;
with each band we associate a Brillouin zone. Thus, for each zone we
may have a maximum of two electrons per square well; when we extend
the theory to atoms, we associate each atom with a well and find that
two electrons per atom fill each Brillouin zone. For example, in a sodium
crystal of 1 cm$^3$ volume there are $2.4 \times 10^{22}$ atoms, each of which
contributes one electron to the first Brillouin zone for conduction electrons.
Since the zone can hold two electrons per atom, it is half filled.

For electrons with k small enough to lie far from the zone boundary
in Fig. 22.8, the electrons behave almost as if they were free, and Eq.
(21.11) represents a good approximation to the density-of-states function.
As k approaches $\pi/a$, a small increase in E leads to a large increase in the
number of k states (see Fig. 22.3) and the density-of-states function
increases more rapidly than the free-electron curve (Fig. 22.11a), leading
to a cusp. Once the zone boundary is reached, the only available
states lie toward the corners of the Brillouin zone and the density of
states decreases above the cusp, reaching zero when the zone is filled.
When two Brillouin zones overlap as in Fig. 22.9, the density-of-states
curves also overlap (Fig. 22.11b).

Thus far we have been discussing primarily the Brillouin zone 
associated with a simple cubic crystal. For other lattices the shapes 
of the Brillouin zones in k space are considerably different in detail, but 
most gross features are retained. In Fig. 22.12 the density-of-states 
function is shown for the body-centered cube and the face-centered cube. 
Of particular interest is the fact that the cusps lie at significantly different 
energies. 

Fig. 22.11 The density of states g(E) (abscissa) as a function of energy E
(ordinate) for the cases of (a) nonoverlapping bands and (b) overlapping
bands. The free-electron curve is shown for comparison.

Fig. 22.12 The density-of-states function g(E) for face-centered and
body-centered cubic crystals.

Density-of-states curves like those of Fig. 22.12 play an important
role in understanding alloy systems. An example is found in the copper-zinc
alloys (brass). Copper crystallizes in the face-centered cubic structure
with one free electron per atom. When a small amount of zinc is
added, the zinc atoms occupy Cu sites in the face-centered cubic structure;
however, each Zn atom brings two electrons to the Brillouin zone,
thus raising the Fermi level. If one assumes that the cusp for the
face-centered cubic structure occurs when the Fermi surface (based on a
free-electron model) first touches the Brillouin boundary, one finds that
the cusp is associated with 1.36 electrons per atom. This suggests that
when more than 36 percent of the atoms in the alloy are Zn, the Fermi
energy should rise rapidly as more Zn is added. A similar calculation
for the body-centered structure shows that the cusp occurs at 1.48
electrons per atom. Over the range from 1.36 to 1.48 the density of states
is falling for the face-centered cubic but rising for the body-centered
cubic structure. Thus, so far as electronic energy is concerned, the
body-centered cubic structure becomes increasingly stable. Experimentally
it is observed that brass changes from the face-centered cubic $\alpha$
phase to the body-centered cubic $\beta$ phase at an electron concentration
near 1.38. At higher electron concentrations the copper-zinc alloy
system exhibits still other phases.
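
The cusp positions of 1.36 and 1.48 electrons per atom can be checked with a short free-electron calculation: fill a sphere in k space until it first touches the zone boundary, compare its volume with the volume of the first zone, and multiply by the two electrons per atom that a full zone holds. The zone volumes and shortest boundary distances used below are the standard results for cubic lattices of cube edge a and are stated here as assumptions, since the text quotes only the final numbers.

```python
import math


def electrons_per_atom_at_contact(k_shortest, zone_volume):
    """Electrons per atom when a free-electron Fermi sphere of radius
    k_shortest first touches the boundary of a zone of given volume.
    A completely filled zone holds 2 electrons per atom."""
    sphere_volume = 4.0 / 3.0 * math.pi * k_shortest**3
    return 2.0 * sphere_volume / zone_volume


a = 1.0  # cube edge; the result does not depend on its value
lattices = {
    # name: (shortest k to the zone boundary, volume of the first zone)
    "simple cubic": (math.pi / a, (2.0 * math.pi / a) ** 3),
    "face-centered cubic": (math.sqrt(3.0) * math.pi / a, 32.0 * math.pi**3 / a**3),
    "body-centered cubic": (math.sqrt(2.0) * math.pi / a, 16.0 * math.pi**3 / a**3),
}

for name, (k_min, volume) in lattices.items():
    print(f"{name}: {electrons_per_atom_at_contact(k_min, volume):.3f} electrons per atom")
```

The three ratios reduce to $\pi/3$, $\sqrt{3}\,\pi/4$, and $\sqrt{2}\,\pi/3$, i.e. about 1.047, 1.36, and 1.48.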


22.7 Filled Bands and Holes 


If an electric field is applied to a crystal, electrons can achieve a net drift 
velocity with resulting current only if there are vacant allowed energy 
states available to them. For excellent conductors like copper, with one 
valence electron per atom, the first Brillouin zone is half filled, and there 
is a continuum of allowed energy states just above the Fermi surface to 
which electrons may be excited by the field. 

On the other hand, when a nonoverlapping band is filled, no allowed
energy states are accessible immediately above the Fermi surface; therefore,
electrons cannot gain energy from the applied field. As a result,
there is no net drift velocity and no current. If the band is not entirely
full, a few electrons may be able to gain energy and produce a small
current density. Let us consider the case in which a single electron per
unit volume is missing from an otherwise filled band. When this electron,
which we designate by i, is present, there is no current density
and so

$$-e\sum_{j} \mathbf{v}_j = -e\mathbf{v}_i - e\sum_{j \ne i}\mathbf{v}_j = 0 \tag{22.14}$$

or

$$-e\sum_{j \ne i}\mathbf{v}_j = +e\mathbf{v}_i \tag{22.14a}$$

In the absence of electron i, the summation on the left is the current
density due to all the electrons present. It is equal to the right side,
which we may interpret as a current density due to the motion of a single
positive charge, which we call a hole. A vacant quantum state in an almost
filled band acts like a positively charged particle, which is directly analogous
to an electron. It is a great simplification to be able to consider a current
to be carried by holes rather than focusing attention on the complex
motion of a host of electrons.
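
Equations (22.14) and (22.14a) can be checked on a toy band. In the sketch below the k states fill the first zone uniformly and the group velocity is taken as $v(k) = v_0 \sin ka$; this sinusoidal form, the value of $v_0$, and the function names are assumptions made only for illustration. The sum over the full band vanishes, and removing one electron leaves a current equal to that of a single charge +e moving with the velocity of the missing electron.

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge, C


def band_velocities(n_states, a, v0):
    """Velocities v(k) = v0 sin(ka) for n_states values of k spread evenly
    over the first zone (-pi/a, pi/a]. The sinusoidal band is an assumption."""
    ks = [(-math.pi + 2.0 * math.pi * (j + 1) / n_states) / a for j in range(n_states)]
    return [v0 * math.sin(k * a) for k in ks]


def electron_current(velocities, missing=None):
    """-e times the sum of velocities of all electrons present (per unit
    volume factors omitted); `missing` is the index of an empty state."""
    return -E_CHARGE * sum(v for j, v in enumerate(velocities) if j != missing)


velocities = band_velocities(n_states=100, a=3.0e-10, v0=1.0e5)
i = 24  # index of the vacant state (arbitrary choice)
print("filled band:      ", electron_current(velocities))
print("one state vacant: ", electron_current(velocities, missing=i))
print("+e * v_i:         ", E_CHARGE * velocities[i])
```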

The hole concept presents us with a way of explaining the positive
Hall coefficients observed for many metals. On the basis of band theory
one might at first expect that elements with $s^2$ outer electron
configurations, for example, Be, Mg, Ca, Zn, might be electrical insulators since
they have just enough electrons to fill the $s^2$ atomic level and the
corresponding band. However, in all these materials the p band overlaps
the s band considerably, so that there are p electrons as well as an equal
number of holes in the s band. To see that we may get a positive Hall
coefficient when we have equal numbers of positive and negative carriers,
we derive a relation for the Hall coefficient for the case in which we have
n electrons and p holes per unit volume. In this case we have for the
current density $J_x$ (Fig. 22.1)

$$J_x = -ne(v_n)_x + pe(v_p)_x = (ne\mu_n + pe\mu_p)E_x \tag{22.15}$$

where the subscripts p and n refer to holes and electrons respectively.
If $E_y$ represents the y component of the transverse Hall field due to $B_z$,
holes carry net y current density $(J_p)_y = pe\mu_p[E_y - (v_p)_x B_z]$, while for
electrons $(J_n)_y = ne\mu_n[E_y - (v_n)_x B_z]$. However, the equilibrium

$$J_y = (J_p)_y + (J_n)_y$$

must be zero, and therefore

$$pe\mu_p[E_y - (v_p)_x B_z] + ne\mu_n[E_y - (v_n)_x B_z] = 0 \tag{22.16}$$

from which

$$E_y = \frac{[p\mu_p (v_p)_x + n\mu_n (v_n)_x]B_z}{p\mu_p + n\mu_n} = \frac{(p\mu_p^2 - n\mu_n^2)E_x B_z}{p\mu_p + n\mu_n} \tag{22.16a}$$

By definition, $R_H = E_y/J_x B_z$; use of Eqs. (22.16a) and (22.15) yields

$$R_H = \frac{1}{e}\,\frac{p\mu_p^2 - n\mu_n^2}{(p\mu_p + n\mu_n)^2} \tag{22.17}$$


In the case of metals with an $s^2$ atomic configuration the mobility
of the s-band holes is considerably greater than that of the p-band
electrons. Since the number of holes and the number of electrons are equal,
$R_H$ is positive for these metals.
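
A direct transcription of Eqs. (22.15) and (22.17) makes the sign argument concrete. The carrier densities and mobilities in the example are hypothetical round numbers, not data for any particular metal, and the function names are introduced only for this sketch.

```python
E_CHARGE = 1.602176634e-19  # elementary charge, C


def conductivity(n, p, mu_n, mu_p):
    """sigma = e (n mu_n + p mu_p), from Eq. (22.15); SI units."""
    return E_CHARGE * (n * mu_n + p * mu_p)


def hall_coefficient(n, p, mu_n, mu_p):
    """Two-carrier Hall coefficient of Eq. (22.17), in m^3/C."""
    return (p * mu_p**2 - n * mu_n**2) / (E_CHARGE * (p * mu_p + n * mu_n) ** 2)


# Hypothetical values (assumptions): equal carrier numbers, holes more mobile.
n = p = 1.0e28               # carriers per m^3
mu_n, mu_p = 1.0e-3, 2.0e-3  # mobilities, m^2/V-s

print("sigma =", conductivity(n, p, mu_n, mu_p), "mho/m")
print("R_H   =", hall_coefficient(n, p, mu_n, mu_p), "m^3/C")
```

With equal numbers of carriers the sign of $R_H$ is set entirely by which mobility is larger, and with p = 0 the expression reduces to the single-carrier result $-1/ne$.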

The solid elements Bi, As, Sb, Se, and Te have primarily covalent 
binding, but the conduction band overlaps the valence band slightly, so 
that a small number of electrons go to the conduction band, leaving an 
equal number of holes in the valence band. Because of the small number 
of carriers, these solids are known as semimetals. Even at 0°K they are 
conductors, but not good ones. The sign of the Hall coefficient depends 
[by Eq. (22.17)] on which type of carrier has the higher mobility; for Bi
it is electrons, but for As and Sb it is holes. 


22.8 Transition Metals 


Several members of the group of metals with incompletely filled d
subshells are of particular economic and scientific interest. Among
these transition elements are iron, cobalt, and nickel, which are
ferromagnetic; their outer electron configurations are respectively $4s^23d^6$,
$4s^23d^7$, and $4s^23d^8$ for isolated atoms. However, in the solid state
the 4s level is spread into a very broad band (Fig. 22.13), which
strongly overlaps the much narrower 3d band. Many of the unique
properties of transition metals are associated with the overlap of the
s and d bands.

Fig. 22.13 Schematic diagram showing the overlap of the 3d and
4s bands in metallic nickel.

In metallic nickel, on the average, 0.6 electron per atom is in
the s band and 9.4 in the d band. Most of the conductivity of Ni is due
to the s-band electrons: in atomic language the d electrons are
involved in bonds between atoms, while in free-electron language the
d electrons have a high effective mass. Further, the s electrons
accelerated by an electric field undergo lattice collisions which may scatter
them into vacant states in the d band. As a consequence of the ease
with which s electrons are scattered out of the band, nickel (and the
other transition metals) has higher resistivity than elements with
filled d shells; for Ni the resistivity is more than 4 times that for Cu
with its filled 3d band and one 4s electron.

Of the average 9.4 electrons per nickel atom in the 3d band,
roughly 5 have one spin and 4.4 the antiparallel one. As a consequence
of the 0.6 unpaired spin per atom, Ni atoms in a crystal have a net
magnetic moment of approximately 0.6 Bohr magneton. Iron and
cobalt atoms have higher average magnetic moments, viz., 1.7 $\mu_B$ for
Co and 2.2 $\mu_B$ for Fe. Heisenberg has explained the ferromagnetic
properties of these elements in terms of an exchange interaction
between electrons which favors parallel orientations of the unpaired
electron spins of neighboring atoms. Zener has proposed a different
theory with an interaction similar to the one that favors unpaired
electrons in atoms having parallel spins (Hund’s rule).


Problems 


1. In a Hall experiment on silver a current of 25 A is passed through
a long foil which is 0.1 mm thick (in the direction of B) and 3 cm wide.
Find the Hall voltage produced across the width by a magnetic induction
of 1.4 Wb/m$^2$. If the conductivity of silver is $6.8 \times 10^{7}$ mho/m,
estimate the Hall angle and the mobility of electrons in silver.

Ans: 29 $\mu$V; $8 \times 10^{-3}$ rad; $5.7 \times 10^{-3}$ m$^2$/V-s
2. A conductor has 5 X 10" electrons and 8 X 102° holes per cubic 
meter. If $\mu_p$ = 0.05 and $\mu_n$ = 0.09 m$^2$/V-s, calculate the conductivity
and the Hall coefficient.

3. From the Hall coefficient (Table 22.1) calculate the mobility of
electrons in copper. From the mobility compute the relaxation time $\tau$
by use of Eq. (21.6) and show that the resulting value is compatible with
the value of $2 \times 10^{-14}$ s quoted in Sec. 21.7. The latter was calculated
by assuming 1 electron per atom, while the Hall coefficient suggests 1.3.
How can one explain such good agreement in $\tau$ under the circumstances?
(The conductivity of Cu is $6 \times 10^{7}$ mho/m.)

4. Draw a diagram in k space showing the first seven Brillouin zones 
for a two-dimensional square lattice. 


5. Draw a diagram showing the first three Brillouin zones for a two- 
dimensional rectangular lattice with b = 2a. 


6. Show that in a two-dimensional square lattice the free-electron
model predicts that an electron with its k vector terminating at the
corner of the first Brillouin zone has twice the energy of an electron with
a $k_{[10]}$ vector terminating on the zone boundary. In a simple cubic
lattice what is the ratio of the lengths of the longest and shortest k
vectors which reach the boundary of the first Brillouin zone? What
is the corresponding energy ratio for free electrons?

7. An electron moves in the [43] direction in a two-dimensional square
lattice with a = 3.6 Å. For this direction what is the distance in k
space to the boundary of the first Brillouin zone? The second zone?
The third zone? Calculate the energy of a free electron with each of
these values of k.

8. Assume that the constant-energy surfaces for a simple cubic lattice
are spheres until they first reach the boundary of the first Brillouin
zone and that the cusp of the density-of-states function occurs at the
energy corresponding to the first contact. Show that under these
conditions the cusp comes when the zone holds 1.047 electrons per atom.

9. Copper crystallizes in a face-centered cubic structure. Calculate
the lattice constant a for the Cu crystal. Find the shortest k vector
which reaches the boundary of the first Brillouin zone. To what
free-electron kinetic energy does this correspond? (There are $8.47 \times 10^{28}$
atoms/m$^3$ in Cu.)

Ans: 3.61 Å; 1.61 Å$^{-1}$; 9.9 eV

10. Find the volume of the first Brillouin zone for a face-centered cubic
crystal of lattice vector a. Calculate the length of the shortest k vector
which reaches the zone boundary. If the full zone holds 2 electrons per atom
and the cusp in the density-of-states function occurs for the shortest k
to reach the zone boundary, show that the cusp comes at 1.36 electrons
per atom.

Ans: $32\pi^3/a^3$; $\sqrt{3}\,\pi/a$


chapter twenty-three 


Semiconductors 


Metallic conductors have free carriers responsive to an applied electric
field, even at 0°K. Indeed, for pure metals the conductivity is greatest at low 
temperatures, since the mean free path of electrons is least reduced by lattice 
collisions. On the other hand, in good electrical insulators virtually all elec- 
trons are firmly held in atomic or valence bonds so that the conductivity is 
extremely low. Between these extremes there is an increasingly important 
group of substances, the semiconductors, which have almost no free carriers 
at low temperature but which have, at room temperatures, a modest conduc- 
tivity that increases with rising temperature. 


23.1 Semiconducting Materials 


A semiconductor is a material with a filled valence band and an empty
conduction band at 0°K but with the energy gap between the bands
small enough so that thermal excitation can create usable carrier densities
at operating temperature. There are many semiconducting crystals, but
silicon and germanium are of central importance in practice. Many
semiconductors are binary chemical compounds. Some are composed
of one element from the third and one from the fifth column of the periodic
table. Such materials are called III-V compounds; examples are InAs,
InSb, and GaP. Still others are II-VI compounds such as CdS, CdTe,
and ZnS. These compounds frequently crystallize in the zinc blende
structure (Fig. 21.4), which is identical with the diamond structure except
that the two face-centered sublattices are occupied by different elements.
One of the earliest practical uses of semiconductors was in rectifiers
involving oxides of copper and aluminum.


The valence and conduction bands of the pure covalent group IV
materials result from the hybridization of the valence orbitals of the free
atoms (Sec. 21.4). In the crystalline state there is a mixing of the
broadened s and p atomic levels in such a way that the eight (2s and 6p)
states fall into two bands (each involving four original states), separated
by a forbidden energy region (Fig. 23.1). In a crystal the lower (valence)
band is filled at 0°K, while the upper (conduction) band is empty. The
energy gap decreases with atomic number; for C (diamond) it is 5.3 eV,
for Si 1.10 eV, for Ge 0.72 eV, and for gray tin 0.01 eV. (The precise
width of the gap is different for k vectors in various directions; the values
above are approximate minimum gaps.)

Fig. 23.1 Band structure of silicon as a function of nearest-neighbor separation R (Å).
At normal spacing $R_0$ the 3s and 3p bands have crossed, and there is an energy gap
of 1.1 eV. The structure is similar for the 4s and 4p bands of germanium. (From
H. D. Young, “Fundamentals of Optics and Modern Physics.” Copyright 1968.
McGraw-Hill Book Company. Used by permission.)


23.2 Intrinsic Semiconductors 


A chemically pure specimen of any semiconducting material has proper- 
ties which are characteristic of the material alone. Such a material is an 
intrinsic semiconductor in contrast to an extrinsic semiconductor, the 
conducting properties of which are intimately related to “impurities” 
deliberately introduced. At 0°K the energy bands of an intrinsic semi- 
conductor, shown descriptively in Fig. 23.2a, consist of a filled valence 
band and an empty conduction band, separated by an energy gap $E_g$.

At room temperature a small fraction of the valence electrons are 
thermally excited to the conduction band. The conductivity of the 
semiconductor is due to these electrons and to the equal number of holes 
left in the valence band. To find the number of electrons per unit volume 


Area under N (E) 
Conduction cares iF 
band Electrons 
ree r ~ |¢----4------ | 
— / 


N (E) = g (E) f (É) 


Eg = 
/ 
Ee m AERUS ES Holes 
lence 
haa Na (E) = [1 — f (E)] g(E 
Area under Nj, (E) 
0 curve = p 
(a) (b) (O) (a) 


Fig. 23.2 (a) Schematic diagram of band structure in a particular direction fi 
semiconductor; (b) the density of states g(E); (c) the Fermi function JE); and (d) 
the functions N (E) and N,(E) for the semiconductor of (a). 
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in the conduction band.at temperature T, we need to know the density- 
of-states function for ‘the band and the F ermi ‘energy Ep. Since we 
shall always be dealing with the situation in which the conduction band 
has few electrons compared to its capacity, it is a reasonable approxima- 
tion to assume the free-electron functions of Eq. (21.11) (Fig. 23.20) 


g(E) dE = eee VE— Ede 


(23.1) 
where we have taken cognizance of the fact that the bottom of the con- 
duction band is now at E, rather than at 0 and we have replaced the free- 
electron mass m by the effective mass. The probability that any state 
in the conduction band is occupied is given by the Fermi function {Eq. 
(20.16a) and Fig. 23.2c], and the number of electrons per unit volume in 
the conduction band with energy between E and E + dE is (Fig. 23.2d) 


2irm VEE, 
N(E) dE = g(E)f(E) dE = —— zarg AE (23.2) 


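where the Fermi energy E_F remains to be found.
To obtain the number n of electrons per unit volume in the conduction
band, we must integrate N(E) dE:

    n = \int_{E_c}^{\infty} N(E) \, dE
      = \frac{2^{7/2} \pi m^{*3/2}}{h^3} \int_{E_c}^{\infty} (E - E_c)^{1/2} e^{-(E - E_F)/kT} \, dE
      = \frac{2^{7/2} \pi m^{*3/2}}{h^3} e^{-(E_c - E_F)/kT} \int_{E_c}^{\infty} (E - E_c)^{1/2} e^{-(E - E_c)/kT} \, dE
      = 2 \left( \frac{2 \pi m^* kT}{h^2} \right)^{3/2} e^{-(E_c - E_F)/kT}        (23.3)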


Here we have made use of the fact that E_c - E_F is many kT for useful
semiconductors in dropping the 1 in the denominator. If m* = m, we
have the useful relation

    n = 4.83 \times 10^{21} \, T^{3/2} e^{-(E_c - E_F)/kT} \ \mathrm{m^{-3}}        (23.3a)

and, for T = 300°K, n = 2.5 × 10^25 e^{-(E_c - E_F)/kT} m^-3.
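The numerical coefficient in Eq. (23.3a) can be checked directly from the closed form of Eq. (23.3). The following is a minimal sketch, assuming m* = m and modern SI values of the constants; the function names are illustrative and do not come from the text.

```python
import math

# Physical constants in SI units
H = 6.62607015e-34      # Planck constant, J s
K_B = 1.380649e-23      # Boltzmann constant, J/K
M_E = 9.1093837015e-31  # free-electron mass, kg
EV = 1.602176634e-19    # joules per electron volt


def conduction_prefactor(T, m_eff=M_E):
    """Return 2 (2 pi m* k T / h^2)^(3/2) in m^-3, the factor in Eq. (23.3)."""
    return 2.0 * (2.0 * math.pi * m_eff * K_B * T / H**2) ** 1.5


def electron_density(T, gap_to_fermi_eV, m_eff=M_E):
    """n from Eq. (23.3); gap_to_fermi_eV is E_c - E_F in eV (several kT)."""
    kT_eV = K_B * T / EV
    return conduction_prefactor(T, m_eff) * math.exp(-gap_to_fermi_eV / kT_eV)


coeff = conduction_prefactor(1.0)   # coefficient multiplying T^(3/2)
n300 = conduction_prefactor(300.0)  # prefactor at 300 K
print(f"{coeff:.3e} m^-3 K^-3/2, {n300:.3e} m^-3 at 300 K")
```

With these constants the coefficient comes out close to 4.83 × 10^21 and the 300°K prefactor close to 2.5 × 10^25 m^-3, in line with the values quoted above.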
To find the Fermi energy E_F, we compute the number of holes per
cubic meter and equate it to n. The probability that an allowed state
at an energy E below E_F is unoccupied is given by

    1 - f(E) = \frac{1}{e^{(E_F - E)/kT} + 1}


We now assert, appealing to Fig. 22.10, that the density-of-states
function at the top of the valence band is of shape similar to that at the
bottom of the conduction band (Fig. 23.2b), described analytically by
substituting \sqrt{E_v - E} for \sqrt{E - E_c} in Eq. (23.1) and the appropriate
effective mass m_h* for m*, where E_v is the energy of the top of the valence
band. If N_h(E) dE represents the number of holes
per unit volume with energy between E and E + dE and p is the number
of holes per unit volume, we have, in close analogy with Eq. (23.3),

    p = \int_{-\infty}^{E_v} N_h(E) \, dE
      = \frac{2^{7/2} \pi m_h^{*3/2}}{h^3} \int_{-\infty}^{E_v} \sqrt{E_v - E} \; e^{-(E_F - E)/kT} \, dE
      = 2 \left( \frac{2 \pi m_h^* kT}{h^2} \right)^{3/2} e^{-(E_F - E_v)/kT}        (23.4)

From p = n for an intrinsic semiconductor,

    m_h^{*3/2} e^{-(E_F - E_v)/kT} = m^{*3/2} e^{-(E_c - E_F)/kT}

    E_F = \frac{E_v + E_c}{2} + \frac{3kT}{4} \ln \frac{m_h^*}{m^*}        (23.5)

For silicon, germanium, and many other semiconductors the last 
term of Eq. (23.5) is small compared with the first, and an error of only 
a few hundredths of an electron volt is introduced by neglecting it. 
For such a semiconductor the Fermi level lies at the middle of the energy
gap between the top of the valence band and the bottom of the conduction
band. Therefore, if E_g is as much as 0.5 eV, e^{(E_c - E_F)/kT} and e^{(E_F - E_v)/kT}
are at least e^9; since e^9 >> 1, dropping the 1 in the Fermi functions above
is well justified.
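To see how small the last term of Eq. (23.5) is, one can evaluate the shift of the Fermi level from midgap for a given effective-mass ratio. This is only a sketch: the 1.1-eV gap and the mass ratio of 0.5 are hypothetical inputs chosen for illustration, and the function name is not from the text.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K


def intrinsic_fermi_level(E_v, E_c, T, mass_ratio):
    """E_F from Eq. (23.5); mass_ratio is m_h*/m* (hole over electron mass)."""
    midgap = 0.5 * (E_v + E_c)
    shift = 0.75 * K_B_EV * T * math.log(mass_ratio)
    return midgap + shift


# Hypothetical example: 1.1-eV gap, hole mass half the electron mass, 300 K.
E_F = intrinsic_fermi_level(0.0, 1.1, 300.0, 0.5)
shift = E_F - 0.55
print(f"shift from midgap = {shift:.4f} eV")
```

For this choice the shift is about one-hundredth of an electron volt, consistent with the statement that neglecting the term introduces only a small error.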


23.3 Conductivity 


For germanium E_g = 0.72 eV, and at room temperature, where
kT = 1/40 eV, n and p are approximately 2.5 × 10^25 e^{-14.4} ≈ 3 × 10^19 m^-3.
The number of atoms per unit volume is 4.45 × 10^28 m^-3, so that there
is one electron in the conduction band and one hole in the valence
band for every 1.5 × 10^9 atoms (compared with one electron per atom
in copper). In spite of the small number of carriers the conductivity
of Ge is 2 mhos/m compared with 6 × 10^7 for Cu. The conductivity
is given by [see Eq. (22.4)]

    \sigma = n e \mu_n + p e \mu_p        (23.6)

where μ_n and μ_p are the electron and hole mobilities. The relatively large
conductivity of Ge arises
primarily from the fact that the mobility of electrons in Ge (~0.4 m^2/V-s)
is about 100 times that in Cu. The mobility of holes in Ge is roughly
0.2 m^2/V-s, so that electrons carry about two-thirds of the current
while holes carry the remainder.
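Equation (23.6) and the two-thirds estimate are easy to check numerically. The sketch below takes the carrier density as an input and uses the rounded value 3 × 10^19 m^-3 and the approximate mobilities quoted in the text; evaluating 2.5 × 10^25 e^{-14.4} directly gives a somewhat smaller density of about 1.4 × 10^19 m^-3, so the result should be read as an order-of-magnitude figure. The function names are illustrative.

```python
E_CHARGE = 1.602e-19  # elementary charge, C


def conductivity(n, p, mu_n, mu_p):
    """Eq. (23.6): sigma = n e mu_n + p e mu_p, in mho/m for SI inputs."""
    return E_CHARGE * (n * mu_n + p * mu_p)


def electron_share(n, p, mu_n, mu_p):
    """Fraction of the current carried by electrons."""
    return n * mu_n / (n * mu_n + p * mu_p)


n_i = 3e19  # intrinsic carrier density quoted for Ge, m^-3
sigma_ge = conductivity(n_i, n_i, 0.4, 0.2)
share = electron_share(n_i, n_i, 0.4, 0.2)
print(f"sigma = {sigma_ge:.2f} mho/m, electron share = {share:.3f}")
```

The result is a few mhos per meter, the same order as the 2 mhos/m cited for Ge, with electrons carrying two-thirds of the current.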


One method of determining the energy gap E_g in an intrinsic
semiconductor is to measure σ as a function of T over a broad range.
From Eqs. (23.3a), (23.4), and (23.5) we have the approximate result
that

    n = p = 4.83 \times 10^{21} \, T^{3/2} e^{-E_g/2kT}        (23.7)

Combining this with Eq. (23.6) yields

    \sigma = 4.83 \times 10^{21} e (\mu_n + \mu_p) T^{3/2} e^{-E_g/2kT}        (23.7a)

A plot of ln(σ/T^{3/2}) as a function of 1/T leads to an approximately straight
line of slope -E_g/2k. Thus E_g can be computed directly from the measured
slope. The deviation from linearity of the curve is due, in part
at least, to the fact that the mobilities are not independent of
temperature.

An optical method for determining E_g involves measurement
of the linear absorption coefficient of a thin film of the semiconductor
[...] where I_0 is the intensity at x = 0 and [...]. A plot of α as a function of λ
for a particular germanium specimen is shown in Fig. 23.3. Here the
radiation is in the infrared region. As λ decreases [...]
absorption begins to increase, but at λ = 1.7 μ it rises abruptly. The
[...] other possible processes. Because of the presence of free electrons in
the conduction band, most semiconductors exhibit a metallic luster, but


Fig. 23.3 Linear absorption coefficient (in cm^-1) of a lightly
doped n-type germanium crystal as a function of wavelength (in μ)
of infrared radiation.


the number per unit volume is insufficient to make the semiconductor 
an excellent reflector. 
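Both routes to E_g described in this section can be sketched numerically. The example below is illustrative only: the "measurements" are synthetic values generated from Eq. (23.7a) with temperature-independent mobilities (an idealization the text warns against), and the optical estimate simply converts the 1.7-μ absorption edge to a photon energy through E = hc/λ. All function names are hypothetical.

```python
import math

K_B_EV = 8.617333262e-5   # Boltzmann constant, eV/K
HC_EV_MICRON = 1.23984    # h*c in eV * micron


def sigma_model(T, E_g, mobility_sum=0.6):
    """Eq. (23.7a) with constant mobilities (an idealization)."""
    e = 1.602e-19
    return 4.83e21 * e * mobility_sum * T**1.5 * math.exp(-E_g / (2 * K_B_EV * T))


def gap_from_conductivity(temps, sigmas):
    """Least-squares slope of ln(sigma / T^1.5) against 1/T, times -2k."""
    xs = [1.0 / T for T in temps]
    ys = [math.log(s / T**1.5) for T, s in zip(temps, sigmas)]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    slope = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys)) / sum(
        (x - x_mean) ** 2 for x in xs
    )
    return -2.0 * K_B_EV * slope


def gap_from_edge(wavelength_microns):
    """Photon energy (eV) at the absorption edge, E = hc / lambda."""
    return HC_EV_MICRON / wavelength_microns


temps = [300.0, 350.0, 400.0, 450.0, 500.0]      # synthetic temperatures, K
sigmas = [sigma_model(T, 0.72) for T in temps]   # synthetic conductivities
gap_thermal = gap_from_conductivity(temps, sigmas)
gap_optical = gap_from_edge(1.7)
print(f"thermal: {gap_thermal:.3f} eV, optical: {gap_optical:.3f} eV")
```

The fit recovers the 0.72 eV used to generate the data, and the 1.7-μ edge corresponds to roughly 0.73 eV, so the two methods agree for germanium to within about 0.01 eV.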


23.4 Extrinsic Semiconductors 


In practice the most important semiconductors are those in which certain
atomic impurities have been incorporated to introduce new allowed
energy levels. To be specific, consider the result of adding a few parts
per million of valence-5 arsenic to a silicon crystal. Each As atom
occupies one of the Si sites, bringing to it five electrons when only four
are required for covalent bonds with nearest neighbors (Fig. 23.4a).
The silicon crystal is a medium of high (~12) dielectric constant; partly
as a result, the fifth electron of the As atom has a binding energy of only
0.049 eV. By virtue of thermal excitation, this electron is rarely held
by the impurity atom and is in the conduction band most of the time.
For this reason the As atom is referred to as a donor. A semiconductor
doped by adding donor atoms is said to be of n type, because the
conduction is primarily by electrons from the donors. In the energy
diagram (Fig. 23.5a) donor atoms introduce isolated donor levels near the
bottom of the conduction band.

Fig. 23.4 (a) An arsenic atom brings five electrons, of which four are
tied in covalent bonds; the fifth is readily excited into the conduction
band. (b) An acceptor atom brings only three electrons; a fourth is
readily captured.

Fig. 23.5 Isolated levels introduced into a crystal at sites occupied
by (a) donor atoms and (b) acceptor atoms.

When a semiconductor is doped by adding a few parts per million
of a valence-3 element such as indium, the impurity atom brings three
electrons but occupies a site calling for four covalent bonds. Such an
atom can readily capture an electron (Fig. 23.4b), thereby establishing
the fourth bond; the atom serves as an acceptor. The captured electron
is not available for conduction, but there remains a hole in the valence
band from which the electron was excited. A material doped with
acceptors conducts primarily by holes and is called a p-type
semiconductor. Acceptors introduce isolated energy levels in the forbidden
band about 0.01 eV above the top of the valence band (Fig. 23.5b).
Table 23.1 shows the energy of levels introduced in Ge and Si by light
dopings of common donor and acceptor atoms.


Table 23.1 Levels introduced by light dopings

            Donor ionization energy      Acceptor ionization energy
Host        E_c - E_d, eV                E_a - E_v, eV
crystal     P        As       Sb         B        Al       Ga       In

Ge          0.0120   0.0127   0.0096     0.0104   0.0102   0.0108   0.0112
Si          0.045    0.049    0.039      0.045    0.057    0.065    0.16

Addition of either donors or acceptors increases the conductivity
above the intrinsic level. In the n-type material, electrons are majority
carriers, while holes are minority carriers. The situation is reversed in
p-type conductors, where the majority carriers are holes.


23.5 The Fermi Level in Extrinsic Semiconductors 


Adding donors to an intrinsic material raises the Fermi level above the
middle of the gap by an amount which depends both on the doping and
on the temperature. To see how and why the Fermi level shifts with
temperature, consider an n-type material at 0°K, when the crystal is
in its ground state. All donor levels are occupied, and there are no
electrons in the conduction band. Since the occupation index f(E) is 1
up to the donor levels and 0 at the conduction band, the Fermi level
must lie somewhere in the range E_d < E_F < E_c. It is not typically
midway between E_d and E_c, because the density-of-states functions lack
the symmetry shown by the intrinsic semiconductor. As T is increased,
the donors are rapidly excited, and soon half of the donor states are
emptied, at which temperature the Fermi level coincides with the donor
levels. As the temperature rises further, electrons are excited from the
valence band. Eventually the number of electrons from donors is a
trivial fraction of the electrons in the conduction band. At this point
the Fermi level has descended to almost the center of the energy gap
(Fig. 23.6) and the semiconductor behaves essentially as though it were
intrinsic. Similarly, in a p-type material the Fermi level moves from
between E_v and E_a at 0°K to (E_v + E_c)/2 at high temperature.

As an example, let us compute the Fermi level for a Ge crystal
doped with gallium at the rate of 3.7 × 10^22 atoms/m^3 at 300°K, where
kT = 0.026 eV. By conservation of charge, the sum of the number of
electrons in the conduction band and the number of ionized acceptors
is equal to the number of holes in the valence band; therefore, if we let
E_v = 0,

    2.5 \times 10^{25} e^{-(0.72 - E_F)/0.026} + \frac{3.7 \times 10^{22}}{e^{(E_a - E_F)/0.026} + 1} = 2.5 \times 10^{25} e^{-E_F/0.026}

The first term is negligible, and essentially all the acceptors are ionized,
so that

    3.7 \times 10^{22} = 2.5 \times 10^{25} e^{-E_F/kT}

    1.48 \times 10^{-3} = e^{-E_F/kT}    or    E_F = 0.17 eV

In this case the Fermi level lies 0.17 eV above the top of the valence
band and roughly one-fourth the distance to the bottom of the
conduction band.

Fig. 23.6 Position of the Fermi level as a function of temperature (°K) for
both n- and p-type germanium of various impurity concentrations N (m^-3). (From
L. V. Azároff and J. J. Brophy, "Electronic Processes in Materials."
Copyright 1963. McGraw-Hill Book Company. Used by permission.)
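The charge-balance condition of this example can also be solved numerically without dropping any terms. The sketch below uses bisection and makes two assumptions that go beyond the worked example: the acceptor level is taken as 0.0108 eV above the valence band (the Ga entry for Ge in Table 23.1), and the acceptor occupation is described by a simple Fermi factor with no degeneracy correction. The function names are illustrative.

```python
import math

KT = 0.026       # eV at 300 K, as in the example
E_GAP = 0.72     # eV, germanium
N_PREF = 2.5e25  # m^-3, prefactor of Eq. (23.3a) at 300 K with m* = m
N_ACC = 3.7e22   # m^-3, gallium density in the example
E_ACC = 0.0108   # eV above E_v (assumed acceptor level, Ga in Ge)


def charge_imbalance(E_F):
    """Negative minus positive charge density (per m^3), with E_v = 0."""
    n = N_PREF * math.exp(-(E_GAP - E_F) / KT)
    ionized = N_ACC / (math.exp((E_ACC - E_F) / KT) + 1.0)
    p = N_PREF * math.exp(-E_F / KT)
    return n + ionized - p


def solve_fermi_level(lo=0.0, hi=E_GAP, tol=1e-9):
    """Bisection; charge_imbalance increases monotonically with E_F."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if charge_imbalance(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


E_F = solve_fermi_level()
print(f"E_F = {E_F:.3f} eV above the top of the valence band")
```

The full solution lands within a few thousandths of an electron volt of the 0.17 eV obtained above, which supports neglecting the electron term and treating the acceptors as fully ionized.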

For a doped (or impurity) semiconductor, the carrier densities n 
and p are not equal, but their product is the same as the corresponding 
product for an intrinsic semiconductor so long as E_F is several kT from
both conduction and valence bands. Then

    np = (4.83 \times 10^{21})^2 \, T^3 e^{-E_g/kT} = (np)_{intrinsic}        (23.8)


Even when a crystal is doped by adding acceptors, there are always 
some donor levels present. However, as long as the density of acceptors 
exceeds that of donors, the donor electrons fill acceptor levels and the 
result is a p-type material with an effective acceptor density given by 
the difference in densities of acceptor and donor levels. Thus donors 
and acceptors can “compensate” each other. In fact, it is easier to 
fabricate specimens with equal densities of donors and acceptors than 
to produce them free from impurities; a doped crystal with intrinsic 
behavior is said to be compensated. If N_d^+ and N_a^- are the densities
of ionized donors and acceptors respectively, the condition for charge
neutrality requires that

    n + N_a^- = p + N_d^+        (23.9)


The temperature variation of n and p for an n-type semiconductor 
is shown in Fig. 23.7. Once the Fermi level lies a few kT below the 
donor levels, essentially all the donors are ionized, but over a considerable 
temperature range few electrons are excited from the valence band. Over 
this exhaustion range the conductivity σ varies much less rapidly with
temperature than it does in the impurity range (low temperatures) or 
the intrinsic range (high temperatures). For Ge the exhaustion region 
commonly ends about 400°K, while for Si it is about 500°K. Above 
these temperatures the semiconductors behave essentially intrinsically, 
and the purposes of doping are no longer achieved. 


Fig. 23.7 Variation of the electron and hole concentrations
in an n-type semiconductor with the reciprocal of
the absolute temperature; the impurity, exhaustion, and intrinsic
ranges are marked. (From L. V. Azároff and
J. J. Brophy, "Electronic Processes in Materials."
Copyright 1963. McGraw-Hill Book Company. Used by
permission.)


23.6 The p-n Junction


The interface between a p-type region and an n-type region is called a 
p-n junction. The transition region is typically about 1 μ in width and
is produced in a single host crystal. (If one attempts to make a p-n
junction by pressing a piece of p material against one of n material,
such a plethora of imperfections and associated energy levels in the
forbidden region occur that the desired electrical properties of the junction
are very unlikely.) For the junctions we discuss below we shall assume
that the junction width is small compared with the average distance a 
minority carrier (either electron or hole) diffuses in the crystal before it 
recombines with a majority carrier. 

The equilibrium energy diagram for a p-n junction is shown in
Fig. 23.8, where for convenience we place the arbitrary zero for energy
at the top of the valence band in the n region, and where ΔE is then
the energy corresponding to the top of the valence band of the p region.
To see how such a diagram arises, imagine an ideal n crystal and an
ideal p crystal of the same material brought together. When
equilibrium is established, the Fermi levels coincide. This is achieved by
electron transfer from the n material to the p, giving rise to a double
layer of charge in the junction region with electrons filling acceptor
levels on the p side and leaving ionized donors on the n side. This
leads to a potential-energy difference ΔE between the n and p regions.
For an abrupt junction, one in which the transition region is narrow,
the fact that the number of electrons gained by the p side is equal to
the number lost by the n material requires that

    N_d X_n = N_a X_p        (23.10)


where X_n and X_p are the widths of the junction in the n and p regions
respectively, while N_d and N_a are the donor and acceptor densities.
In writing Eq. (23.10) we are assuming that all donors in X_n have
contributed electrons to fill all acceptor states in X_p, an approximation
which is good for an abrupt junction. Over the entire width of the
junction there are a reduced number of carriers, and we speak of it as a
depletion layer.

Fig. 23.8 Energy diagram for an unbiased n-p junction.

The resultant charge density ρ in the region 0 to X_n is N_d e; associated
with it is an electric field E and displacement D. By ∇·D = ∇·κε_0 E = ρ,
we have for 0 < x < X_n

    \frac{dE}{dx} = \frac{N_d e}{\kappa \epsilon_0}        (23.11a)

or

    E = \frac{N_d e x}{\kappa \epsilon_0}        (23.11b)

since E = 0 at x = 0 (see upper part, Fig. 23.8). For 0 < x < X_1
(where X_1 = X_n and X_2 = X_n + X_p)
there is an electric intensity E_n = N_d e x/κε_0 to the right. A similar
treatment for the region X_1 < x < X_2 leads to electric intensity
E_p = N_a e(X_2 - x)/κε_0, also to the right. (In graphs like Fig. 23.8 the
ordinate is always electron energy.) In the range 0 < x < X_1, the
electron energy rises by \int_0^{X_1} (e^2 N_d x \, dx)/κε_0 and for X_1 < x < X_2 by
\int_{X_1}^{X_2} [e^2 N_a (X_2 - x) \, dx]/κε_0, so that

    \Delta E_n = \frac{N_d e^2 X_n^2}{2 \kappa \epsilon_0}        (23.12a)

and

    \Delta E_p = \frac{N_a e^2 X_p^2}{2 \kappa \epsilon_0}        (23.12b)

From Eqs. (23.10) and (23.12) one sees that heavy dopings (great N_d
and N_a) lead to narrow junctions and light dopings to broad ones.
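The dependence of junction width on doping can be made concrete by solving Eqs. (23.10) and (23.12) for X_n and X_p, given the total barrier ΔE = ΔE_n + ΔE_p. The following sketch rests on assumed inputs: a dielectric constant of 16, a barrier of 0.4 eV, and two symmetric doping levels, none of which come from the text. Eliminating X_p gives X_n^2 = 2κε_0 ΔE N_a / [e^2 N_d (N_a + N_d)].

```python
import math

E_CHARGE = 1.602e-19  # elementary charge, C
EPS0 = 8.854e-12      # permittivity of free space, F/m


def depletion_widths(delta_E_eV, N_d, N_a, kappa):
    """Return (X_n, X_p) in meters from Eqs. (23.10) and (23.12).

    delta_E_eV is the total barrier Delta E_n + Delta E_p in eV.
    """
    delta_E = delta_E_eV * E_CHARGE  # joules
    X_n = math.sqrt(
        2.0 * kappa * EPS0 * delta_E * N_a / (E_CHARGE**2 * N_d * (N_a + N_d))
    )
    X_p = N_d * X_n / N_a  # Eq. (23.10)
    return X_n, X_p


def barrier_parts(X_n, X_p, N_d, N_a, kappa):
    """Delta E_n and Delta E_p in eV from Eqs. (23.12a) and (23.12b)."""
    factor = E_CHARGE / (2.0 * kappa * EPS0)  # e^2/(2 kappa eps0), divided by e
    return factor * N_d * X_n**2, factor * N_a * X_p**2


# Hypothetical junctions: kappa = 16, barrier 0.4 eV, two doping levels.
light = depletion_widths(0.4, 1e21, 1e21, 16.0)
heavy = depletion_widths(0.4, 1e23, 1e23, 16.0)
print([f"{w * 1e6:.3f} micron" for w in light + heavy])
```

Raising both dopings by a factor of 100 shrinks each width by a factor of 10, which is the trend stated above.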


23.7 The p-n Rectifier 


When equilibrium is established at a p-n junction, electrons migrate at
equal rates in both directions and so also do holes. While there is a
vastly greater electron density in the conduction band of the n region,
only electrons with kinetic energy greater than ΔE in Fig. 23.8 can
surmount the potential barrier. On the other hand, any electron
reaching the junction of the p side finds the junction field exerting a force
toward the n side. The hole density in the p region exceeds that in
the n region, but again the barrier layer equalizes the flow of holes.
(Remember that for holes lower energy states lie upward, just opposite
to those for electrons.)

When the Fermi levels on both sides of the junction coincide, the
current I_0e carried by electrons moving from the p to the n region has
a magnitude proportional to the electron density in the p region and
may be written by Eq. (23.3a) as C_1 e^{-(E_cp - E_F)/kT}, where E_cp is
the bottom of the conduction band in the p region. This is balanced by an
equal flow of electrons from the n to the p region which is proportional
to the number of electrons in the n region with energy great enough to
surmount the barrier ΔE. By Eq. (21.14), this current has the magnitude
C_2 e^{-ΔE/kT} and we have

    I_{0e} = C_1 e^{-(E_{cp} - E_F)/kT} = C_2 e^{-\Delta E/kT}        (23.13a)

Similar relations for holes can be found (with the aid of Fig. 23.8) to be

    I_{0h} = C_3 e^{-\Delta E/kT} = C_4 e^{-(E_F - E_{vn})/kT}        (23.13b)

where E_vn is the top of the valence band in the n region.


Let us now consider what happens when we apply an external bias
voltage at the junction. At this point we follow convention by defining
the junction current to be positive when the p region is biased positive
and the n region negative. Then the conventional current is in the
-x direction of Fig. 23.9a and one speaks of the forward current
associated with the forward bias. For this case V is positive. When the n
region is biased positive, V is taken as negative and the junction is
reverse-biased (Fig. 23.9b); the resulting flow of charge is termed the
reverse current.

Fig. 23.9 An n-p junction (a) forward-biased and (b) reverse-biased.
(c) Current as a function of bias voltage.

When a bias voltage is applied to a p-n rectifier, most of it ordinarily
appears across the junction region, which is depleted of carriers and
therefore has high resistivity. We shall let V represent the bias across
the junction region proper; the applied voltage must then be V plus the
IR drops in the p and n regions on each side of the junction.

Whether the junction is biased or not, the currents C_1 e^{-(E_cp - E_F)/kT} and
C_4 e^{-(E_F - E_vn)/kT} (the minority-carrier currents of Eqs. (23.13a) and
(23.13b)) are unaffected, since they depend only on the appropriate
carrier densities. However, the barrier for electrons going from n to p
and for holes going from p to n material becomes ΔE - Ve (recall that
V is positive for forward bias, negative for reverse bias). The resultant
current at the junction is then given by

    I = C_2 e^{-(\Delta E - Ve)/kT} - C_1 e^{-(E_{cp} - E_F)/kT} + C_3 e^{-(\Delta E - Ve)/kT} - C_4 e^{-(E_F - E_{vn})/kT}
      = (I_{0e} + I_{0h})(e^{Ve/kT} - 1) = I_0 (e^{Ve/kT} - 1)        (23.14)

where I_0 = I_0e + I_0h. The current-voltage characteristics of the
junction are shown in Fig. 23.9c, which illustrates the strong rectifying
property of the p-n junction. The current rises sharply for increasing
forward bias but approaches -I_0 for increasing negative bias. (At
high negative bias avalanche breakdown occurs when the carriers gain
enough energy to create additional carriers in collisions.)
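The rectifying behavior described by Eq. (23.14) shows up clearly when the current is evaluated at equal forward and reverse biases. The sketch below assumes kT/e = 0.026 V (room temperature) and a hypothetical saturation current of 1 μA; the function name is illustrative.

```python
import math


def junction_current(V, I_0, kT_over_e=0.026):
    """Eq. (23.14): I = I_0 (exp(Ve/kT) - 1); V and kT/e in volts."""
    return I_0 * math.expm1(V / kT_over_e)


I_0 = 1e-6  # hypothetical saturation current, A
forward = junction_current(0.2, I_0)
reverse = junction_current(-0.2, I_0)
print(f"forward {forward:.3e} A, reverse {reverse:.3e} A")
```

At ±0.2 V the forward current exceeds the reverse current by more than three orders of magnitude, while the reverse current has already settled to within a fraction of a percent of -I_0.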

Not only does the p-n junction have an interesting current-voltage 
relationship, but it also has in some cases important capacitance char- 
acteristics. As we saw in Sec. 23.6, the junction region is depleted 
in carriers over a small distance and thus has the properties of an
electrical capacitance per unit area of magnitude C_A = κε_0/w, where κ is
the dielectric constant of the crystal and w the junction width. Under 
reverse bias the junction width is increased, so that the junction capaci- 
tance decreases. 


23.8 The Photovoltaic Effect 


When photons fall on a p-n junction, electrons are lifted from the valence
band to the conduction band provided the photon energy exceeds
the gap energy. Thus additional carriers, both electrons and holes, [...]
increase the minority-carrier densities substantially in both p and n
regions and an appreciably greater minority-carrier transfer is produced
across the junction.

Fig. 23.10 (a) Flow of minority carriers driven by absorption of photons near
a p-n junction. (b) Schematic diagram of a circuit with a photovoltaic emf.
(c) Load current I_L in circuit of (b) as a function of the terminal potential
difference V_L from the photovoltaic cell.

When a p-n junction is reverse-biased with |Ve| >> kT and kept
in the dark, the resulting reverse current as given by Eq. (23.14) is
I_r = I_0(1 - e^{-|Ve|/kT}) ≈ I_0, where I_0 is called the dark current in this
application. When the junction is illuminated, photons generate new
carriers which enhance the reverse current by an amount I_p, so that in
the presence of the light the reverse current is I_0 + I_p. By cooling the
junction, one can make I_0 << I_p. The resulting photodiode is a sensitive
device for detecting and measuring beams of photons. Properly designed
photodiodes are useful detectors of high-energy charged particles, x-rays,
and γ rays as well as of infrared and visible radiation. For detecting
in the far-infrared region a base material such as InSb with a small energy
gap is required, since the gap energy must be less than the photon energy.

When no external potential difference is applied to an illuminated
p-n junction, the photon-induced charge transfer results in producing
a photovoltage V_p across the barrier layer with the n region negative
and the p region positive. We can compute the photovoltaic emf in
terms of I_p and I_0 as follows. Once equilibrium is established in the
crystal in the absence of an external circuit, the forward current due to
V_p just balances the photon-induced I_p, so that 0 = I_0(e^{V_p e/kT} - 1) - I_p,
from which

    V_p = \frac{kT}{e} \ln \left( \frac{I_p}{I_0} + 1 \right)        (23.15)

When I_p >> I_0 the photovoltaic emf is proportional to the logarithm
of the photocurrent I_p, a desirable response for a cell to be used as a
light meter.
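The logarithmic response of Eq. (23.15) can be illustrated with a few trial photocurrents. This is a sketch only: kT/e is taken as 0.026 V, and the dark current of 1 nA and the photocurrents are hypothetical values.

```python
import math


def photovoltage(I_p, I_0, kT_over_e=0.026):
    """Eq. (23.15): open-circuit photovoltage in volts."""
    return kT_over_e * math.log(I_p / I_0 + 1.0)


I_0 = 1e-9  # hypothetical dark current, A
for I_p in (1e-6, 1e-5, 1e-4):
    print(f"I_p = {I_p:.0e} A -> V_p = {photovoltage(I_p, I_0):.3f} V")
```

Once I_p >> I_0, each tenfold increase in photocurrent adds the same increment, (kT/e) ln 10 or about 0.06 V, to the photovoltage.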

An important application of the photovoltaic effect is found in
the solar cell, which converts radiant energy from the sun to electrical
energy. The solar radiation curve is continuous with a broad maximum
in the green region of the visible spectrum, corresponding to a photon
energy of 2.5 eV. For conversion of solar radiation by a p-n junction
an energy gap of 1.0 to 1.5 eV is desirable. Silicon with E_g = 1.1 eV
has been widely used for solar cells. Since electrons and holes produced
far from the junction ordinarily recombine, it is desirable that the
junction of a silicon solar cell be very close to the surface; typically a p layer
about 1 μ in thickness is formed at the surface which is subsequently
covered with an antireflection coating similar to that used on lenses.

When a solar cell is used to supply power to a load (Fig. 23.10b),
the photon-generated current I_p is equal to the load current I_L plus the
forward junction current due to the induced potential difference V:

    I_p = I_L + I_0 (e^{Ve/kT} - 1)        (23.16)

The voltage V_L appearing across the load is the generated potential
difference minus the potential drop I_L R_c, where R_c is the solar cell
resistance, due almost entirely to the very thin p layer. Combining
V_L = V - I_L R_c with Eq. (23.16) leads to

    V_L = \frac{kT}{e} \ln \left( \frac{I_p - I_L}{I_0} + 1 \right) - I_L R_c        (23.17)

The current-voltage curve for a 1.7-cm^2 solar cell with R_c = 4 Ω is
shown in Fig. 23.10c. A good silicon solar cell may have an efficiency
(electrical power delivered to load divided by total incident solar power)
of about 15 percent.
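Equation (23.17) gives the terminal voltage for any load current, so the operating point of maximum delivered power can be located by a simple scan. In the sketch below only R_c = 4 Ω comes from the text; the photocurrent of 40 mA, the dark current of 10 nA, and kT/e = 0.026 V are hypothetical inputs, and the function names are illustrative.

```python
import math


def load_voltage(I_L, I_p, I_0, R_c, kT_over_e=0.026):
    """Eq. (23.17): terminal voltage V_L (volts) for load current I_L (A)."""
    return kT_over_e * math.log((I_p - I_L) / I_0 + 1.0) - I_L * R_c


def max_power_point(I_p, I_0, R_c, steps=2000):
    """Scan I_L from 0 to I_p; return (power, I_L, V_L) at the maximum."""
    best = (0.0, 0.0, load_voltage(0.0, I_p, I_0, R_c))
    for k in range(1, steps + 1):
        I_L = I_p * k / steps
        V_L = load_voltage(I_L, I_p, I_0, R_c)
        if I_L * V_L > best[0]:
            best = (I_L * V_L, I_L, V_L)
    return best


# Hypothetical currents; R_c = 4 ohm as quoted for the cell in the text.
I_p, I_0, R_c = 0.04, 1e-8, 4.0
power, I_best, V_best = max_power_point(I_p, I_0, R_c)
print(f"P_max = {power * 1e3:.2f} mW at I_L = {I_best * 1e3:.1f} mA, V_L = {V_best:.3f} V")
```

A grid scan is used instead of a closed-form maximum because the logarithmic and resistive terms together have no simple analytic optimum; the series resistance noticeably lowers the terminal voltage near the best operating point.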


23.9 The Tunnel (or Esaki) Diode 


When junctions are very heavily doped (≥10^25 atoms/m^3), the donor
and acceptor levels are no longer sharp and isolated but are spread into
bands, which overlap the valence and conduction bands (Fig. 23.11a).
The Fermi level on the n side moves into the conduction band, and that
in the p region moves into the valence band. Further, the junction
width becomes very narrow (~100 Å), so that quantum-mechanical
tunneling may occur through the barrier at the interface. The tunnel
current depends sensitively on the junction width.

At equilibrium the number of electrons tunneling in both directions 
through the barrier is the same. For a small forward bias (Fig. 23.11b)
the number of electrons tunneling from the n to the p side is greatly 
increased because the number of empty states to which they can transfer 
is large; at the same time tunneling in the opposite direction is reduced. 
However, as the forward bias is increased, the states into which elec- 
trons may transfer by tunneling decrease and soon disappear (Fig. 
23.11c). As V is further increased, the forward current is due to elec- 
trons which have enough energy to surmount the potential barrier for 
which Eq. (23.14) is applicable. 

The resulting current-voltage curve for the tunnel diode is shown
in Fig. 23.12. In the region between the peak and the minimum, the
slope of the curve is negative, and the diode has a negative dynamic
resistance dV/dI. Such a negative resistance is a very useful property.
By suitable choice of components it is possible to produce a circuit with
no dynamic resistance. A circuit containing an inductor, capacitor, and
a biased tunnel diode can form a simple oscillator capable of generating
frequencies as high as 10^11 Hz.

Fig. 23.11 A tunnel, or Esaki, diode (a) in equilibrium with no bias, (b) with a
small forward bias (~0.05 V in Fig. 23.12), and (c) with a large forward bias
(0.2 V in Fig. 23.12).

Fig. 23.12 The current as a function of applied potential bias for
a tunnel diode. (After L. Esaki.)


23.10 Metal-semiconductor Junctions


When one makes an electrical contact to a semiconductor crystal, the 
contact may act as a rectifier or as an ohmic connection, depending on 
the Fermi levels of the materials involved. If the connection is to an 
n-type crystal, the contact is ohmic when the Fermi level of the metal lies 
above that of the semiconductor and rectifying when it lies below. For a
connection to a p-type semiconductor, the contact is ohmic when the 
Fermi level of the metal lies lower than that of the semiconductor and 
rectifying if it lies above. 

Consider a junction formed between the metal and n-type semi- 
conductor of Fig. 23.13. When contact is made, electrons flow from the 
metal until the Fermi levels coincide, thereby charging the semiconductor 
negative and leaving additional electrons on the semiconductor side of 
the junction. When a potential difference is applied in either direction, 
there are abundant carriers at the junction to give current in either 


direction and the current is proportional to the applied potential
difference. The junction is ohmic.

Fig. 23.13 An ohmic junction between a metal and an n-type
semiconductor; at the junction carriers are plentiful on both sides of the
junction.


If the Fermi level in the metal lies below that in the n-type semi- 
conductor (Fig. 23.14a), electrons go from semiconductor to metal, 
thereby producing a positive depletion layer of ionized donors. When 
equilibrium is achieved, the metal surface bears a negative charge and 
electrons from the metal must have an energy greater than E_F + ΔE to
diffuse to the semiconductor. The number which do surmount the
barrier is just equal to the number which move from semiconductor to
metal. The currents in both directions, each of magnitude I_0, are
proportional to e^{-ΔE/kT}. When a voltage is applied at the junction, it does
nothing about the height of the barrier on the metal side; that is
determined entirely by the nature of the metal and of the semiconducting
crystal. Because the depletion layer has reduced conductivity, most of
the potential difference appears across it. For forward bias (n material
negative) the Fermi level is raised relative to that in the metal (Fig.
23.14b), and the current associated with electrons going from
semiconductor to metal increases by a factor e^{Ve/kT}. As a result, the forward
current becomes

    I = I_0 (e^{Ve/kT} - 1)        (23.18)

For reverse bias, Eq. (23.18) is still applicable; V is then negative.
Thus we see the junction current follows the same equation as that of a
p-n junction.


Fig. 23.14 (a) A rectifying junction between a metal and an n-type
semiconductor; there is a depletion layer at the junction. (b) The junction
of (a) subject to forward bias.

When metal contacts are made to a p-type crystal, conduction
in the semiconductor is primarily by holes. If the Fermi level of the
metal lies above that of the semiconductor, electrons from the metal fill 
the holes at the junction and a rectifying depletion layer forms. When 
the Fermi level lies lower in the metal, electrons from the semiconductor 
go to the metal, giving additional carriers at the junction, which is then 
ohmic. 


23.11 Thermoelectricity 


When a current passes across the junction between dissimilar conductors, 
heat is either generated or absorbed at the junction (in addition to the 
joule heat liberated). This phenomenon is called the Peltier effect. The 
thermal energy H removed from the junction when a charge q passes 
from material a to material b is 


    H = \Pi_{a \to b} \, q        (23.19)


where Π_{a→b} is called the Peltier emf (or coefficient).

Consider the circuit of Fig. 23.15, in which a current is passed
through an n-type semiconductor with ohmic contacts to the same metal
at each end. (The double layer implied in Fig. 23.13 exists over such a
short distance at each junction that it does not appear on this scale.)
For the current direction shown, the junction at the right is cooled (H
positive), while that at the left is heated (H negative). At the left
contact, electrons going from semiconductor to metal enter the metal
with energy greater than E_F in the metal. This energy is rapidly
transferred to the metal lattice and appears as heat. At the right contact
only electrons with energy greater than E_F + ΔE can enter the
semiconductor. The metal loses electrons occupying higher energy states
but not those of lower energy. This leads to a lowering of the average
electron energy and to a cooling of the junction.

Fig. 23.15 When a current is passed through the semiconductor in
the direction shown, the junction at the right is cooled and that at the
left is heated.

At the left junction, which is heated, H is negative for conventional
current going from metal to semiconductor. On the average electrons
fall from a level E_c + 2kT in the semiconductor to energy E_F in the metal,
with this difference appearing as heat. By Eq. (23.19),

    \Pi_{m \to n} = -\Pi_{n \to m} = -\frac{E_c + 2kT - E_F}{e}        (23.20)

where e = +1.6 × 10^-19 C. The 2kT in this expression is the average
kinetic energy of the electrons crossing the junction. The average kinetic
energy of electrons in the conduction band is about 3kT/2 (if E_c - E_F >
5kT), but electrons of higher speed are more likely to cross the junction.
For p-type material Π is positive for current from metal to semiconductor
and Π_{m→p} = -Π_{p→m} = (E_F + 2kT - E_v)/e.
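To get a feeling for the magnitude of the Peltier emf in Eq. (23.20), one can insert a representative Fermi-level position. The sketch below assumes the Fermi level lies 0.20 eV from the relevant band edge at 300°K, a hypothetical choice; energies are entered in electron volts, so dividing by e gives the emf directly in volts. The function names are illustrative.

```python
K_B_EV = 8.617333262e-5  # Boltzmann constant, eV/K


def peltier_metal_to_n(E_c_minus_E_F_eV, T):
    """Eq. (23.20): Peltier emf (volts) for current from metal to n-type."""
    return -(E_c_minus_E_F_eV + 2.0 * K_B_EV * T)


def peltier_metal_to_p(E_F_minus_E_v_eV, T):
    """Corresponding expression for p-type material, in volts."""
    return E_F_minus_E_v_eV + 2.0 * K_B_EV * T


# Hypothetical Fermi-level positions 0.20 eV from the band edge at 300 K.
pi_n = peltier_metal_to_n(0.20, 300.0)
pi_p = peltier_metal_to_p(0.20, 300.0)
print(f"metal to n: {pi_n:.3f} V, metal to p: {pi_p:.3f} V")
```

The result is about a quarter of a volt in magnitude, with opposite signs for n- and p-type material, which is the sign difference exploited when both kinds of contact are used to cool the same metal.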

Peltier emfs for metal-semiconductor junctions are typically much 
larger than for metal-metal junctions because the average potential 
energy of the carriers is larger than the Fermi energy in semiconductors, 
whereas the reverse is true for metals. The Peltier emf for doped semi- 
conductors depends on the junction temperature, not only by virtue of 
the term 2kT but also because the position of the Fermi level in the 
energy gap varies with temperature. By careful choice of p and n
semiconductors, it is possible to pass a conventional current from the n
specimen to metal and then to the p specimen, thus cooling the metal at both
contacts. With such an arrangement the metal can be cooled as much 
as 80°C below the temperature of the remainder of the circuit. 

In addition to the Peltier emf, which appears at any junction
between dissimilar conductors, there is also a Thomson emf, which
appears between the ends of a conductor when they are at different
temperatures. The sign and magnitude of the Thomson emf depend
on the material of which the conductor is made. The sum (with due
regard to sign) of all Peltier and Thomson emfs around a closed circuit
is known as the Seebeck emf, which appears when two junctions in a
circuit are at different temperatures.


23.12 The Transistor 


A junction transistor is composed of two p-n junctions close together in
the same single crystal. There are two basic types, the p-n-p and the
n-p-n transistor, depending on the conduction type of the crystal between
the two junctions. The operation of both types is fundamentally similar,
but holes are the majority carriers for the p-n-p transistors and electrons
for the n-p-n. Just as with vacuum tubes, a broad range of
operating characteristics can be achieved by various choices of the many
controllable variables in transistor design. We describe below some of
the physics underlying the operation of an n-p-n transistor.

A common type of n-p-n transistor consists of a strongly doped n
region separated from a moderately doped n region by a thin (~20 μ)
lightly doped p region. The energy-level diagram for equilibrium in the
absence of applied voltages is shown in Fig. 23.16a. In operation, the
stronger n region, known as the emitter, is forward-biased, so that it
injects electrons into the p region, or base. It is very important that
the diffusion length (average distance a minority carrier travels before
recombining with a majority carrier) for electrons be very large compared
to the thickness of the base, so that a very small fraction of the electrons
entering the base from the emitter combine with holes. Almost all
the electrons pass through the p-n junction into the collector, which is
reverse-biased. The energy-level diagram for a biased transistor appears
in Fig. 23.16b. It should be noted that the equilibrium concentration
of electrons is not achieved in the base during normal operation, when
most of the electrons injected from the emitter simply diffuse through
the base. At the base-collector junction the reverse bias sweeps these
electrons into the collector.

The ratio of the collector current $I_c$ to the emitter current $I_e$ is
known as the current gain factor $\alpha$, which approaches 1 in many
well-designed transistors. This factor is the product of the emitter efficiency
$\gamma$ and the base transport efficiency $\epsilon$. The emitter efficiency is the
fraction of the emitter-junction current carried by majority carriers (in
this case, electrons). It is to achieve a $\gamma$ near unity that the emitter is
strongly doped. For the n-p-n transistor the base transport efficiency
is the ratio of the collector current to the electron current injected at
the emitter junction; $\epsilon$ approaches unity when the base is very narrow
and the diffusion length for electrons in the p material is large. It is
advantageous to have modest doping for the base region, since heavier
doping results in a shorter diffusion length for minority carriers. With
a collector potential of only a few volts, the collector current becomes
almost equal to the emitter current.


Fig. 23.16 The energy bands for n-p-n junction transistor (a) in equilibrium
with no applied potential differences and (b) biased for operation as an amplifier.


Fig. 23.17 A grounded-emitter circuit in which an n-p-n transistor
serves as an amplifier.

Transistors can perform many functions in electronic circuits.
For example, the transistor of Fig. 23.17 is connected to serve as a
grounded-emitter amplifier. The term grounded-emitter implies that the
emitter is common to both input and output. (For specific purposes
grounded-base and grounded-collector circuits are also used.) While a full
analysis of an amplifier circuit requires knowledge of all the circuit
components, it is easy to see qualitatively how amplification arises. A
small change in the forward bias at the emitter junction produces a
large change in emitter current and a corresponding change in collector
current if $\epsilon \approx 1$. If $R_L$ is large (but $I_c R_L$ remains less than the collector
bias voltage), the corresponding change in potential difference across
$R_L$ can be large compared to the change in voltage at the emitter junction.
Power gains of 40 to 50 db are not uncommon. Junction transistors
can operate at the microwatt power level; they are small, rugged,
and readily adapted to a wide range of applications.
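The qualitative argument for amplification can be put into numbers with a minimal sketch. It assumes only the two relations stated in this section: the current gain factor is the product of the emitter efficiency and the base transport efficiency, and the change in collector current appears across the load resistor. All numerical values and function names are hypothetical, chosen for illustration.

```python
def current_gain(emitter_efficiency, base_transport_efficiency):
    """Current gain factor: product of the two efficiencies."""
    return emitter_efficiency * base_transport_efficiency


def load_voltage_change(delta_emitter_current, gain, load_resistance):
    """Change in potential difference across the load resistor (volts)."""
    return gain * delta_emitter_current * load_resistance


# Hypothetical values: efficiencies near unity, 0.1 mA swing, 10 kilohm load.
alpha = current_gain(0.995, 0.99)
delta_v_out = load_voltage_change(1.0e-4, alpha, 1.0e4)
print(f"alpha = {alpha:.5f}")
print(f"change across load = {delta_v_out:.3f} V")
```

A swing of this size across the load can greatly exceed the small change in emitter-junction voltage that produced it, which is the sense in which the circuit amplifies.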


Problems 
1. Calculate the numbers of electrons and holes per unit volume for a
pure germanium crystal at 200, 300, and 400°K. (Take $E_g$ = 0.70 eV.)


Ans: 3.6-X 10°, 5.1 X 10”, and 2.0 X 1021 m-? 
2. Silicon is fairly transparent to infrared radiation of wavelength
greater than 1.1 μ but absorbs shorter wavelengths strongly. From
this information estimate the energy gap in silicon.
3. (a) Calculate the number of electrons and holes in intrinsic silicon 
at 300°K. 

(b) If the mobilities of electrons and holes are 0.12 and 0.05 m²/V-s
respectively, find the conductivity of Si at 300°K.

(c) Compute the expected Hall coefficient at 300°K.

Ans: (a) $1.5 \times 10^{16}$ m⁻³; (b) $4.1 \times 10^{-4}$ mho/m; (c) 170 m³/C

4. Assume that four electrons of an isolated As donor atom in a Ge
crystal form valence bonds and that the fifth exists in the first Bohr 
orbit about an effective nuclear charge of +e in a medium of dielectric 
constant 16. Find the binding energy and the radius of this orbit. 
Compare the radius with the nearest-neighbor separation in the Ge 
crystal. 


Ans: —0.053 eV; 8.5 A; 3.5 times nearest-neighbor distance 


5. (a) Estimate the energy of the lowest bound state of the fifth valence 
electron of an arsenic atom in a silicon lattice (Fig. 23.4a), assuming that 
the specific inductive capacitance of Si is 12. 

(b) Calculate the radius of the first Bohr orbit. 

(c) Find the ratio of this radius to the interatomic spacing between 
silicon atoms if a = 5.4 A. 


6. Find the position of the Fermi level for a germanium semiconductor
doped with phosphorus at the rate of $5 \times 10^{22}$ atoms/m³ at temperatures
of 200, 300, and 400°K.

Ans: 0.10, 0.16, 0.23 eV below $E_c$
7. A germanium crystal is doped with phosphorus at $10^{23}$ atoms/m³
and with aluminum at $5 \times 10^{22}$ atoms/m³. How does the conductivity
of this crystal compare with that of the crystal of Prob. 6 when both
crystals are at 300°K? Calculate the conductivity if the mobilities of
holes and electrons are respectively 0.20 and 0.40 m²/V-s. How many
times is the conductivity greater than that of pure Ge?

Ans: $3.2 \times 10^{3}$ mhos/m; 1600
8. Show that if a semiconductor has a zero Hall coefficient, the fraction
f of the current carried by electrons is given by $f = \mu_p/(\mu_n + \mu_p)$.
Find f for a specimen of silicon for which the mobility is 0.12 m²/V-s for
electrons and 0.05 for holes.
9. An alternative way to explain the rectifying characteristics of a
p-n junction is based on changes in the width of the depletion region.
Discuss the rectification on that basis.

10. Estimate with the aid of Fig. 23.6 the internal potential difference
$\Delta V$ (= $\Delta E/e$) in a germanium p-n junction at room temperature for an
acceptor concentration of $10^{21}$ atoms/m³ and a donor concentration of
$10^{25}$ atoms/m³. Calculate the width of the junction.

11. An intrinsic semiconductor has a band gap of 0.2 eV as determined 
by optical absorption. If it has a conductivity of 3 mhos/m at 290°K, 
calculate the conductivity at 400°K, assuming that the mobilities of both 
electrons and holes remain constant and that the effective masses of 
holes and electrons are equal. 


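12. (a) Show that the width w of an abrupt junction (given by
$w = x_n + x_p$) is equal to


$$w = \left[\frac{2\epsilon_0 K\,\Delta E}{e^2}\left(\frac{1}{N_a} + \frac{1}{N_d}\right)\right]^{1/2}$$


where the quantities are defined in Sec. 23.6.
(b) Show that when $N_a \gg N_d$,


$$w \approx x_n \approx \left[\frac{2\epsilon_0 K\,\Delta E}{e^2 N_d}\right]^{1/2}$$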


13. A p-n junction involves a double layer of opposite charges separated
by a narrow depletion layer and thus acts as a capacitor. From the fact
that a parallel-plate capacitor has a capacitance $C = \epsilon_0 K A/d$, where A
is the area and d the separation, show that a p-n junction for which
$N_a \gg N_d$ has a capacitance per unit area given approximately by


$$\frac{C}{A} = \left[\frac{\epsilon_0 K N_d e^2}{2(\Delta E + Ve)}\right]^{1/2}$$


where V is the potential difference applied at the junction. Thus the
capacitance varies with applied voltage. (Use the results of the preceding
problem.)
14. A silicon p-n junction has gallium at $10^{24}$ atoms/m³ for $N_a$ and
arsenic at 2 X 10% atoms/m³ for $N_d$. What is the approximate value
of $\Delta E$ for this junction? What is the approximate junction thickness?


15. Show that the average kinetic energy of electrons going “over the 
falls” from the n material of Fig. 23.15 to the metal is 2kT. 


chapter twenty-four 


Interactions of High-energy 
Particles with Matter 


When high-energy charged particles and photons pass through a substance,
they interact with the atoms of the material in a variety of ways.
Some understanding of the processes involved is a valuable preface to a discussion
of nuclear physics, since in nuclear reactions the target nuclei, the
bombarding particles, and the resulting products are far too small to see
and too light to weigh on the most sensitive balance. Consequently, the
unraveling of a nuclear reaction is often based on information inferred from
indirect evidence involving interactions of the particles with matter through
which they pass. We discuss first the processes by which high-energy
photons are removed from a beam.


24.1 Attenuation of a Photon Beam 


When photons have their origin in nuclear transitions, they are called
γ rays. If they result from the acceleration of free electrons or other
charged particles, they are known as bremsstrahlung (braking radiation)
or as x-rays. If they arise from interactions of primary cosmic-ray
particles in the upper atmosphere, they are called secondary cosmic rays.
If their origin is in the combination of a positron and an electron, they
are described as annihilation radiation. The properties of the photons
depend on the frequency and not at all on their origin.

When a narrow collimated beam of monochromatic high-energy
photons passes through a thin film of thickness x, the intensity I transmitted
in the same direction and at the same wavelength is given by


$$I = I_0 e^{-\mu x} \tag{24.1}$$


where $I_0$ is the incident intensity and $\mu$ is the linear attenuation coefficient.
(It is sometimes referred to in the literature as the linear absorption
coefficient; we prefer to reserve the term absorption coefficient to refer
to that part of the energy actually converted to internal energy within
the absorber.) An exponential attenuation law is characteristic of processes
in which a particle is removed from a beam in a single event. In
contrast, when α particles are absorbed in air, each one loses a small
fraction of its energy in each of many encounters with air molecules;
when all the kinetic energy has been dissipated, the α particle is said to
have been stopped.
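The exponential law of Eq. (24.1) is easy to explore numerically. The sketch below is a minimal illustration: the function names are hypothetical, the coefficient value is an arbitrary placeholder, and the half-value thickness (the thickness that halves the intensity, equal to ln 2 divided by the attenuation coefficient) is introduced here only as a convenient check.

```python
import math


def transmitted_intensity(incident_intensity, mu, thickness):
    """Narrow-beam intensity after a thickness x, Eq. (24.1)."""
    return incident_intensity * math.exp(-mu * thickness)


def half_value_thickness(mu):
    """Thickness that reduces the transmitted intensity to one half."""
    return math.log(2.0) / mu


# Placeholder attenuation coefficient (per cm); not a tabulated value.
mu = 0.5
x_half = half_value_thickness(mu)
for n_layers in range(4):
    fraction = transmitted_intensity(1.0, mu, n_layers * x_half)
    print(f"{n_layers} half-value layers: fraction transmitted = {fraction:.4f}")
```

Each additional half-value layer removes the same fraction of what reaches it, which is the signature of removal in a single event.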


The ratio of the linear attenuation coefficient to the density of the
absorber is the mass attenuation coefficient $\mu_m$, which has the useful property
that it is independent of the particular phase in which a given absorbing
material may be. For example, it is the same for water vapor,
water, and ice. If the mass attenuation coefficient for any element is
multiplied by the atomic weight A and divided by Avogadro's number
$N_A$, the result is the atomic attenuation coefficient $\mu_a$


$$\mu_a = \frac{\mu_m A}{N_A} = \frac{\mu A}{\rho N_A} \tag{24.2}$$


The atomic attenuation coefficient is a useful concept in that it
enables one to calculate the molecular attenuation coefficient of any type
of molecule simply by adding the contributions of each of the atoms
composing the molecule. For example, the molecular attenuation coefficient
of copper sulfate is given by


$$\mu_{\mathrm{CuSO_4}} = \mu_{\mathrm{Cu}} + \mu_{\mathrm{S}} + 4\mu_{\mathrm{O}} \tag{24.3}$$

Thus one can calculate the molecular attenuation coefficient for any
chemical compound for which one knows the chemical formula and the
atomic coefficients of the atoms involved.
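The additivity rule of Eq. (24.3), together with the conversion of Eq. (24.2), can be sketched in a few lines. The mass attenuation coefficients below are placeholders chosen only to make the example run; they are not tabulated data, and the function names are hypothetical.

```python
AVOGADRO = 6.022e23  # atoms per mole


def atomic_attenuation(mass_coeff_cm2_per_g, atomic_weight):
    """Eq. (24.2): atomic coefficient (cm^2 per atom) from the mass coefficient."""
    return mass_coeff_cm2_per_g * atomic_weight / AVOGADRO


def molecular_attenuation(composition, atomic_coeffs):
    """Eq. (24.3): sum of atomic coefficients weighted by atoms per molecule."""
    return sum(count * atomic_coeffs[element]
               for element, count in composition.items())


# Placeholder mass attenuation coefficients in cm^2/g (hypothetical values).
mass_coeffs = {"Cu": 0.20, "S": 0.10, "O": 0.08}
atomic_weights = {"Cu": 63.55, "S": 32.06, "O": 16.00}

atomic_coeffs = {el: atomic_attenuation(mass_coeffs[el], atomic_weights[el])
                 for el in mass_coeffs}
mu_cuso4 = molecular_attenuation({"Cu": 1, "S": 1, "O": 4}, atomic_coeffs)
print(f"molecular attenuation coefficient of CuSO4: {mu_cuso4:.3e} cm^2")
```

Any other compound is handled by changing the composition dictionary, provided the atomic coefficients are known at the photon energy of interest.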


24.2 Attenuation Processes


X-ray photons can be removed from a well-collimated beam by a number 
of processes, of which photoelectric absorption is usually most important 
for energies in the lower range, scattering for intermediate energies, and 
pair production at high energies. In addition, photons of sufficiently 
high energy can be absorbed through nuclear reactions resulting in the 
ejection of a particle from a nucleus or in the fission of an atom. For 
example, 2.2-MeV photons can split a deuteron into a proton and a 
neutron, a process called photodisintegration. Ordinarily the cross sec- 
tion for such reactions is relatively small, and so we can write the atomic 
attenuation coefficient as 


$$\mu_a = \tau_a + \sigma_a + \pi_a \tag{24.4}$$


where $\tau_a$, $\sigma_a$, and $\pi_a$ are the atomic cross sections for photoelectric
absorption, scattering, and pair production respectively. Which
of these processes is dominant depends on the energy of the photons and 
on the atomic number of the absorber (Fig. 24.1). 


Fig. 24.1 High-energy photons are removed from a collimated
beam by photoelectric absorption, scattering, and pair production.
The lines show the values of Z and $h\nu$ for which the two neighboring
effects are equal. The plot of Z of absorber against $h\nu$ (0.01 to 100 MeV)
is divided into regions in which the photoelectric effect, the Compton
effect, and pair production are dominant. (From R. D. Evans, "The Atomic
Nucleus." Copyright 1955. McGraw-Hill Book Company. Used by permission.)


a. Photoelectric Absorption X-ray photons with energies in the keV
range are absorbed primarily through the photoelectric process. For
photon energies greater than the K binding energy, the atomic photoelectric
cross section of an atom decreases rapidly as $\lambda$ decreases. A
crude approximation is


$$\tau_a = CZ^4\lambda^3 \tag{24.5}$$


where C is a constant equal roughly to $2.6 \times 10^{-30}$ m². For $\lambda$ well
below the K edge, $\tau_a$ varies more nearly as \** for a modest range of
wavelengths; the exponent then decreases so that for photon energy
E > 0.5 MeV, $\tau_a$ is roughly proportional to $\lambda$. As photon energies
increase, the exponent of Z also increases; for E = 3 MeV, $\tau_a$ goes as
$Z^{4.6}$, and for very high energies $Z^5$ is probably a reasonable estimate.
However, for energies in the MeV range, $\tau_a$ is less than the scattering
cross section.


b. Scattering Photons are also removed from a well-collimated beam 
by scattering processes, both of the Compton type (Sec. 7.10), in which 
part of the photon energy is transferred to an electron, and of the Thom- 
son-Rayleigh type (Sec. 7.9). The latter is more probable for long 
wavelengths, while for high-energy photons Compton scattering is the 
more important type. In 1929 Klein and Nishina applied Dirac's
relativistic formulation of quantum mechanics to achieve a unified theory
of scattering which correctly predicts both the angular intensities and
the polarization of scattered radiation.


Consider polarized quanta of energy $h\nu$ moving along the z axis
with the electric vector in the x direction. Let I be the incident intensity
and consider the radiation of energy $h\nu_\theta$ scattered by a single free electron
at an angle $\theta$ with the z axis. Let $\psi$ be the angle between the plane of
scattering (i.e., the plane determined by the incident and scattered
quanta) and the xz plane. In the beam scattered at an angle $\theta$, the
intensity may be considered to be the sum of two linearly polarized beams,
one with the electric vector perpendicular to the xz plane and the other
with the electric vector in the xz plane, indicated by $I_\perp$ and $I_\parallel$ respectively.
According to the Klein-Nishina theory, at distance r from the
scattering electron,


$$I_\perp = \frac{I}{4r^2}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\left(\frac{h\nu_\theta}{h\nu}\right)^3\left(\frac{h\nu}{h\nu_\theta} + \frac{h\nu_\theta}{h\nu} - 2\right) \tag{24.6a}$$

$$I_\parallel = \frac{I}{4r^2}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\left(\frac{h\nu_\theta}{h\nu}\right)^3\left(\frac{h\nu}{h\nu_\theta} + \frac{h\nu_\theta}{h\nu} + 2 - 4\sin^2\theta\cos^2\psi\right) \tag{24.6b}$$

where $e^2/4\pi\epsilon_0 mc^2$ is the classical radius of the electron, Eq. (2.16). The
total intensity $I_\theta$ scattered at angle $\theta$ from an unpolarized primary beam
can be computed by averaging Eqs. (24.6a, b) over $\psi$ and adding, which
gives the general relation


$$I_\theta = \frac{I}{2r^2}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\left(\frac{h\nu_\theta}{h\nu}\right)^3\left(\frac{h\nu}{h\nu_\theta} + \frac{h\nu_\theta}{h\nu} - \sin^2\theta\right) \tag{24.7}$$


From Eq. (7.16a) it follows directly that


$$\nu_\theta = \frac{\nu}{1 + \epsilon(1 - \cos\theta)} \tag{24.8}$$


where $\epsilon = h\nu/mc^2$. Substitution of Eq. (24.8) into (24.7) yields


$$I_\theta = \frac{I}{2r^2}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2(1 + \cos^2\theta)\,\frac{1}{[1 + \epsilon(1 - \cos\theta)]^3}\left\{1 + \frac{\epsilon^2(1 - \cos\theta)^2}{(1 + \cos^2\theta)[1 + \epsilon(1 - \cos\theta)]}\right\} \tag{24.9}$$
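The algebra connecting Eqs. (24.7), (24.8), and (24.9) can be checked numerically. The sketch below is an illustration under one stated assumption: the intensity expressions carry the cube of the frequency ratio, which is the form consistent with the limiting result of Eq. (24.9a). The common prefactor (incident intensity times the squared classical electron radius over $2r^2$) is omitted, and the function names are hypothetical.

```python
import math


def frequency_ratio(eps, theta):
    """Scattered-to-incident frequency ratio from Eq. (24.8)."""
    return 1.0 / (1.0 + eps * (1.0 - math.cos(theta)))


def angular_factor_24_7(eps, theta):
    """Angular factor of Eq. (24.7), written in terms of the frequency ratio."""
    q = frequency_ratio(eps, theta)
    return q**3 * (1.0 / q + q - math.sin(theta) ** 2)


def angular_factor_24_9(eps, theta):
    """Angular factor of Eq. (24.9), written explicitly in eps and theta."""
    c = math.cos(theta)
    x = 1.0 + eps * (1.0 - c)
    return (1.0 + c * c) / x**3 * (1.0 + eps**2 * (1.0 - c) ** 2 / ((1.0 + c * c) * x))


# Intensity at each angle relative to the forward direction, as in Fig. 24.2.
for eps in (0.0, 0.2, 1.0, 5.0):
    forward = angular_factor_24_9(eps, 0.0)
    row = [angular_factor_24_9(eps, math.radians(deg)) / forward
           for deg in (0, 45, 90, 135, 180)]
    print(f"eps = {eps}: " + ", ".join(f"{value:.3f}" for value in row))
```

For eps = 0 the factor reduces to $1 + \cos^2\theta$, the classical Thomson pattern, and increasing eps concentrates the scattered intensity in the forward direction.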


The intensity distribution for Klein-Nishina scattering as a function
of $\theta$ is shown in Fig. 24.2 for several values of $\epsilon$. The experimental
points are those of Friedrich and Goldhaber for a wavelength of 0.14 Å
scattered by carbon.


Fig. 24.2 Ratio of the intensity of the beam scattered at
angle $\theta$ to the scattered intensity at $\theta = 0°$ for four values of
$\epsilon = h\nu/mc^2$.


When $\epsilon(1 - \cos\theta) \gg 1$, the scattered radiation is unpolarized, even
when the incident photons are polarized. In this case


$$I_\theta = I_\perp + I_\parallel = 2I_\perp = \frac{I}{2r^2}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\frac{1}{\epsilon^2(1 - \cos\theta)^2} \tag{24.9a}$$


To obtain the total power $P_s$ removed from the beam by scattering
by an electron, it is not sufficient to integrate $I_\theta$ over a sphere surrounding
the electron because each photon removed has energy $h\nu$, while the
scattered photon has energy $h\nu_\theta$. Rather


$$P_s = \int_0^\pi I_\theta\,\frac{\nu}{\nu_\theta}\,2\pi r^2\sin\theta\,d\theta$$


By use of Eqs. (24.8) and (24.9) $P_s$ can be found and from it the
free-electron scattering cross section $\sigma_e$.


$$\sigma_e = \frac{P_s}{I} = 2\pi\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\left\{\frac{1 + \epsilon}{\epsilon^2}\left[\frac{2(1 + \epsilon)}{1 + 2\epsilon} - \frac{1}{\epsilon}\ln(1 + 2\epsilon)\right] + \frac{1}{2\epsilon}\ln(1 + 2\epsilon) - \frac{1 + 3\epsilon}{(1 + 2\epsilon)^2}\right\} \tag{24.10}$$


For $\epsilon \ll 1$, Eq. (24.10) may be expanded in a power series in $\epsilon$.


$$\sigma_e = \frac{8\pi}{3}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2(1 - 2\epsilon + 5.2\epsilon^2 - 13.3\epsilon^3 + \cdots) \tag{24.10a}$$


This reduces to the Thomson classical cross section as $\epsilon \to 0$. For
$\epsilon \gg 1$, Eq. (24.10) becomes


$$\sigma_e = \frac{\pi}{\epsilon}\left(\frac{e^2}{4\pi\epsilon_0 mc^2}\right)^2\left(\ln 2\epsilon + \frac{1}{2}\right) \tag{24.10b}$$

Thus the scattering cross section is almost inversely proportional to
photon energy when $h\nu \gg mc^2$.

To photons with energy well in excess of the K-shell binding energy,
all electrons are essentially free, and all scatter essentially equally and
incoherently in accord with Eq. (24.10). Consequently the atomic scattering
cross section of any element is given by $\sigma_a = Z\sigma_e$. Similarly, the
linear scattering coefficient of a material is simply $n\sigma_e$, where n represents
the total number of electrons per unit volume.
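The free-electron cross section of Eq. (24.10) and its two limiting forms can be compared directly. The following is a minimal sketch using the standard closed form of the Klein-Nishina cross section; the value of the classical electron radius and the function names are assumptions introduced for the example.

```python
import math

# Classical electron radius in meters (standard value, rounded).
R_ELECTRON = 2.818e-15


def sigma_klein_nishina(eps):
    """Free-electron scattering cross section (m^2), Eq. (24.10), for eps > 0."""
    t = 1.0 + 2.0 * eps
    log_t = math.log(t)
    bracket = ((1.0 + eps) / eps**2 * (2.0 * (1.0 + eps) / t - log_t / eps)
               + log_t / (2.0 * eps)
               - (1.0 + 3.0 * eps) / t**2)
    return 2.0 * math.pi * R_ELECTRON**2 * bracket


def sigma_thomson():
    """Classical Thomson cross section (m^2), the eps -> 0 limit."""
    return 8.0 * math.pi / 3.0 * R_ELECTRON**2


def sigma_low_energy_series(eps):
    """Power series of Eq. (24.10a), truncated after the cubic term."""
    return sigma_thomson() * (1.0 - 2.0 * eps + 5.2 * eps**2 - 13.3 * eps**3)


def sigma_high_energy(eps):
    """Asymptotic form of Eq. (24.10b) for eps much greater than 1."""
    return math.pi * R_ELECTRON**2 / eps * (math.log(2.0 * eps) + 0.5)


for eps in (0.01, 0.1, 1.0, 10.0, 1000.0):
    ratio = sigma_klein_nishina(eps) / sigma_thomson()
    print(f"eps = {eps:g}: sigma_e / sigma_Thomson = {ratio:.4f}")
```

The atomic scattering cross section then follows by multiplying by Z, and the linear scattering coefficient by multiplying by the number of electrons per unit volume.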
c. Pair Production The pair-production cross section rises from zero
for $h\nu$ < 1.02 MeV to become the dominant factor in the attenuation
coefficient for very high energies. For $h\nu$ < 50 MeV the atomic
pair-production cross section is proportional to $Z^2$. As the photon energy
increases above the threshold value, $\pi_a$ increases slowly at first and then
more rapidly as the energy increases, finally reaching an essentially constant
value at some hundreds of MeV. For $h\nu \gg 137m_0c^2/Z^{1/3}$, the cross
section is given by


$$\pi_a = \frac{Z^2}{137}\left(\frac{e^2}{4\pi\epsilon_0 m_0c^2}\right)^2\left(\frac{28}{9}\ln\frac{183}{Z^{1/3}} - \frac{2}{27}\right) \tag{24.11}$$


Theoretical values of $\pi_a$ divided by Z are shown for three materials in
Fig. 24.3.


Fig. 24.3 Theoretical cross sections for the interaction of high-energy
photons with atoms of lead, aluminum, and air (Z taken as 7.26 for air).
For all three, $\sigma_a = Z\sigma_e$. The photoelectric cross section $\tau_a$ is shown only
for Pb; it is inappreciable for Al and air for $h\nu$ > 1 MeV. (From data in
W. Heitler, "The Quantum Theory of Radiation," 2d ed., 1944, Oxford University
Press.)


The mean distance L that a high-energy photon travels before producing
a pair is $L = 1/\pi_a N$, where N represents the number of atomic
nuclei per unit volume. Some values of L are as follows:


Mean Distance for Pair Production L, cm

Energy hν, MeV    In Standard Air    In Al    In Pb
25                9.8 × 10^4         27.4     1.25
100               5.9 × 10^4         17.0     0.86
1000              4.5 × 10^4         13.2     0.70
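The high-energy limit of Eq. (24.11) can be used to estimate the mean distance for pair production and to compare it with the 1000-MeV row of the table. The sketch below is only an order-of-magnitude check: the densities and atomic weights of lead and aluminum are standard handbook values supplied here as assumptions, and the function names are hypothetical.

```python
import math

R_ELECTRON = 2.818e-15   # classical electron radius, m
AVOGADRO = 6.022e23      # atoms per mole


def pair_cross_section_limit(z):
    """Atomic pair-production cross section (m^2) in the limit of Eq. (24.11)."""
    screening = 28.0 / 9.0 * math.log(183.0 / z ** (1.0 / 3.0)) - 2.0 / 27.0
    return z**2 / 137.0 * R_ELECTRON**2 * screening


def mean_distance_cm(z, density_g_per_cm3, atomic_weight):
    """Mean distance L = 1/(cross section x nuclei per unit volume), in cm."""
    nuclei_per_m3 = density_g_per_cm3 / atomic_weight * AVOGADRO * 1.0e6
    return 100.0 / (pair_cross_section_limit(z) * nuclei_per_m3)


# Assumed material data: (Z, density in g/cm^3, atomic weight).
materials = {"Pb": (82, 11.35, 207.2), "Al": (13, 2.70, 26.98)}
for name, (z, rho, a) in materials.items():
    print(f"{name}: L = {mean_distance_cm(z, rho, a):.2f} cm")
```

With these inputs the estimates come out a few percent below the tabulated 1000-MeV values, as expected for a formula that holds only in the limit of very high photon energy.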


Fig. 24.4 Mass attenuation coefficients for sodium iodide. The
scattering and absorption components of the Compton attenuation
coefficient and the total mass absorption coefficient are shown
by dashed lines. (Adapted from R. D. Evans, "The Atomic
Nucleus." Copyright 1955. McGraw-Hill Book Company. Used
by permission.)


For all elements a plot of the attenuation coefficient as a function
of photon energy is decreasing for $h\nu$ = 1 MeV. As the photon energy
increases, the contribution of pair production grows rapidly. This leads
to a minimum of $\mu_a$ at a photon energy which depends on Z. Minimum
attenuation occurs for lead at $h\nu \approx 3.5$ MeV, for copper about 10 MeV,
for aluminum about 25 MeV, and for air near 50 MeV. To summarize
the processes of photon attenuation, Fig. 24.4 shows by solid lines the
mass attenuation coefficients for sodium iodide, a material of great
importance for scintillation detectors. The dashed lines are discussed in
the next section.


24.3 Absorption vs. Attenuation


When a physicist measures attenuation coefficients, ideally he has a 
fine beam of single-energy photons (Fig. 24.5a) and considers as being 
removed from the beam every photon which fails to emerge with the 
same energy and same direction as it entered. In this case one speaks 
of a narrow-beam or good-geometry experiment. On the other hand, an 
experiment in which a significant fraction of the scattered or secondary 
particles can continue in the beam (Fig. 24.5b) is described as broad beam
or bad geometry. In a broad-beam arrangement much of the incident 
intensity may leave the absorber with wavelength and direction modified. 
The photons which escape are not absorbed, since their energy does not 
remain in the absorbing material. If we consider the processes which go 
into making up the total attenuation coefficient, we observe that in the 
photoelectric absorption a photon is completely removed from the beam 
and its energy transformed into kinetic energy of the ejected photoelectron
plus potential energy associated with the binding energy of that
electron. Even so, some atom in the absorber is left in an excited state
and may emit a photon of longer wavelength, which rejoins a broad beam. 
Similarly, a photon scattered through a small angle such as 10° can escape 
from the absorber in the broad-beam experiment, while it is counted as 
removed in a good-geometry experiment. A high-energy photon which 
creates pairs is removed from the beam, but the high-energy electron 
and positron may produce bremsstrahlung and annihilation radiation, 
which augment the broad beam. A dramatic example of this effect is 
to be found in the cosmic radiation. High in the upper atmosphere there 
is a flux of extremely high-energy photons, or secondary cosmic rays. 
Through pair production, followed by bremsstrahlung and annihilation 
radiation, each very high-energy photon on the average becomes a large 
number of lower-energy photons and associated secondary electron pairs. 
These offspring are much more strongly absorbed in air; as a consequence,
the energy absorbed per gram of air increases over a substantial
region as the cosmic rays descend through the atmosphere, even though
the total intensity (energy incident per unit area per second) is actually
decreasing. In both Compton scattering and pair production, a portion
of the energy of the incident beam is absorbed and appears as heat in the
absorbing layer. Another substantial portion may pass through with
reduced photon energy.


Fig. 24.5 An example of (a) narrow-beam and (b) broad-beam geometry.
Each panel shows a source, an absorber, and a detector; slits are also labeled.


In a layer of thickness dx the energy $dI_a$ absorbed per second per
unit area is proportional to the intensity I which reaches the layer; hence


$$dI_a = \kappa I\,dx \tag{24.12}$$


where $\kappa$ is the linear absorption coefficient. Clearly $\kappa$ is smaller than
the linear attenuation coefficient, since the latter takes into account the
energy content of all scattered photons as well as secondary photons from
excited atoms and from pair bremsstrahlung and pair annihilation. For
photons with energy of the order of 1.5 MeV, about half the energy of a
Compton scattered photon is absorbed on the average and half remains
in the beam. In Fig. 24.4 the dashed curves show $\kappa/\rho$ and how the
Compton coefficient is divided between absorption and scattered photons.
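To see how Eq. (24.12) distinguishes absorption from attenuation, one can add up the energy absorbed layer by layer in a slab. The sketch below rests on an assumption not made explicitly in the text: the intensity reaching each layer follows the narrow-beam law of Eq. (24.1). The coefficient values are arbitrary placeholders and the function names are hypothetical.

```python
import math


def absorbed_by_layers(i0, mu, kappa, thickness, steps=10000):
    """Sum dI_a = kappa * I * dx over thin layers, with I = i0 * exp(-mu * x)."""
    dx = thickness / steps
    total = 0.0
    for k in range(steps):
        x_mid = (k + 0.5) * dx          # midpoint of the k-th layer
        total += kappa * i0 * math.exp(-mu * x_mid) * dx
    return total


def absorbed_closed_form(i0, mu, kappa, thickness):
    """Analytic integral of the same expression."""
    return kappa / mu * i0 * (1.0 - math.exp(-mu * thickness))


# Placeholder coefficients (per cm) with kappa smaller than mu.
i0, mu, kappa, thickness = 1.0, 0.20, 0.08, 5.0
removed = i0 * (1.0 - math.exp(-mu * thickness))
absorbed = absorbed_by_layers(i0, mu, kappa, thickness)
print(f"removed from narrow beam: {removed:.4f}")
print(f"absorbed in slab:         {absorbed:.4f}")
```

Under this assumption the absorbed energy is the fraction kappa/mu of what is removed from the beam; the remainder leaves the slab as scattered or secondary photons.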

The damage produced in a biological specimen or an inanimate 
object such as a semiconductor circuit depends primarily on the amount 
of energy per unit volume which is dissipated in the absorber rather than 
on the total energy incident. Indeed, photons which pass completely
through without interacting with the specimen cause no damage. For 
discussing radiation damage it is convenient to define the dose rate for 
a sample exposed to a photon beam as the energy absorbed per unit mass
per unit time. 

In medical work the total x-ray energy falling on the subject is of 
minor interest, but the amount of energy absorbed per unit volume of the 
subject is of paramount importance. Therefore a unit of x-ray dosage, 
called the roentgen, was defined officially by the Fifth International 
Congress of Radiology in 1937 as follows: 


The roentgen shall be that quantity of x- or gamma-radiation such
that the associated corpuscular emission per 0.001293 g of dry air
produces, in air, ions carrying one electrostatic unit of quantity of
electricity of either sign.


A few remarks about this definition are in order. 


0.001293 g is the mass of 1 cm³ of dry air at 0°C and 760 mm Hg pressure.
The associated corpuscular emissi 


by the x-rays. The greatest 
chamber some of the photoelectrons produced in 1 cm³ of gas will leave
this volume and produce part of their ionization in other portions of the
chamber. These ions are to be collected and counted. The task of
doing this cannot be solved by simply making an ionization chamber with
a volume of 1 cm³ of dry air and collecting the resulting ionization, because
in this case some of the photoelectrons would strike the wall and produce 
secondary electrons which would be counted. These secondary electrons 
are not ions produced by the associated corpuscular emission. The 
design of a reliable dosage meter involves careful consideration and 
balance among many factors. 

Because photons of 100 keV energy are much more readily absorbed 
than 100-MeV photons, beams of equal intensity of monochromatic 
100-keV and 100-MeV photons do not yield equal dosage rates; the 
100-keV beam corresponds to many more roentgens (r) per unit time. 
A dosage of approximately 500 r from a 200-kV x-ray tube is sufficient 
to produce reddening in a typical human skin and is taken as the threshold 
erythema dose. A dosage of 700 r over the entire body is sufficient to 
produce death in approximately half of the human beings subjected to it; 
for this reason it is known as the median lethal dose. 

Since the roentgen is defined for the passage of photons through air, 
it does not specifically involve biological effects; in this sense the roentgen 
is a measure of exposure dose to high-energy photons. The effect on a 
biological specimen is properly expressed by introducing an absorbed dose 
for which the rad¹ is the common unit. The rad is the absorbed dose of any
high-energy radiation which is accompanied by the liberation of 100 ergs
of energy per gram of absorbing material (or 0.01 J/kg). For photons with 
energies between 0.3 and 3 MeV 1 r of exposure dose corresponds closely 
to 1 rad of absorbed dose in tissue. 

Although biological effects are produced by all ionizing radiations, 
the absorbed dose in rads required for a certain effect may be very differ- 
ent for different kinds of radiation. For example, 1 rad of neutrons is 
far more effective in producing cataracts in the eye than 1 rad of x-rays. 
This difference in response is taken into account by introducing the 
relative biological effectiveness (RBE), defined for any radiation as the 
ratio of the absorbed γ-ray dose in rads to the absorbed dose of the
specified radiation which produces the same biological effect. The dose 
unit for biological effects is called the rem for roentgen equivalent mammal, 
where 


Dose in rems = RBE X dose in rads 


1 An older unit, the rep or roentgen equivalent physical, corresponds to the liber- 
ation of 97 ergs/g of body tissue. 


24.4 Energy Loss of Charged Particles


When a charged particle with mass great compared with the mass of
the electron moves through matter, it loses energy through electromagnetic
interactions with electrons which are raised to excited states or
torn away from atoms. The radii of atomic nuclei are so small compared
with atomic dimensions that nuclear scattering and interactions are rare
compared to interactions with electrons; therefore, nuclear reactions
may be neglected in a first approximation.

In 1913 Bohr derived an expression for the space rate of energy loss
for a charged particle on the basis of classical considerations. He considered
a heavy particle, such as an α particle or a proton, of charge ze,
mass M, and velocity V passing an atomic electron of mass m at a
distance b (Fig. 24.6a). Bohr assumed that the electron was moving so
slowly that it could be regarded as remaining at essentially the same point
during the passing of the heavy particle. (Clearly this assumption is not
valid if the electron acquires a speed comparable to that of the incident
particle.) As the heavy particle passes, the electrostatic force acting
on the electron changes direction continuously. However, if the electron
moves negligibly during the passage of the heavy particle, the impulse
$\int F_x\,dt$ parallel to the path is zero by symmetry, since for each position
of the incident particle in the $-x$ direction there is a corresponding position
in the $+x$ direction which makes an equal and opposite contribution
to the x component of the momentum. However, throughout the passage,
there is a force in the y direction, and the associated impulse $I_y$ is
given by


$$I_y = \int_{-\infty}^{\infty} F_y\,dt = \int_{-\infty}^{\infty}\frac{ze^2\sin\theta}{4\pi\epsilon_0 r^2}\,\frac{dx}{V} = \int_0^\pi\frac{ze^2\sin\theta\,d\theta}{4\pi\epsilon_0 bV}$$


since $r = b/\sin\theta$, $x = -b\cot\theta$, and $dx = (b\,d\theta)/(\sin^2\theta)$.


The momentum transferred to the electron during the full passage
is therefore


$$p = I_y = \frac{ze^2}{2\pi\epsilon_0 bV} \tag{24.13}$$


If the electron has not achieved a relativistic velocity, its kinetic energy
is given by


$$K = \frac{p^2}{2m} = \frac{z^2e^4}{8\pi^2\epsilon_0^2 b^2 mV^2} \tag{24.14}$$


Fig. 24.6 (a) A heavy particle of charge ze passing an electron at distance b.
(b) A shell of radius b and thickness db with its axis the path of the heavy
charged particle.


To calculate the energy lost per unit path length by the heavy particle,
we observe that the number of collisions per unit length for which b lies
between b and b + db is equal to the number of electrons per unit length
in a shell bounded by cylinders of radii b and b + db (Fig. 24.6b). If
n represents the number of electrons per unit volume, the number of
electrons per unit length in the shell is $2\pi bn\,db$ and the energy loss due
to these electrons is


$$-\frac{dE(b)}{dx} = \frac{z^2e^4n}{4\pi\epsilon_0^2 mV^2}\,\frac{db}{b}$$


while the total energy per unit path length lost by the heavy charged
particle to electrons in all shells in the range between $b_{max}$ and $b_{min}$ is


$$-\frac{dE}{dx} = \int_{b_{min}}^{b_{max}} -\frac{dE(b)}{dx} = \frac{z^2e^4n}{4\pi\epsilon_0^2 mV^2}\ln\frac{b_{max}}{b_{min}} \tag{24.15}$$


A glance at Eq. (24.15) shows why the integral over db is not
taken from zero to infinity; if it were, the energy loss per unit path
length would be infinite. It now becomes a challenge to select reasonable
values for $b_{min}$ and $b_{max}$. To choose a meaningful value for $b_{min}$,
we observe that if the heavy particle collided head on with the electron,
the maximum speed the electron could have is 2V, since in an elastic
collision the velocity of separation is equal to the velocity of approach.
The corresponding maximum kinetic energy (for a nonrelativistic V) is
$K_{max} = \tfrac{1}{2}m(2V)^2 = 2mV^2$. If this value of $K_{max}$ is inserted in Eq.
(24.14), the corresponding $b_{min}$ becomes


$$b_{min} = \frac{ze^2}{4\pi\epsilon_0 mV^2} \tag{24.16}$$
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If bmax is allowed to become infinite, —dE/dx goes to infinity 
because of the contribution of an unlimited number of small energies 
given to distant electrons. But the smallest energy an atomic electron 
can accept must be sufficient to raise it to an allowed excited state. If I 
represents the average excitation energy of an electron and we choose 
Kin = I, we find 


isee ae : 24.17) 
Qe ~/2mV7I (24. 


Bmax = 


When Eqs. (24.16) and (24.17) are substituted in Eq. (24.15), we obtain

$$-\frac{dE}{dx} = \frac{z^2e^4 n}{8\pi\epsilon_0^2 mV^2} \ln\frac{2mV^2}{I} \tag{24.18}$$


This result is, of course, only a crude approximation. The value for
$b_{\min}$ [Eq. (24.16)] is reasonable only in the nonrelativistic limit
and only when the coulomb field of the heavy particle is essentially
constant over the wave packet which represents the electron. This packet
has a size about $\lambda/2\pi$, where $\lambda$ is the de Broglie
wavelength of the electron. In the center-of-mass frame the electron has
a speed of almost V, and its de Broglie wavelength is
$\lambda = h\sqrt{1 - V^2/c^2}/mV$, where m is the rest mass of the
electron. Quantum-mechanically, only values of b larger than
$\lambda/2\pi$ are meaningful, and this leads to the prediction that

$$b_{\min} = \frac{\hbar\sqrt{1 - V^2/c^2}}{mV} \tag{24.19}$$

Relativistic quantum-mechanical treatments have been given by Bethe,
by Bloch, and by others. According to Bethe, if V < 0.95c,

$$-\frac{dE}{dx} = \frac{z^2e^4 n}{4\pi\epsilon_0^2 mV^2}\left[\ln\frac{2mV^2}{I(1 - V^2/c^2)} - \frac{V^2}{c^2}\right] \tag{24.20}$$

The average excitation energy I is difficult to compute and is
ordinarily treated as an adjustable parameter selected to fit experimental
data. For most elements I in electron volts is roughly 13Z, where Z
is the atomic number of the stopping atoms; for the very light elements
hydrogen, helium, and beryllium the experimental values of I are 19,
44, and 64 eV, respectively, for incident particles with energies in the
MeV range.
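
As a rough numerical check of Eqs. (24.18) and (24.20), the following sketch evaluates both for a 5-MeV proton in air. The electron density of about 3.9 × 10²⁶ m⁻³ (roughly 14.4 electrons per molecule at 760 mm and 0°C) and the value I ≈ 94 eV (from I ≈ 13Z with an effective Z of about 7.2) are assumptions made for illustration, not values given in the text, and the function names are hypothetical.

```python
import math

# Physical constants (SI units)
E_CHARGE = 1.602176634e-19     # elementary charge, C
EPS0 = 8.8541878128e-12        # permittivity of free space, F/m
M_ELECTRON = 9.1093837015e-31  # electron rest mass, kg
M_PROTON = 1.67262192e-27      # proton rest mass, kg
C_LIGHT = 2.99792458e8         # speed of light, m/s


def classical_loss(z, speed, n_e, mean_exc_ev):
    """-dE/dx in J/m from the classical estimate, Eq. (24.18)."""
    exc = mean_exc_ev * E_CHARGE
    pre = z**2 * E_CHARGE**4 * n_e / (
        8 * math.pi * EPS0**2 * M_ELECTRON * speed**2)
    return pre * math.log(2 * M_ELECTRON * speed**2 / exc)


def bethe_loss(z, speed, n_e, mean_exc_ev):
    """-dE/dx in J/m from Bethe's expression, Eq. (24.20)."""
    exc = mean_exc_ev * E_CHARGE
    beta2 = (speed / C_LIGHT) ** 2
    pre = z**2 * E_CHARGE**4 * n_e / (
        4 * math.pi * EPS0**2 * M_ELECTRON * speed**2)
    log_term = math.log(2 * M_ELECTRON * speed**2 / (exc * (1 - beta2)))
    return pre * (log_term - beta2)


def to_kev_per_cm(loss_j_per_m):
    """Convert J/m to keV/cm."""
    return loss_j_per_m / E_CHARGE / 1e3 / 1e2


# Assumed illustration: 5-MeV proton (nonrelativistic speed) in standard air.
speed = math.sqrt(2 * 5e6 * E_CHARGE / M_PROTON)   # m/s
N_E_AIR = 3.9e26    # electrons per m^3 (assumed)
I_AIR_EV = 94.0     # eV (assumed, from I ~ 13Z)

classical = to_kev_per_cm(classical_loss(1, speed, N_E_AIR, I_AIR_EV))
bethe = to_kev_per_cm(bethe_loss(1, speed, N_E_AIR, I_AIR_EV))
print(f"classical: {classical:.0f} keV/cm, Bethe: {bethe:.0f} keV/cm")
```

With these assumed inputs the classical estimate comes out close to one-half of Bethe's value, as the prefactors 1/8π and 1/4π suggest when V is small compared with c.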


The energy loss per unit path length has a minimum when V ≈ 0.96c.

Fig. 24.7 Energy loss per unit path length for a heavy singly ionized
particle passing through oxygen (760 mm, 0°C) as a function of particle
speed (lower scale) and of the ratio of kinetic energy to rest energy
(upper scale).


For higher speeds the early theories predicted (Fig. 24.7) that $-dE/dx$
should increase as the logarithm of the energy of the heavy particle
because of a relativistic increase in the radius of action of the electric
field of the penetrating particle. However, in 1938 Swann observed
that polarization effects would screen distant electrons, and in 1940
Fermi developed a theory showing that the polarization effects depend
primarily on the electron density n. As a result of the polarization and
the density effect, $-dE/dx$ is almost constant in the extreme
relativistic range.

The quantity $-dE/dx$ is called the linear stopping coefficient or the
stopping power for the material through which the heavy charged particle
passes. For a particle with V < 0.95c, this space rate of energy loss
is (1) proportional to the number of electrons per unit volume in the
stopping material and (2) proportional to the square of the charge and
roughly inversely proportional to the square of the velocity of the
incident particle.

The stopping coefficient depends on the charge and the speed (but
not on the mass) of the incident particle. Hence a measurement of the
stopping power of a material for protons as a function of speed permits
one to calculate the stopping power of a material for other charged
particles, such as α particles, muons, or deuterons, provided
$V > \sqrt{I/2m}$ and provided the penetrating particle is moving so
rapidly that it does not pick up or lose electrons. Since $|dE/dx|$
increases as V decreases, slower particles produce more ion pairs per
unit path length than faster ones.
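
The scaling just described can be sketched in a few lines. The example lists, for several particles, the kinetic energy at which each has the same speed as a 1-MeV proton and the factor by which its stopping power exceeds the proton value at that speed. The mass ratios are approximate assumed values and the function names are hypothetical; the scaling holds only under the two provisos stated above.

```python
# Approximate charge numbers z and mass ratios M/M_p (assumed round values).
PARTICLES = {
    "deuteron": (1, 2.0),
    "alpha": (2, 3.97),
    "muon": (1, 0.113),
}


def equal_speed_energy(proton_energy_mev, mass_ratio):
    """Kinetic energy (MeV) giving the same speed as the proton.

    At equal speed the kinetic energy is proportional to the rest mass.
    """
    return proton_energy_mev * mass_ratio


def stopping_ratio(z):
    """Stopping power relative to a proton of the same speed (z squared)."""
    return z**2


for name, (z, ratio) in PARTICLES.items():
    energy = equal_speed_energy(1.0, ratio)
    print(f"{name}: {energy:.2f} MeV matches a 1-MeV proton in speed; "
          f"stopping power is {stopping_ratio(z)} x the proton value")
```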

The distance a charged particle moves through a stopping material
before it is brought to rest is called the range of the particle. For a
particle of initial energy $E_i$, the range R is

$$R = \int_0^{E_i} \frac{dE}{-dE/dx} \tag{24.20a}$$


In early experiments on nuclear transmutations, the energy of an emitted
particle was usually deduced from its range in air at 15°C and 1 atm
pressure (Fig. 24.8). In 1910 Geiger found that the range of natural α
particles in air is given in centimeters by $R \approx 0.32E^{3/2}$ (E is
the energy in MeV); Geiger's rule is a good approximation for
4 MeV < E < 10 MeV.
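
Geiger's rule, R ≈ 0.32E^(3/2), is easy to evaluate in either direction. The sketch below is illustrative only; the function names and the 5.3-MeV example energy are assumptions, not taken from the text.

```python
def geiger_range_cm(energy_mev):
    """Range in air (cm) of an alpha particle from Geiger's rule."""
    if not 4.0 <= energy_mev <= 10.0:
        raise ValueError("Geiger's rule is quoted only for 4 to 10 MeV")
    return 0.32 * energy_mev ** 1.5


def geiger_energy_mev(range_cm):
    """Invert Geiger's rule to estimate the energy (MeV) from the range."""
    return (range_cm / 0.32) ** (2.0 / 3.0)


print(f"{geiger_range_cm(5.3):.1f} cm")      # assumed example energy
print(f"{geiger_energy_mev(7.0):.1f} MeV")   # energy for a 7-cm range
```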

As an α particle approaches the end of the range, it may capture
and lose electrons very rapidly; thus the effective charge varies between
0 and 2. An α particle in air loses energy at a maximum rate when
its residual range is about 0.4 cm; then it produces 6600 ion pairs per
millimeter of path length, compared to about 2000 when it has many
MeV of energy. The number of ion pairs per unit path length is called
the specific ionization of the particle. For protons the specific
ionization has a maximum at 2750 ion pairs per millimeter when the proton
is about 1 mm from the end of its range in air.

Fig. 24.8 Range-energy relations for α particles and protons in standard
air (760 mm, 0°C). Axes: range of α particles and range of protons, cm of
air, versus energy, MeV. (Data from E. Segrè, "Experimental Nuclear
Physics," vol. 1, John Wiley & Sons, Inc., New York, 1953.)

Although magnetic and electrostatic deflection methods have long 
since displaced range measurements for accurate energy determinations, 
the energy loss of charged particles as they pass through matter continues 
to be a topic of considerable importance. In using the better tech- 
niques, it is still necessary to make corrections for the energy loss of 
particles in layers of material on the source or in layers through which
the particles pass. To make such corrections it is necessary to know the 
differential rate of energy loss with path. 


24.5 The Stopping of Electrons 


A high-energy electron passing through matter loses energy both by 
ionization and excitation of atoms and by the emission of electromag- 
netic radiation. For energies up to a few MeV the former processes 
are dominant, but at very high energy radiation losses are the more 
important. 

There are two principal reasons why Eq. (24.20) is not applicable 
to electrons: (1) in the derivation it was assumed that the incident 
particle was undeflected, while when an incident electron interacts with 
another or with a nucleus, the transverse momentum given to the incident 
particle is far from negligible; there is considerable scattering of the 
electrons in encounters with electrons and nuclei; (2) for collisions between 
identical, and hence indistinguishable, particles one cannot tell which 
of the two final electrons was the incident one; in the quantum-mechanical 
theory there exist important exchange phenomena which alter the results 
substantially. The ionization-excitation loss for electrons is given by a 
formula due to Bethe: 


$$-\left(\frac{dE}{dx}\right)_{\text{ion}} = \frac{ne^4}{8\pi\epsilon_0^2 mV^2}\left[\ln\frac{mV^2K}{2I^2(1-\beta^2)} - \left(2\sqrt{1-\beta^2} - 1 + \beta^2\right)\ln 2 + 1 - \beta^2 + \frac{1}{8}\left(1 - \sqrt{1-\beta^2}\right)^2\right] \tag{24.21}$$

where K is the kinetic energy, V the speed of the electron, β = V/c, and
all other symbols have the same meaning as in Eq. (24.20).

Bethe and Heitler have shown that when $K \gg mc^2$, the differential
energy loss due to bremsstrahlung (continuous x-radiation primarily from
acceleration of the electrons in the electric fields of nuclei) is given by


$$-\left(\frac{dE}{dx}\right)_{\text{rad}} = \frac{4Ze^6NK(Z + 1.3)\left[\ln(183Z^{-1/3}) + \frac{1}{8}\right]}{(4\pi\epsilon_0)^3\hbar m^2c^5} \tag{24.22}$$


where N is the number of atoms per unit volume of atomic number Z.
The radiation loss is proportional to the kinetic energy K of the electron
(or positron), whereas the ionization loss is proportional to ln K and
inversely proportional to the square of the velocity. Thus the radiation
loss increases very much more rapidly with K than the ionization loss.
For lead the radiation loss is equal to the ionization loss at 7 MeV,
while for hydrogen the equality comes at 340 MeV.

Fig. 24.9 Theoretical curves for energy loss per unit path length by
charged particles as a function of the kinetic energy E of the particle
(vertical scales: $-dE/dx$ in air, keV/cm, and $-dE/dx$ in lead, MeV/cm).
Curves e-ec and e-et: collision loss and total (including radiation)
energy loss for electrons in standard air. Curves ePb-ecPb and ePb-etPb:
ionization-excitation loss and total loss for electrons in lead. Curves
m-207 and p: total loss in air by a muon and by a proton.

In the extremely high energy range the energy loss per unit path
length is proportional to the kinetic energy K. This leads to an
exponential loss of energy as the electron or positron passes through a
material. In this case $-dE/dx = -dK/dx = K/\Lambda$, where Λ is called
the radiation length. In the region where this relation holds,
$K = K_0e^{-x/\Lambda}$. For various materials it is customary to
tabulate Λρ in grams per square centimeter, where ρ is the density of the
stopping material. For air and water Λρ is about 36 g/cm² when
K > 85 MeV; for iron Λρ = 14 g/cm² when K > 24 MeV; for lead
Λρ = 6 g/cm² when K > 7 MeV. Figure 24.9 shows curves for the energy loss
of electrons in air and in lead along with air data for a muon and a
proton for comparison.
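
To make the exponential law concrete, the sketch below converts the tabulated product of radiation length and density for lead into a length and follows the energy of an electron with depth. The lead density of 11.35 g/cm³ is an assumed handbook value, the function names are hypothetical, and the result applies only in the high-energy region where radiation loss dominates.

```python
import math


def radiation_length_cm(mass_thickness_g_cm2, density_g_cm3):
    """Convert the product (radiation length x density) to a length in cm."""
    return mass_thickness_g_cm2 / density_g_cm3


def energy_after(k0_mev, depth_cm, rad_length_cm):
    """K = K0 exp(-x / radiation length)."""
    return k0_mev * math.exp(-depth_cm / rad_length_cm)


LEAD_DENSITY = 11.35   # g/cm^3 (assumed handbook value)
lam_lead = radiation_length_cm(6.0, LEAD_DENSITY)
print(f"radiation length in lead: {lam_lead:.2f} cm")
print(f"1000-MeV electron after 1 cm of lead: "
      f"{energy_after(1000.0, 1.0, lam_lead):.0f} MeV")
```

After one radiation length the energy has fallen to 1/e of its initial value, which is the defining property of Λ.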


24.6 Cerenkov Radiation 


When a charged particle moves through a medium with refractive index n
with a speed greater than the speed c/n of light in the medium, coherent
light is emitted which is analogous to an acoustic shock wave and to the
bow wave of a ship. This phenomenon was discovered and investigated
quantitatively by Čerenkov in 1934. Three years later Frank and
Tamm explained the Čerenkov radiation on the basis of classical
electromagnetic theory. From the early days of radioactivity it was known
that various substances, such as solutions of mineral salts, emitted feeble
light when exposed to the radiations from radioactive materials. In
some cases the light was fluorescent in origin. In 1929 Mallet recorded
ultraviolet spectra from water irradiated by γ rays, which came from
Čerenkov radiation although the latter had not yet been identified.

A particle of charge ze moving through a dielectric medium has an
accompanying electric field that temporarily polarizes the atoms. They
follow the waveform of the pulse and, in so doing, reradiate. If the speed
V of the particle is less than the local speed of light, the scattered
radiations from all segments of the track interfere destructively.
However, when V exceeds c/n, scattered radiation from all points on the
track can be in phase in certain directions. A coherent wavefront
(Fig. 24.10) is propagated through the medium in the directions AC which
satisfy the condition

$$\cos\theta = \frac{c/n}{V} = \frac{c}{nV} \tag{24.23}$$

The electric vector of the Čerenkov radiation is perpendicular to the
surface of the cone.

Fig. 24.10 A charged particle moving through a medium at a speed V
greater than the local speed c/n of light emits Čerenkov radiation at
angle θ such that cos θ = c/nV.

According to the theory of Frank and Tamm, the energy loss per
unit path length due to Čerenkov radiation is

$$-\frac{dE}{dx} = \frac{\pi z^2e^2}{\epsilon_0c^2}\int\left(1 - \frac{c^2}{V^2n^2}\right)\nu\,d\nu \tag{24.24}$$



where the integral extends over all frequencies for which Vn/c > 1.
This limits the radiation to frequencies less than those of the ultraviolet
absorption bands of the medium. Below this limit the energy loss goes
as $\nu\,d\nu$ (or $d\lambda/\lambda^3$), which reveals that the light is
concentrated in the violet end of the spectrum. From Eq. (24.24) one finds
that a fast (V ≈ c) electron passing through water emits about 23 visible
photons per millimeter of path length.
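
The photon yield can be estimated by dividing the integrand of Eq. (24.24) by the photon energy hν and integrating between two wavelengths with n held constant, which gives N/L = 2παz²(1 - c²/V²n²)(1/λ₁ - 1/λ₂), where α = e²/4πε₀ħc is the fine-structure constant. The sketch below evaluates this expression together with the angle of Eq. (24.23). The 400-to-700-nm band taken as "visible" and the function names are assumptions made for illustration.

```python
import math

FINE_STRUCTURE = 1 / 137.036   # e^2 / (4 pi eps0 hbar c), dimensionless


def cerenkov_angle_deg(beta, n):
    """Emission angle from cos(theta) = c/(nV); None below threshold."""
    if beta * n <= 1.0:
        return None
    return math.degrees(math.acos(1.0 / (beta * n)))


def photons_per_metre(z, beta, n, lam_short_m, lam_long_m):
    """Photons per metre of path between two wavelengths, n constant."""
    if beta * n <= 1.0:
        return 0.0
    sin2 = 1.0 - 1.0 / (beta * n) ** 2
    band = 1.0 / lam_short_m - 1.0 / lam_long_m
    return 2 * math.pi * FINE_STRUCTURE * z**2 * sin2 * band


# Fast electron (beta ~ 1) in water, n = 1.33; the band limits are assumed.
angle = cerenkov_angle_deg(1.0, 1.33)
per_mm = photons_per_metre(1, 1.0, 1.33, 400e-9, 700e-9) / 1000.0
print(f"angle: {angle:.1f} degrees, yield: {per_mm:.0f} photons per mm")
```

The yield obtained this way is of the same order as the figure of about 23 photons per millimeter quoted above; the exact number depends on the band limits chosen.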

Čerenkov radiation is not a major cause of energy loss from the
point of view of stopping the particle. For example, a 100-MeV electron
in water loses about 2 MeV/cm in ionization, an equal amount in
bremsstrahlung, and only 2.7 keV/cm in Čerenkov radiation. For a
1-GeV proton (V/c = 0.87) the ionization loss is again 2 MeV/cm, while
the Čerenkov loss is about 1.7 keV/cm and the bremsstrahlung loss is only
10 eV/cm. Although the Čerenkov loss is small, it is extremely important
because (1) the light is concentrated in the visible and near
ultraviolet, (2) it appears at a well-defined angle and with well-defined
polarization, and (3) the Čerenkov pulse is sharp ($\ll 10^{-10}$ s).
These features make it possible to develop energy-selective directional
counters, capable of detecting and of measuring the speed of high-energy
charged particles.


24.7 Detection of Charged Particles 


In most detecting devices, the passage of a charged particle is
manifested by its ionization of the medium through which it passes. An
α particle, for example, in passing through a gas will both ionize and
excite the atoms along its path by the action of its electric field on the
atomic electrons, producing free electrons and positive ions. In many
gases (oxygen and chlorine are examples) the electrons so released
immediately attach themselves to neutral atoms and form negative
ions; in others, like the noble gases, the electrons may travel by
themselves. On the average, a charged particle loses about 34 eV of energy
for each ion pair that is formed. Both γ rays and x-rays can be detected
through their interaction with atoms; in the photoelectric or Compton
processes, for example, part of the energy of the radiation is converted
into kinetic energy of an electron, and the electron is then detected
through its ionizing power.


a. Ionization Chambers and Counters In electrical detection methods,
the ions produced by the passage of a charged particle are separated by
means of an electrostatic field. The ions move toward the electrodes
and on arrival are registered by an electrometer or some type of
vacuum-tube amplifier. A typical arrangement is shown in Fig. 7.2.
Particles may enter through a thin window or, in the case of γ rays,
secondary electrons may be ejected from the wall. If a 5-MeV α particle,
for example, is stopped in the gas of the chamber, it will produce about
$5 \times 10^6/34 \approx 150{,}000$ ion pairs. Supposing the wire and
measuring system to have a capacity of 10 pF, the potential of the wire
will be changed by

$$\frac{1.5 \times 10^5 \times 1.6 \times 10^{-19}}{10 \times 10^{-12}} = 0.0024\ \text{V}$$

With a reasonably good amplifier, a pulse of this size is easily detectable.
It is ordinarily preferable to use a noble gas since electrons move much
more rapidly in the collecting field than negative ions do, and
recombination of ions in the gas is minimized.
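
The pulse-height estimate above can be repeated for other energies and capacitances with a short sketch. The function names are hypothetical; the 34 eV per ion pair and the rounded electronic charge follow the figures used in the text.

```python
E_CHARGE = 1.6e-19   # electronic charge in C, rounded as in the text


def ion_pairs(energy_ev, ev_per_pair=34.0):
    """Number of ion pairs formed when the particle is stopped in the gas."""
    return energy_ev / ev_per_pair


def pulse_height_volts(n_pairs, capacitance_farads):
    """Change in potential when one sign of charge is collected."""
    return n_pairs * E_CHARGE / capacitance_farads


pairs = ion_pairs(5e6)                        # 5-MeV alpha particle
volts = pulse_height_volts(pairs, 10e-12)     # 10-pF wire and input
print(f"{pairs:.3g} ion pairs, {volts * 1e3:.1f} mV")
```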

With a sufficiently fine wire and high positive voltage on the wire,
it is possible to obtain so strong a field that electrons nearing the wire
are accelerated to the point where they may produce further ions. The
new electrons so formed are again accelerated and produce still more
ions, resulting in a multiplication of the original charge by a factor which
may in practical cases be as much as 1000. Under these conditions the
device is called a proportional counter, since the pulse size is
proportional to the original ionization. The counter used by Rutherford and
Geiger in their determination of the number of α particles from radium
(Fig. 8.2) was of this type. At still higher voltages, the so-called Geiger
counting region is reached; in this condition the rate of multiplication
is so great that the discharge spreads through the whole of the counter,
giving rise to a current which is limited only by the power supply. In
practice, a limiting resistance is inserted in the circuit to lower the
voltage and quench the discharge. The pulse output represents then
merely a count, and its magnitude bears no relation to the number of
original ions. A Geiger counter will easily respond to individual β
particles.


b. Scintillation Counters Many of Rutherford's early experiments
on charged particles were made by counting scintillations in a zinc sulfide
screen. When an ionizing particle passes through a crystal lattice such
as that of zinc sulfide, it excites electrons in the lattice, and in the course
of rearrangement the lattice may reemit some of the absorbed energy
in the form of light. It is important that the emitted radiation be of a
wavelength which is not strongly absorbed by the lattice. The action
in the case of zinc sulfide is dependent on a trace of impurity such as Cu
or Ag. A number of other substances, both inorganic and organic, have
the same property. In modern applications a photomultiplier tube is
used to detect the scintillations, and a response which is directly
proportional to the energy loss in the scintillator can be obtained. A
typical arrangement is shown in Fig. 24.11.

Fig. 24.11 Scintillation counter (schematic diagram).

The light emitted by the scintillator
is collected on a photosensitive surface which comprises the cathode of
the photomultiplier tube. The photoelectrons are accelerated by an 
electric field to the first electrode; at this electrode each impinging elec- 
tron produces two to six secondaries, and these in turn travel to the 
second electrode, there multiplying again. The whole process, which 
may continue through 10 or more stages, requires less than 10-* s, and 
the amplification is sufficient to allow direct display of the pulse on an 
oscilloscope screen. 
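
The size of the amplification follows directly from the stage-by-stage multiplication. The sketch below assumes, for illustration only, that every stage yields the same number of secondaries and that no electrons are lost between stages; the function name is hypothetical.

```python
def photomultiplier_gain(secondaries_per_stage, stages):
    """Overall electron multiplication for identical, lossless stages."""
    return secondaries_per_stage ** stages


for delta in (2, 4, 6):
    gain = photomultiplier_gain(delta, 10)
    print(f"{delta} secondaries per stage, 10 stages: gain {gain:.2e}")
```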


c. Cloud Chamber The path of an ionizing particle through a gas 
can be made visible by means of the cloud chamber, invented by C. T. R. 
Wilson in 1912. This device consists essentially of a cylindrical or 
rectangular chamber provided with transparent windows and closed at 
the bottom by a movable piston or a flexible diaphragm. In the chamber 
are contained a gas and a supply of some liquid which has a reasonable 
vapor pressure at room temperature; a mixture of ethyl alcohol and
water is often used. After the vapor has reached equilibrium, the 
chamber is suddenly expanded by pulling out the piston or diaphragm, 
and the gas is rapidly cooled, producing a condition of supersaturation 
of the vapor. In the absence of any condensation nuclei, the super- 
saturation can persist for a long time, but if any ions are present, they 
act as centers on which droplets of liquid can grow. Thus, with suitable 
illumination the trail of ions which mark the path can be seen and 
photographed. If a magnetic field is provided, the tracks are curved, 
and from a measurement of the reprojected images the magnetic rigidity 
can be determined. If the particle velocity is nonrelativistic, i.e., not 
too near the velocity of light, the density of droplets per unit path length 
can be used to estimate Z/v. 


d. Photographic Plate One of the earliest detection methods, and
with some modifications one of the most modern, makes use of the
photographic plate. Röntgen's and Becquerel's initial investigations
were based on the blackening of the photographic plates by charged
particles. For reasons which are not yet completely understood, the
production of a few hundred ions in the neighborhood of a silver halide
grain in the emulsion renders the grain developable. When a large number
of such events have occurred in a small region of a plate, that region
is blackened in the development process. Ordinary emulsions contain
only a small amount of silver compared to the other components present,
and hence many ionizing events may be required to produce a detectable
effect. With recent development of emulsions containing over 80 percent
silver halide by weight, however, the sensitivity is much increased,
and under microscopic examination of the developed plate, the path of
each ionizing particle appears as a track of silver grains. The
photographic-plate method has had considerable application, particularly in
the study of cosmic rays, where emulsions sufficiently thick to stop even
very high-energy particles are used.


e. Bubble Chamber In 1952 Glaser invented the bubble chamber, in 
which a charged particle moves through a superheated liquid rather 
than a supersaturated gas as in a cloud chamber. Bubble chambers 
containing liquid hydrogen have proved to be particularly valuable for 
studying many high-energy reactions for which the particle tracks could 
not be completely shown in a cloud chamber of reasonable size. The 
charged particle interacts with many more atoms per unit path length 
in a liquid than in a gas and loses its energy in a much shorter distance. 
In the bubble chamber the liquid is kept at an elevated pressure with its
temperature just too low for boiling to occur. When a burst of charged
particles enters the chamber, the pressure is suddenly reduced. Ions
produced by the particles serve as nuclei on which bubbles form
preferentially. The bubble tracks are photographed and the high pressure
restored before general boiling begins.


f. Spark and Streamer Chambers If a metal plate or wire is charged
to a high potential and another conductor is placed nearby, a spark may 
jump the gap between the conductors when a “bridge” of ions is created 
across the gap by the passage of an ionizing particle. A common form 
of spark chamber is constructed from a series of closely spaced parallel 
metal plates with alternate plates charged to a high potential by a short 
pulse triggered by the arrival of an ionizing particle. Sparks leap through 
the noble gas (often a mixture of helium and neon) filling the region 
between the plates, revealing the path of the particle by the location 
of the sparks. The spark track is photographed and analyzed much as 
a cloud-chamber photograph is. Another form of spark chamber utilizes 
a two-dimensional array of parallel wires to locate the spark path by 
electrical signals which can be fed directly into a computer. 

In the streamer chamber an extremely short pulse is applied
between conductors. So short is the pulse that it produces very short
flashes, called streamers, which are localized along the path of the particle.
As photographed in the direction perpendicular to the field, the track
consists of a series of short parallel streamers; photographed parallel to
the field lines, the track appears as a series of dots. The streamers are
localized and do not extend from conductor to conductor. Consequently
the streamer chamber can delineate a track parallel to the
conductors and can show simultaneously many tracks, a major advantage
over the spark chamber.


1. Assuming that $\tau_a = CZ^4\lambda^3$, where
$C = 2.6 \times 10^{-30}$ m²/Å³, and that $\sigma_a = Z\sigma_e$,
calculate the atomic attenuation coefficient of copper for
λ = 0.2 Å and λ = 0.7 Å. (Handbook values are 164 and 512 barns,
respectively.)

2. Starting with Eqs. (24.6a, b), derive Eqs. (24.7), (24.9), and (24.9a). 
3. Derive asymptotic expressions for Eq. (24.10) for high energies 
[Eq. (24.10b)] and for low energies through the termin to the first power 
[first two terms in Eq. (24.10a)]. 

4. Estimate the mass scattering coefficient of magnesium for 20-MeV
bremsstrahlung.

5. Calculate the number of ion pairs produced per gram of dry air 
by an exposure dose of 1 r. Assuming that on the average it requires 
33 eV to produce one ion pair, calculate the energy dissipated per gram 
of air by an exposure dose of 1 r. 

6. A beam of 1.5-MeV photons passes through a sodium iodide crystal
0.8 cm thick. With the aid of Fig. 24.4 make a rough estimate of what
fraction of the incident intensity appears beyond the crystal in the case
of (a) a narrow-beam experiment, and (b) a broad-beam experiment.

7. Prove that the smallest energy a photon can have and still produce
a pair in the field of a free electron is 2.04 MeV.

8. Find the maximum energy which a 10-MeV proton can impart to an
electron initially at rest.

9. The specific ionization of a particle is the number of ion pairs
produced per unit path length. If it requires 33 eV on the average to
produce an ion pair in air, estimate the stopping power of standard air
and the specific ionization for 9-, 4-, and 2-MeV α particles from the
curves of Fig. 24.8.

10. An experimental value for the atomic stopping cross section of
copper for 1-MeV protons is $1.3 \times 10^{-18}$ eV·m²/atom. From this
value estimate the atomic stopping cross section of Cu for 4-MeV α
particles, for which the experimental value is $5.1 \times 10^{-18}$
eV·m²/atom.

11. Calculate $-dE/dx$ for 5-MeV protons and 20-MeV α particles in
air by use of Eq. (24.18). Compare your result with the values implicit
in Fig. 24.8.

12. For photons in the visible region the index of refraction is about 1.5
for glass, lucite, or mica. Calculate the angle θ for Čerenkov radiation
for (a) a 100-MeV electron and (b) a 1-GeV proton.

13. Prove that photons are concentrated in the short-wavelength region
of the Čerenkov spectrum by showing that the number of photons with
wavelength between λ and λ + dλ is proportional to $1/\lambda^2$.

14. (a) Show from Eq. (24.24) that the number N of Čerenkov photons
emitted per length L with wavelength between $\lambda_1$ and $\lambda_2$ is

$$\frac{N}{L} = \frac{\pi z^2e^2}{\epsilon_0 hc}\left(1 - \frac{c^2}{V^2n^2}\right)\left(\frac{1}{\lambda_1} - \frac{1}{\lambda_2}\right)$$

providing n is essentially constant over the wavelength range in question.
(b) When an electron with V ≈ c moves through water with n = 1.33, how
many photons are emitted per centimeter of path with wavelength between
4000 and 6000 Å?


15. A magnetic analyzer, used to select particles of a given momentum,
passes both muons and charged pions. In certain energy ranges it is
possible to tell which particles in the beam are muons by use of a Čerenkov
counter. Find over what momentum range (in units of MeV/c) a
Čerenkov counter with index of refraction 1.33 will give a pulse for a
muon but none for a pion of the same momentum.


chapter twenty-five 


The Nucleus 


We turn now from a consideration of crystalline aggregates of atoms
to a further study of atomic nuclei, their properties, and structure. It
occurred to a number of physicists, after the high energy of the radiations
from radioactive substances was discovered, that these radiations might
be capable of inducing transmutations in other elements. Several
experiments were tried in which radioactive materials were mixed with stable
substances and the products of transmutation searched for by chemical
analysis. Production in this way of neon and argon from water was reported
as early as 1907, but later experiments failed to substantiate this claim
and the effect was generally attributed to contamination. Actually the
number of atoms which one could hope to transmute by such procedures is so
small as to preclude detection by chemical means; it was not until methods
had been developed whereby individual events could be detected that
artificial transmutation could be demonstrated.


25.1 Discovery of Artificial Transmutation


The first conclusive evidence for artificially induced transmutation was
obtained by Rutherford in 1919, in connection with some experiments
on the scattering of α particles in various gases. The apparatus used
in these studies is illustrated in Fig. 25.1. In a gastight chamber B is
mounted a source D of $\mathrm{Po}^{214}$, which emits α particles of about
7 cm range in standard air. A thin silver foil W covers a hole in the end
of the chamber, and a zinc sulfide scintillation screen F, observed with a
microscope M, permits counting of individual particles coming through the
window. Provision is made for inserting absorbing foils between the
window W and the screen F and for varying the distance x between the
source and the window. The whole apparatus is placed in a transverse
magnetic field to eliminate the β rays. In the first experiments, carried
out by Marsden in 1914, hydrogen gas was used in the chamber, and
scintillations due to the hydrogen recoils were observed with absorbers
having thicknesses up to about 28 cm air equivalent. This range was
approximately in agreement with expectations. If we consider the
conservation of momentum and energy in an elastic collision between an
incident particle of mass $M_0$ and the struck particle, of mass $M_1$, the
energy transferred is

$$E = E_0\,\frac{4M_0M_1}{(M_0 + M_1)^2}\,\sin^2\frac{\theta}{2} \tag{25.1}$$

where θ is the angle of deflection of the incident particle and $E_0$ is
the initial energy. For a $\mathrm{Po}^{214}$ α particle striking a proton
head on, the maximum energy transfer is $\frac{16}{25} \times 7.680$ MeV
= 4.9 MeV, and the range of such a proton in air is 32.5 cm (Fig. 24.8),
in reasonable agreement with the value found by Marsden.
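
The head-on figure quoted above follows from Eq. (25.1) with θ = 180°. A minimal sketch, with a hypothetical function name and the masses rounded to 4 u and 1 u as an assumption:

```python
import math


def energy_transfer(e0, m0, m1, theta_rad=math.pi):
    """Energy given to the struck particle in an elastic collision, Eq. (25.1).

    e0 is the initial energy of the incident particle of mass m0; m1 is the
    mass of the struck particle. The default angle corresponds to a head-on
    collision, which gives the maximum transfer.
    """
    mass_factor = 4 * m0 * m1 / (m0 + m1) ** 2
    return e0 * mass_factor * math.sin(theta_rad / 2) ** 2


# 7.680-MeV alpha particle striking a proton head on.
print(f"{energy_transfer(7.680, 4, 1):.2f} MeV")
```

For equal masses the mass factor is unity, so a head-on collision hands over the whole of the incident energy.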


Fig. 25.1 Apparatus by which Rutherford observed production of protons
in the disintegration of nitrogen by α particles. The movable source is
at D; particles produced by transmutations in the gas strike the screen F,
giving rise to scintillations observed by use of the microscope M.

In an investigation of certain peculiarities of the effects in hydrogen,
Rutherford made some tests in 1919 with air in the chamber, with the
unexpected result that long-range particles appeared again. Further
work showed that the effect was due to nitrogen, that neither oxygen nor
carbon dioxide produced similar particles, and that the particles were
much too numerous to be ascribed to hydrogen contamination. Rough
magnetic analysis indicated that the particles probably were protons,
and Rutherford concluded that they must have been knocked out of the
nitrogen nucleus by the impact of the α particles. The number observed
was about 1 per million α particles. The work was extended by Rutherford
and Chadwick in 1922, and protons with ranges as long as 40 cm were
observed from nitrogen. Five other elements, boron, fluorine, sodium,
aluminum, and phosphorus, were also found to yield long-range particles;
with aluminum as the target, they observed protons with ranges
up to 90 cm air equivalent. In a later version of the experiment (1924),
the protons were observed at 90° to the α particles, where no elastically
scattered protons from hydrogen impurities would be expected at all.
With this apparatus, they could be sure that any particles they saw at
ranges greater than 7 cm (the maximum range of the α particles) were
due to processes other than elastic scattering. In testing a variety of
target materials, they found that all the light elements up to and
including potassium could be disintegrated, with the exception of hydrogen,
helium, lithium, beryllium, carbon, and oxygen.

Rutherford’s original surmise that the α particle knocked out a proton 
from the target nucleus turned out to be only partially correct. In a 
series of cloud-chamber pictures of α particles traversing a nitrogen 
atmosphere, Blackett photographed eight disintegrations (in 415,000 α 
tracks). In all eight cases, the α track terminated in a fork, with a 
lightly ionizing particle, identifiable as a proton, as one branch and a 
heavy recoil, similar in behavior to the nitrogen recoils observed in elastic 
scattering, as the other. In no case was a third track corresponding 
to the α particle itself visible after the collision. From this it was 
concluded that the α particle had been absorbed into the nucleus, releasing 
a proton and forming a residual nucleus 3 u heavier than nitrogen and 
with one more unit of charge. In the terminology of nuclear reactions, 
the events observed by Rutherford, Chadwick, and Blackett may be 
written 

$$ {}_{7}\mathrm{N}^{14} + {}_{2}\mathrm{He}^{4} \rightarrow {}_{8}\mathrm{O}^{17} + {}_{1}\mathrm{H}^{1} $$

where the conservation of charge and mass number are explicitly exhibited. 
The existence of O¹⁷ in ordinary oxygen was unknown in 1925. 
From the range of the protons observed, the energy release could be 
calculated. In the case of nitrogen, this turned out to be negative; i.e., 
part of the kinetic energy of the α particle was used up in forming the 
two products, but in several other instances a positive energy release 
was observed. In aluminum, for example, a 90-cm proton was produced, 
corresponding to a kinetic energy of 8.6 MeV. Even ignoring the kinetic 
energy of the residual nucleus, this represents a gain of nearly 1 MeV over 
the initial kinetic energy of the α particle. 

At the time of these experiments, the question arose whether the 
new nuclides produced were stable, and a systematic investigation was 
undertaken to detect radioactive effects, with negative results. Unfortunately 
the detection methods used were insensitive to β rays; otherwise 
the discovery of the activity induced in aluminum, among others, might 
have been anticipated by more than a decade. 


25.2 Discovery of the Neutron 


distinct groups of protons, A pa 
made by Pose in 1930, when he fo 
maxima for certain a-particle energies; 
nance, predicted by Gurney in 1 
say in Sec. 25.13. 

The first observation of nuclear r: 
the transmutation of the li 


property of these radiations. In 
coefficients in various materials, t 
schematically in Fig. 25.2. «æ particl 


Fig. 25.2 Arrangement used by the Joliot-Curies to study the 
recoils ejected from various materials by radiation produced in the 
α-particle bombardment of beryllium and boron. 


on the ionization chamber of the soft γ radiation from the Po source. 
Inserting absorbers of such materials as Al, Cu, Ag, and Pb in front of 
the ionization chamber, at C in the figure, they found only a slight diminution 
in intensity, but with paraffin, water, and cellophane, they 
actually observed an increase in the ionization-chamber reading. The 
greatest effect was observed with paraffin, which produced an increase 
by a factor of 2. Acting on the conjecture that the effect was due to 
protons ejected from the paraffin by the radiation, the Joliot-Curies 
placed aluminum absorbers between the paraffin and the ionization 
chamber (at A in the figure) and found that 0.2 mm of Al was sufficient 
to destroy the effect. The increase in ion-chamber current was therefore 
not attributable to fast electrons, and tests with a magnetic field showed 
that no easily deflectable particles, like slow electrons, were responsible. 
This evidence and the association of the effect with hydrogenous substances 
made the identification with protons reasonably certain. Knowing 
the range-energy relation for protons in aluminum, they concluded 
that the protons had an energy of about 4.5 MeV and that 
they were recoils from a high-energy γ ray. 

From Eq. (7.17a) for the Compton process, the maximum energy 
transferable to a proton by a γ ray of energy hν is 

$$ E_{\max} = h\nu\,\frac{2\epsilon}{1 + 2\epsilon} \qquad \epsilon = \frac{h\nu}{Mc^{2}} \tag{25.2} $$


where M is the mass of the proton. Using their value of 4.5 MeV for 
the proton energy, they arrived at a γ-ray energy of approximately 
50 MeV. A similar examination of the radiation from boron bombarded 
with α particles yielded a γ-ray energy for this reaction of 35 MeV. 
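
The step from a 4.5-MeV proton to a γ ray of roughly 50 MeV can be checked numerically. The sketch below assumes the reading of Eq. (25.2) as E_max = hν · 2ε/(1 + 2ε) with ε = hν/Mc², and takes Mc² = 938.27 MeV for the proton; the function names are ours and purely illustrative. Solving the relation for hν gives a quadratic whose positive root is returned.

```python
import math

# Assumed proton rest energy M c^2 in MeV (not quoted in the text).
PROTON_REST_MEV = 938.27


def max_recoil_energy(photon_mev, rest_mev=PROTON_REST_MEV):
    """Eq. (25.2): E_max = h*nu * 2*eps / (1 + 2*eps), with eps = h*nu / (M c^2)."""
    eps = photon_mev / rest_mev
    return photon_mev * 2.0 * eps / (1.0 + 2.0 * eps)


def photon_energy_for_recoil(e_max_mev, rest_mev=PROTON_REST_MEV):
    """Invert Eq. (25.2) for h*nu.

    Writing x = h*nu, the relation becomes 2 x^2 - 2 E x - E M c^2 = 0,
    whose positive root is x = (E + sqrt(E^2 + 2 E M c^2)) / 2.
    """
    return 0.5 * (e_max_mev + math.sqrt(e_max_mev ** 2 + 2.0 * e_max_mev * rest_mev))


if __name__ == "__main__":
    h_nu = photon_energy_for_recoil(4.5)
    print(f"photon energy needed for a 4.5-MeV proton recoil: {h_nu:.1f} MeV")
```

With these assumptions the required photon energy comes out near 48 MeV, in line with the figure of approximately 50 MeV quoted above.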

Two serious difficulties involved in this interpretation were pointed 
out by Chadwick in his famous paper of 1932. In the first place, the 
number of protons ejected in the Compton process could be calculated 
from the Klein-Nishina formula and was many thousand times too small 
to account for the observations. In the second place, he showed that, 
even allowing rather generous limits for the errors in the masses involved, 
no conceivable reaction could produce the required energy. The most 
energetic possible reaction would be given by the simple capture of the 
α particle by the Be⁹, forming C¹³; the difference in mass energy between 
Be⁹ + He⁴ and C¹³ could then be radiated as a γ quantum. The 
mass of C¹³ and of the α particle were known, and an upper limit for 
the mass of Be⁹ could be obtained from the fact that Be⁹ is stable.¹ The 
mass difference Be⁹ + He⁴ − C¹³ added to the kinetic energy available 
from the α particle gave an upper limit for the energy of the γ ray of only 
14 MeV, considerably less than the 50 MeV reported. 

In extension of the work of the Joliot-Curies, Chadwick redetermined 
the range of the recoil protons from hydrogen, obtaining a range 
of 40 cm, corresponding to a velocity of 3.3 × 10⁷ m/s and an energy 
of 5.7 MeV. He also observed recoils from a number of other light 
substances, including nitrogen, and made rough measurements of the 
recoil energies. From Eq. (25.2), the maximum energy transferred by 
a γ ray should be roughly inversely proportional to the mass of the 
struck nucleus [note that $2\epsilon \ll 1$, so that $E_{\max} \approx 2(h\nu)^{2}/Mc^{2}$], and so 
it would be expected that nitrogen recoils would have an energy of 
5.7/14 MeV. The observed energy was more than 3 times this value. 

Thus the assumption that the observed effects were produced by 
high-energy γ rays was untenable. If, on the other hand, the radiation 
consisted of uncharged particles of about the same mass as the proton, 
all difficulties were resolved. Such particles, having no interaction with 
the atomic electrons, would be able to pass relatively freely through matter, 
losing their energy only through comparatively rare nuclear collisions. 
In such collisions they could transfer a large fraction of their energy, 
and the observed recoils could be accounted for with only moderate 
primary energies. The existence of such particles, called neutrons, had in 
fact been suggested by Rutherford some 12 years earlier; many attempts 
to observe them had been made at the Cavendish Laboratory before 
Chadwick’s discovery. 

¹ If the mass of Be⁹ were greater than the sum 2He⁴ + n¹, it would disintegrate 
spontaneously. Chadwick assumed, in his preliminary ... 

Comparison of the recoil energies from hydrogen and nitrogen permitted 
a quantitative determination of the mass of the neutron. From 
the ordinary laws of elastic collisions, the maximum velocity which can 
be imparted to a proton by a particle of velocity $V_0$ having a mass equal 
to $M_0$ times the proton mass is 


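$$ V_{H} = V_{0}\,\frac{2M_{0}}{M_{0} + 1} \tag{25.3} $$

while the maximum velocity of a nitrogen recoil will be 

$$ V_{N} = V_{0}\,\frac{2M_{0}}{M_{0} + 14} \tag{25.3a} $$

Dividing these two relations to eliminate $V_0$, we obtain 

$$ \frac{V_{H}}{V_{N}} = \frac{M_{0} + 14}{M_{0} + 1} \tag{25.4} $$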


For $V_H$ Chadwick used his value of 3.3 × 10⁷ m/s; $V_N$ was obtained 
from cloud-chamber measurements by Feather as 4.7 × 10⁶ m/s. 
Inserting these values in Eq. (25.4), Chadwick obtained $M_0$ = 1.15 with 
an estimated error of about 10 percent. The velocity of the neutrons 
was 3.2 × 10⁷ m/s, corresponding to a kinetic energy of about 6 MeV. 
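
The arithmetic behind Chadwick's mass estimate is short enough to reproduce. The sketch below solves the velocity-ratio relation V_H/V_N = (M₀ + 14)/(M₀ + 1) for M₀, using the two recoil velocities quoted above (3.3 × 10⁷ m/s for hydrogen and 4.7 × 10⁶ m/s for nitrogen, the latter being our reading of the exponent). The function name is hypothetical.

```python
def neutron_mass_from_recoils(v_h, v_n, heavy_mass=14.0):
    """Solve V_H / V_N = (M0 + heavy_mass) / (M0 + 1) for M0.

    M0 is the mass of the incident particle in units of the proton mass;
    heavy_mass is the mass of the heavier struck nucleus in the same units
    (14 for nitrogen).
    """
    ratio = v_h / v_n
    return (heavy_mass - ratio) / (ratio - 1.0)


if __name__ == "__main__":
    m0 = neutron_mass_from_recoils(3.3e7, 4.7e6)
    print(f"M0 = {m0:.2f} proton masses")
```

The result lies within about 1 percent of the quoted 1.15, well inside the estimated 10 percent error; the small difference presumably reflects rounding of the input velocities.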

Chadwick suggested that the reaction involved in the production of 
neutrons from α-particle bombardment of beryllium is 

$$ {}_{4}\mathrm{Be}^{9} + {}_{2}\mathrm{He}^{4} \rightarrow \mathrm{C}^{12} + n^{1} $$

where n¹ refers to the neutron. From the known masses, again assuming 
an upper limit for Be⁹, he concluded that a maximum of 8 MeV was 
available for the neutron when polonium α particles (5.3 MeV) were used. 
In the case of boron he assumed the reaction 

$$ \mathrm{B}^{11} + \mathrm{He}^{4} \rightarrow \mathrm{N}^{14} + n^{1} $$

Here the masses were all known, and from the energy balance he was 
able to obtain a mass for the neutron of 1.0067 u, with a probable error 
of about 0.1 percent. 


Nuclear Binding Energies and Nuclear Forces 


The discovery of the neutron revolutionized our ideas about the structure 
of nuclei; until that time it was generally believed that nuclei contained 
protons, electrons, and, possibly, α particles. The theory of 
nuclear structure based on this assumption was fraught with perplexing 
difficulties, particularly with respect to the properties of electrons in the 
nucleus. The introduction of the neutron as a nuclear constituent eliminated 
some of these problems and provided a much more satisfactory 
picture of the composition of nuclei. The theory has still many difficulties, 
it must be confessed, but at least they are now of another kind. 


25.3 Properties of Nuclei 


Before we embark on a discussion of the theory of nuclear structure, it 
will perhaps be appropriate to review some of the more important proper- 
ties of nuclei as they were known in 1932. In the ensuing discussion 
we shall have the advantage of another 35-odd years of development, 
but most of the concepts we shall need for our preliminary discussion were 
at hand by 1932. 


a. Charge We begin with the nuclear charge. Since atoms are nor- 
mally neutral, the nuclear charge is equal in magnitude and opposite in 
sign to that of the atomic electrons. Since the latter all have the same 
charge e, the nuclear charge is Ze, where Z is an integer. The value of Z 
is known from x-ray scattering experiments, from the nuclear scattering 
of α particles, and from the x-ray spectrum. 


b. Mass The masses of nuclei, determined by measurement of the 
ratios of charge to mass in the mass spectrometer (Sec. 10.4) and by 
nuclear reaction energies (Sec. 25.10), are all very close to integers when 
expressed in terms of C¹² = 12 exactly. The mass number A is about 
2Z for light elements and increases to about 2.5Z for heavy ones. 


c. Size The “size” of a nucleus, like that of an atom, is not a uniquely 
defined quantity, since the specification depends upon what property is 
under consideration. If two atoms approach one another, they may 
exhibit a weakly attractive force at larger distances, which changes at 
short distances to a strongly repulsive force. The distance at which 
this change occurs may be used as a measure of the sum of the radii of 
the two atoms. On the other hand, the “radius” may also be defined 
in terms of the distribution of electronic charge in the atom. The two 
definitions may give different results; both are subject to arbitrariness 
because the functions involved change slowly with distance. In the 
nuclear case, too, various definitions of the radius are used, but all 
agree fairly well; in each case the measurements suggest a rather sharp 
boundary. 

One useful measurement of the size of nuclei comes from α-particle 
scattering experiments of the kind described in Sec. 8.4. In these experiments, 
a beam of α particles is directed onto a thin foil of the material 
under study and the number of α particles scattered at various angles is 
determined. For moderate α-particle energies, the experimental results 
are consistent with the assumption of a simple inverse-square force acting 
between the nucleus and the α particle. As the α-particle energy is 
increased, deviations from the Rutherford scattering law appear, particularly 
at large angles. These deviations are associated with the fact 
that the α-particle trajectory reaches closer to the nucleus at the higher 
energies, and can thus be interpreted as a failure of the inverse-square 
law at small distances. Analysis of the scattering experiments leads 
to the conclusion that the radius at which the force law changes varies 
from a few times 10⁻¹⁵ m in light nuclei to about 10⁻¹⁴ m in heavy nuclei. 

Scattering of high-energy neutrons by nuclei (Sec. 25.15) also gives a 
measure of the extent of the force field around a nucleus. Since neutrons 
are unaffected by the electrostatic forces, their scattering must be interpreted 
in terms of some nonelectrical force. The experimental results 
indicate an attractive force which sets in rather abruptly at short distances. 
The radius at which the force begins to act strongly varies 
approximately as the cube root of the mass number and appears to be 
well represented by the expression 

$$ R = R_{0}A^{1/3} \qquad R_{0} = 1.1 \text{ to } 1.5 \times 10^{-15}\ \mathrm{m} \tag{25.5} $$


We shall discuss in later sections still other determinations of the 
nuclear radius as defined in various ways and shall find that all are 
reasonably consistent with Eq. (25.5). 
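
As a rough illustration of Eq. (25.5), the following sketch evaluates R = R₀A^(1/3) for a few mass numbers. The default R₀ = 1.3 × 10⁻¹⁵ m is simply a value we pick from the middle of the quoted range of 1.1 to 1.5 × 10⁻¹⁵ m, and the function name is illustrative only.

```python
def nuclear_radius_m(mass_number, r0_m=1.3e-15):
    """Eq. (25.5): R = R0 * A**(1/3).

    r0_m is an assumed mid-range value of R0 in meters.
    """
    return r0_m * mass_number ** (1.0 / 3.0)


if __name__ == "__main__":
    for a in (4, 27, 125, 238):
        print(f"A = {a:3d}   R = {nuclear_radius_m(a):.2e} m")
```

Because the radius grows only as the cube root of A, the nuclear volume is proportional to A; in this picture the density of nuclear matter is about the same in all nuclei.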


d. Spin The spin (better referred to as the total angular momentum) 
of the nucleus can be obtained from the hyperfine-structure pattern, 
from spectra of homonuclear molecules, or by atomic-beam methods. 
The magnitude of the angular momentum is specified, in units of ħ, by 
the number I,¹ where I is the maximum possible projection of the 
angular-momentum vector on any arbitrary axis in space. In nuclei with even 
A, I is found to be an integer or zero; the ground state I is always zero 
if both Z and A are even. In nuclei with odd A, I is a half-integer. 
Protons, electrons, and neutrons have spin I = ½. 


¹ The symbol J is sometimes used. 


e. Magnetic Moment A magnetic dipole moment is always associated 
with nuclear spin; the magnitudes of nuclear moments, obtained from the 
spacing in hyperfine structure and from atomic-beam experiments, are 
of the order of the nuclear magneton eħ/2M, where M is the proton 
mass. They are thus about one one-thousandth of the magnetic moment 
of the electron. Nuclei with zero spin have no magnetic moment. 


f. Statistics Quantum-mechanical systems can be divided into two 
classes depending upon what kind of statistics are obeyed by the particles 
composing them (Secs. 20.3 and 20.4). If the wave function describing 
two identical particles, e.g., two electrons with the same spin direction, 
changes its sign when the space coordinates of the two particles are 
interchanged, the wave function is antisymmetric with respect to the 
exchange and the particles obey Fermi-Dirac statistics. For the other 
class of particles, the wave function is symmetric—does not change 
sign—and the particles obey Bose-Einstein statistics. F-D particles obey 
the Pauli exclusion principle, while B-E particles do not. Experimentally 
it is found that all particles with half-integral spins, e.g., protons, neutrons, 
and electrons, obey F-D statistics, and all those with integral or zero 
spin obey B-E statistics. All nuclei with odd A obey F-D statistics, 
while those with even A obey B-E statistics. 


g. Parity The parity of a system refers to the behavior of the wave 
function under inversion of the coordinates through the origin, i.e., when 
x is replaced by −x, y by −y, and z by −z. If the potential-energy 
function is unchanged by this process [P(x,y,z) = P(−x,−y,−z)], then 
it can be shown that the wave function may either remain the same or it 
may change sign: it is said to have even parity if the sign remains the 
same; if the sign changes, the parity is odd. The wave functions describing 
a particle in a box have alternately even and odd parities if the 
potential is symmetric in x. The parity of the wave functions of the 
one-electron atom is even or odd as l is even or odd. Nuclear states 
are characterized by a definite parity (which may be different for 
different states of the same nucleus), and the conservation of parity has 
an important bearing on nuclear reactions. 


h. Electric Quadrupole Moment A system of charges with nonzero 
spatial extension may produce, in addition to the ordinary radial 
electrostatic field of force, fields of higher complexity, characterized as dipole, 
quadrupole, and higher-multipole fields. There is good evidence that 
nuclei do not have electric dipole moments, but electric quadrupole 
moments have been observed in many nuclei. 


25.4 Constituents of Nuclei 


The fact of the transmutability of nuclei argues that they are complex 
systems, composed of some common constituents. The integral values 
of Z and the near-integral values of M suggest the proton as one candidate, 
while the occurrence of α particles and β particles as decay products 
early suggested electrons as fundamental constituents and helium nuclei 
either as constituents or as particularly stable subassemblies. Thus, in 
the early theories, a nucleus of charge number Z and mass number A 
was thought of as comprising A protons and A − Z electrons, grouped 
as far as possible in units of four protons and two electrons. The 
hypothesis that electrons exist in the nucleus encounters a number of 
difficulties, however, several of which were already apparent long before 
the neutron was discovered. For one thing, there was the problem of 
the magnetic moment: a single odd electron in a nucleus would be expected 
to contribute, from its spin alone, a moment of 1 Bohr magneton; nuclear 
moments are very much smaller than this. More perplexing still was 
a problem arising in the light nuclides with even A and odd Z. N¹⁴, 
for example, would have 14 protons and 7 electrons, all F-D particles 
with spin ½. With an odd number of such particles, N¹⁴ should evidently 
obey F-D statistics and have a half-integral spin, but study of the band 
spectrum of nitrogen gas showed in fact that N¹⁴ obeys B-E statistics 
and has a spin of 1. Finally, according to the uncertainty principle, 
a particle with so small a mass as the electron would have an implausibly 
large kinetic energy if it were confined to nuclear dimensions. 
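
The uncertainty-principle argument can be made semiquantitative. The sketch below is an order-of-magnitude estimate only: it assumes a momentum p ≈ ħ/R for a particle confined within a radius R, takes R = 5 × 10⁻¹⁵ m (5 fm) as a representative nuclear dimension, and uses ħc ≈ 197.3 MeV fm together with the usual rest energies. None of these numbers appears in the text, and the function name is ours.

```python
import math

HBAR_C_MEV_FM = 197.327      # assumed value of hbar*c in MeV fm
ELECTRON_REST_MEV = 0.511    # assumed electron rest energy
PROTON_REST_MEV = 938.27     # assumed proton rest energy


def confinement_kinetic_energy_mev(rest_mev, radius_fm):
    """Relativistic kinetic energy for momentum p ~ hbar / R.

    Uses T = sqrt((pc)^2 + (mc^2)^2) - mc^2 with pc = hbar*c / R.
    This is a rough uncertainty-principle estimate, not a bound-state calculation.
    """
    pc = HBAR_C_MEV_FM / radius_fm
    return math.sqrt(pc ** 2 + rest_mev ** 2) - rest_mev


if __name__ == "__main__":
    radius = 5.0  # fm, an assumed nuclear dimension
    t_e = confinement_kinetic_energy_mev(ELECTRON_REST_MEV, radius)
    t_p = confinement_kinetic_energy_mev(PROTON_REST_MEV, radius)
    print(f"electron: about {t_e:.0f} MeV, proton: about {t_p:.1f} MeV")
```

On these assumptions an electron would need a kinetic energy of some tens of MeV, whereas a nucleon confined to the same region needs less than 1 MeV, which is small compared with typical nuclear binding energies.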

With neutrons and protons as the building blocks of nuclei, on the 
other hand, these problems find a natural solution. On this picture a 
nucleus with charge number Z and mass number A contains Z protons 
and A − Z neutrons, or A particles in all. Thus N¹⁴ has seven protons 
and seven neutrons; neutrons are Fermi particles and have half-integral 
spin, so that the spin and statistics of N¹⁴ are accounted for. The magnetic 
moments present less difficulty than before, since only heavy particles 
are involved. The problem of β decay is taken care of, or at least put 
in another category, by assuming that a neutron in the nucleus changes 
to a proton, creating the β particle (and an antineutrino) in the process. 
Two other types of decay, which were discovered later, also fit into this 
picture: a nuclear proton may be converted into a neutron, creating a 
positive electron and a neutrino (positron decay) or capturing an atomic 
electron (electron capture) and emitting a neutrino. The 
quantum-mechanical treatment of β decay bears a close resemblance to that 
describing the creation of light quanta in atomic transitions, and the 
formal similarity of the two processes emphasizes the point that the 
appearance of an electron coming out of a nucleus does not imply that 
it ever was in the nucleus. The assumption that neutrons and protons 
are constituents of nuclei and electrons are not is consistent with all 
known facts and may be regarded as a basic tenet of nuclear theory. The 
term nucleon is often used in referring to either protons or neutrons. 


25.5 Masses and Binding Energies 


Under the assumption that nuclei are composed of neutrons and protons, 
we can now discuss the various factors which enter into the masses of 
nuclei and gain some insight into their binding. The mass of any 
permanently stable nucleus is found to be less than the sum of the masses 
of the neutrons and protons which it contains. This fact is accounted 
for by the conversion of part of the mass energy of the particles into 
energy of binding, the relation between the change in mass and the 
binding energy being given by Einstein’s equation, $E_{\text{binding}} = \Delta M c^{2}$. 
The significance of the binding energy can perhaps be seen most easily 
in the more familiar atomic case. In the capture of a free electron by a 
proton to form a hydrogen atom, energy, 13.6 eV, is released (in the 
form of one or more light quanta); the mass of the hydrogen atom is 
therefore slightly less than the sum of the masses of a free electron and 
free proton. To ionize the hydrogen atom, an amount of energy at 
least equal to the binding energy of 13.6 eV must be added. In an atom 
containing more than one electron, the binding energy is the total energy 
required to remove all the electrons; it is thus to be distinguished from 
the ionization potential, which is the minimum energy required to 
remove one electron. In atoms, the ionization potential of the most 
loosely bound electron is a few electron volts; the total binding energy 
for all but the lightest atoms is given approximately by the empirical 
expression $E_B = 15.6\,Z^{7/3}$ eV. The change in mass corresponding to the 
binding energy in atoms is small (in the uranium atom it amounts to 
about 1 electron mass, or about 3 × 10⁻⁶ of the total) and is just at 
the limit of direct measurability; in the nucleus, on the other hand, the 
binding energy may represent an appreciable fraction of the total mass. 
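
The statement about uranium can be checked against the empirical expression for the total electronic binding energy. The sketch below assumes that the expression reads E_B = 15.6 Z^(7/3) eV (the exponent is our reading of the text), and takes 0.511 MeV for the electron rest energy and 931.5 MeV per atomic mass unit; the function name is illustrative.

```python
ELECTRON_REST_EV = 0.511e6   # assumed electron rest energy, eV
U_IN_EV = 931.5e6            # assumed energy equivalent of 1 u, eV


def atomic_binding_energy_ev(z):
    """Empirical total electronic binding energy, E_B = 15.6 * Z**(7/3) eV.

    The exponent 7/3 is an assumption about the intended formula; the
    expression is stated to hold for all but the lightest atoms.
    """
    return 15.6 * z ** (7.0 / 3.0)


if __name__ == "__main__":
    e_b = atomic_binding_energy_ev(92)
    print(f"uranium: E_B = {e_b / 1e3:.0f} keV")
    print(f"  in electron masses: {e_b / ELECTRON_REST_EV:.2f}")
    print(f"  fraction of atomic mass (A = 238): {e_b / (238 * U_IN_EV):.1e}")
```

On this reading the uranium atom is bound by roughly 600 keV, a little over 1 electron mass and a few parts in 10⁶ of the atomic mass, consistent with the figures quoted.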


The mass of the nucleus with charge number Z and mass number A 
may be written 


$$ M_{N}(A,Z) = ZM_{p} + (A - Z)M_{n} - \frac{E_{B}}{c^{2}} \tag{25.6} $$

where $M_p$ is the mass of the proton and $M_n$ is the mass of the neutron. 
The term $E_B/c^2$ represents the mass equivalent of the total binding 
energy, the energy which must be added to the nucleus in order to break 
it up into Z protons and A − Z neutrons. Because it is more nearly 
the atomic than the nuclear mass which is measured in the mass spectrograph, 
the values given in mass tables usually refer to the mass of the 
neutral atom, including the electrons. Expressed in terms of atomic 
rather than nuclear masses, Eq. (25.6) becomes 


$$ M(A,Z) = M_{N}(A,Z) + ZM_{e} = ZM_{H} + (A - Z)M_{n} - \frac{E_{B}}{c^{2}} \tag{25.7} $$

where now $M_H$ is the mass of the hydrogen atom. Actually a very small 
correction for the binding energy of the atomic electrons should be made 
if the quantity $E_B$ is to refer to the nuclear binding energy only, but the 
correction is negligible for our purposes, except perhaps in the heaviest 
nuclei. 

With the help of Eq. (25.7) and the known masses of the neutron 
and hydrogen atom, we can now compute the binding energy of any 
nuclide for which the mass has been measured. As a simple example 
we take the deuterium atom (hydrogen isotope of mass number 2): 


$$ M(\mathrm{H}^{2}) = M_{H} + M_{n} - \frac{E_{B}}{c^{2}} $$

From Table 10.2 

$$ 2.014102 = 1.007825 + 1.008665 - \frac{E_{B}}{c^{2}} $$

$$ \frac{E_{B}}{c^{2}} = 0.002388\ \mathrm{u} \qquad E_{B} = 2.225\ \mathrm{MeV} $$

We conclude that it requires 2.225 MeV of energy to dissociate a deuteron 
and that the capture of a free neutron by a proton releases 2.225 MeV of 
energy (in the form of a γ ray). In the same way, we find for the binding 
energy of the U²³⁸ nucleus 

$$ \frac{E_{B}}{c^{2}} = 92 \times 1.007825 + 146 \times 1.008665 - 238.050770 = 1.93422\ \mathrm{u} \qquad E_{B} = 1802\ \mathrm{MeV} $$

Thus the binding energy in this nucleus amounts to nearly 1 percent of 
its mass. 
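
The two computations above follow the same recipe and are easily mechanized. The sketch below applies Eq. (25.7) using the atomic masses quoted in the text; the conversion factor of 931.5 MeV per atomic mass unit is an assumption on our part (it is not stated in this section), and the function name is illustrative.

```python
M_H = 1.007825     # mass of the hydrogen atom, u (value used in the text)
M_N = 1.008665     # mass of the neutron, u (value used in the text)
U_TO_MEV = 931.5   # assumed energy equivalent of 1 u, MeV


def binding_energy_mev(z, a, atomic_mass_u):
    """Eq. (25.7) solved for E_B: (Z*M_H + (A - Z)*M_n - M(A,Z)) * c^2."""
    mass_defect_u = z * M_H + (a - z) * M_N - atomic_mass_u
    return mass_defect_u * U_TO_MEV


if __name__ == "__main__":
    for name, z, a, mass in (("H-2", 1, 2, 2.014102), ("U-238", 92, 238, 238.050770)):
        e_b = binding_energy_mev(z, a, mass)
        print(f"{name}: E_B = {e_b:.3f} MeV, per nucleon = {e_b / a:.2f} MeV")
```

Dividing by A gives the binding energy per particle, which is the quantity whose near constancy is discussed next.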

One interesting result of the computation of nuclear binding energies 
is the observation that, except for the lightest nuclides, the binding energy 
varies practically linearly with A, or to put it another way, the binding 
energy per particle is very nearly constant over a large range of A. In 
Fig. 25.3 is shown a plot of the binding energy per particle as a function 
of A for the stable nuclides. Aside from certain irregularities among 
the light elements, the general course of the curve shows a gradual 
increase to a value of about 8.8 MeV/particle near the middle of the 
table with a falling off to about 7.6 MeV/particle in the region of the 
heavy elements. It is worth observing that the near constancy of the 
binding energy per particle is in striking contrast to the behavior of 
atoms, where the average binding energy per electron is a steadily increasing 
function of the number of electrons (proportional to $Z^{4/3}$). 

Fig. 25.3 Average binding energy per nucleon for the naturally occurring nuclides 
and the unstable nuclide Be⁸, which decays into two α particles. (Axes: binding 
energy per particle, MeV, versus mass number A.) 


a. Stability against Heavy-particle Emission That the total binding 
energy of a nucleus is positive is not a sufficient condition for its stability; 
in order that a nucleus be stable, it is necessary that its mass be less than 
the sum of the masses of any other combinations of its constituent protons 
and neutrons, whether free or bound in small groups. The 
α-radioactive nuclei are unstable because the α particle itself is a tightly 
bound structure, and at the upper end of the periodic table, the binding 
energy per particle increases as the number of particles decreases. The 
energy required to remove any given particle from the nucleus is called 
the separation energy,¹ $E_s$. The separation energy for an α particle in 
the nucleus (Z,A) is 

$$ \frac{E_{s}(\alpha)}{c^{2}} = M(\mathrm{He}^{4}) + M(Z - 2,\, A - 4) - M(Z,A) \tag{25.8} $$

If the α-particle separation energy is negative, the nucleus (Z,A) is 
unstable to α emission. For the light elements, the separation energy 
of an α particle is positive, e.g., it requires 4.730 MeV to remove an 
α particle from Ne²⁰. On the other hand, the α separation energy of 
Po²¹⁰ is −5.4 MeV. There exist many isotopes with A > 150 which 
are energetically unstable with respect to α-particle emission. Because 
of the extremely long lifetimes expected for low-energy α emitters, these 
isotopes are nonetheless stable for all practical purposes. A still larger 
class are, in principle, unstable with respect to fission into two or even 
three roughly equal parts, but again the lifetimes are so long that the 
effect is generally unobservable. Stability against neutron or proton 
emission is no problem; the separation energy for these particles is 
generally around 7 to 8 MeV, or about the same as the average binding 
energy per particle, for all but the lightest nuclides. 
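
Equation (25.8) can be illustrated with the neon example. The atomic masses used below for He⁴, O¹⁶, and Ne²⁰ are rounded values from standard mass tables; they are not given in this section and should be treated as assumptions, as should the conversion factor of 931.5 MeV per atomic mass unit. The function name is ours.

```python
U_TO_MEV = 931.5  # assumed energy equivalent of 1 u, MeV

# Assumed atomic masses in u (rounded table values, not from this section).
M_HE4 = 4.002603
M_O16 = 15.994915
M_NE20 = 19.992440


def alpha_separation_energy_mev(m_parent_u, m_daughter_u, m_alpha_u=M_HE4):
    """Eq. (25.8): E_s(alpha) = [M(He4) + M(Z-2, A-4) - M(Z, A)] * c^2.

    A negative result means the parent is energetically unstable to alpha emission.
    """
    return (m_alpha_u + m_daughter_u - m_parent_u) * U_TO_MEV


if __name__ == "__main__":
    e_s = alpha_separation_energy_mev(M_NE20, M_O16)
    print(f"alpha separation energy of Ne-20: {e_s:.2f} MeV")
```

With these inputs the separation energy comes out close to the 4.730 MeV quoted above; the same function returns a negative value for any nuclide heavier than its α-decay products.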


b. Stability against Beta Decay Still a further condition for the stability 
of nuclei comes as a result of the phenomenon of β decay. It may 
occur, and frequently does, that although a nucleus has insufficient 
energy to emit a nuclear particle, a system of lower energy can be formed 
by transforming a neutron into a proton, or vice versa. Most often it 
is the requirement of stability against β decay which is the most stringent 
condition for the existence of a given nuclide in nature. The nuclide 
(Z,A) is stable against negative β decay if 

$$ M(Z,A) < M(Z + 1, A) \tag{25.9} $$


where the M’s are atomic (not nuclear) masses. To see this, we note 
that M(Z,A) is the mass of the nucleus plus Z electrons; if the nucleus 
is to decay, it must supply sufficient mass for the new nucleus plus the 
β particle. The mass condition then is 

$$ M(Z,A) = M_{N}(Z,A) + ZM(e) < M_{N}(Z + 1, A) + ZM(e) + M(\beta) $$

where $M_N$ refers to the mass of the nucleus alone, M(e) to the mass of 
an atomic electron, and M(β) to the β particle. Since the negative 
β particle is an electron, the right-hand side adds up to M(Z + 1, A), the 
atomic mass. 

¹ Sometimes also called the binding energy of the particle in the nucleus (Z,A). 

In the decay by positron emission, the initial nucleus must furnish 
mass for the new nucleus and a β⁺ particle; the condition for positron 
stability of the nuclide (Z + 1, A) is then 

$$ M_{N}(Z + 1, A) < M_{N}(Z,A) + M(\beta^{+}) $$

Adding Z + 1 electron masses to both sides and noting that the positron 
and electron masses are the same, we obtain the expression in terms of 
atomic masses: 

$$ M(Z + 1, A) < M(Z,A) + 2M(e) \tag{25.10} $$


A third type of β process which is of particular importance for 
stability considerations is decay by electron capture, in which an orbital 
electron, usually from the K shell, is absorbed into the nucleus and a 
neutrino is emitted. Since the latter may have zero energy, the condition 
for stability becomes 

$$ M_{N}(Z + 1, A) + M(e) < M_{N}(Z,A) $$

or, in terms of atomic masses, 

$$ M(Z + 1, A) < M(Z,A) \tag{25.11} $$
We have in all three of these expressions for β stability ignored the change 
in the binding energy of the atomic electrons; this quantity is in almost 
all cases quite negligible. 

The two conditions (25.9) and (25.11), for stability against negative 
β decay and electron capture, guarantee that of any two neighboring 
isobars one must be unstable, for the heavier will always¹ be able to 
transform into the lighter by one process or the other. Thus if the masses 
of isobars (nuclides with fixed A) increase uniformly as Z is varied away 
from some optimum value, only one will be stable. It is just this fact 
that gives the study of the naturally occurring nuclides its great importance 
in connection with the understanding of nuclear structure, for of 
any combination of A neutrons and protons, just the one which we find 
in nature will represent the configuration of minimum energy. There 
do occur cases among species with even A where the mass is not a uniform 
function of Z; we have then the situation where a nuclide may be unstable 
with respect to both its neighbors, and, for these values of A, two isobars 
and sometimes three, differing in charge by 2 units, may occur. 
The isobars Ca⁴⁰-A⁴⁰ and Zn⁶⁴-Ni⁶⁴ are examples. 

¹ Still ignoring the electron-binding-energy correction; no case is known where 
this prevents a decay which would otherwise be possible. There exist some cases 
where K-electron capture is not possible and capture from L or higher shells occurs 
instead. 
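
The three atomic-mass conditions (25.9) to (25.11) can be collected into a single test. The sketch below is a minimal illustration under stated assumptions: the function name and argument layout are hypothetical, the electron mass of 0.000549 u is an assumed value, and the electron-binding-energy correction is ignored, as in the text. As a check it is applied to the free neutron, treated here as the "nuclide" (Z = 0, A = 1), and to the hydrogen atom, using the masses quoted earlier in this section.

```python
ELECTRON_MASS_U = 0.000549  # assumed electron mass, u


def allowed_beta_processes(m_parent, m_lower=None, m_higher=None):
    """List the beta processes energetically open to a nuclide.

    m_parent : atomic mass M(Z, A) of the nuclide, in u
    m_lower  : atomic mass M(Z - 1, A) of the lower-Z isobar, or None
    m_higher : atomic mass M(Z + 1, A) of the higher-Z isobar, or None
    The tests are the negations of the stability conditions
    (25.9), (25.10), and (25.11).
    """
    processes = []
    if m_higher is not None and m_parent > m_higher:
        processes.append("beta-minus")          # violates (25.9)
    if m_lower is not None:
        if m_parent > m_lower + 2.0 * ELECTRON_MASS_U:
            processes.append("positron")        # violates (25.10)
        if m_parent > m_lower:
            processes.append("electron-capture")  # violates (25.11)
    return processes


if __name__ == "__main__":
    neutron, hydrogen = 1.008665, 1.007825
    print("neutron :", allowed_beta_processes(neutron, m_higher=hydrogen))
    print("hydrogen:", allowed_beta_processes(hydrogen, m_lower=neutron))
```

Since positron emission requires a larger mass difference than electron capture, any nuclide for which the former is possible can also decay by the latter.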


25.6 Nuclear Forces 


We wish now to see what can be inferred from the material at hand 
about the character of the forces which hold nuclei together. It is in 
the first place clear that they are not ordinary electrostatic forces, if 
for no other reason than that all the particles involved either have 
positive charge or are neutral, and such a system could not be bound 
by classical forces. Even aside from the sign of the force, electrostatic 
forces are much too weak to account for the main effects; gravitational 
forces are weaker still and offer no assistance in the problem. 
It is therefore necessary to postulate an entirely new type of interaction, 
a type which never manifests itself in large-scale phenomena and which 
has no evident relation to either gravitational or electric forces. Since 
we have no previous experience with such forces, the best we can do is to 
make the simplest possible postulates about them consistent with the 
available data. With a finite number of facts available, we cannot be 
sure of a unique solution: it can only be hoped that the solution will 
contain sufficient elements of truth to permit further progress as 
experimental knowledge expands. 

One characteristic feature of nuclear forces is their short range. 
Although the exact dependence on distance is not well established, there 
is good evidence that the force vanishes for all practical purposes at 
distances greater than a few times 10⁻¹⁵ m. There are many experiments 
which suggest that the force between two nucleons² sets in quite 
abruptly at a separation of 1 or 2 × 10⁻¹⁵ m. In the interest of simplicity, 
the force is usually regarded as derivable as the gradient of a potential 
(in the hope, among other things, that the force between nucleons is 
independent of their velocity), and a shape of the potential is chosen 
which is mathematically not intractable. It will be sufficient for our 
purposes to use as a convenient approximation the so-called square well. 
Conceptual difficulties with the infinite force at the boundary can be 
resolved by letting the transition take place over a small range in radius. 

¹ There do occur a few apparent exceptions to the rule prohibiting neighboring 
stable isobars: for example, Cr⁵⁰, V⁵⁰, and Ti⁵⁰ all exist in nature. Direct mass 
measurements show that V⁵⁰ should be unstable. Its long life is attributed to the very 
high angular-momentum change of 6 units, which inhibits the decay. 

² Aside from the electrostatic force, which acts only between protons. 

a. Theory of the Deuteron In order to exhibit some of the concepts 
involved in discussing nuclear potentials and the quantum states of 
nuclei, we consider the deuteron. Although the deuteron is the only 
nucleus which admits of a reasonably exact solution, many features of 
the deuteron problem occur in more complicated nuclei and an
understanding of this relatively simple case is useful in these applications. 
Our discussion will be based on the square-well potential illustrated in 
Fig. 25.4, where the heavy line indicates the potential energy of the 
neutron in the nuclear field of the proton (or vice versa). The potential 
energy has the value P = −P₀ out to the radius R and is zero for larger 
values. We write the wave function for the neutron, in coordinates 
relative to the proton, as ψ(r,θ,φ), and look for a spherically symmetric¹ 
solution to the wave equation, of the form ψ = u(r)/r; the absolute square 
of u(r) represents the probability of finding the neutron at distance r. 


1 With the assumed potential, the spherically symmetric solution will have the 
lowest energy. 
Fig. 25.4 Simplified square-well potential representing 
the interaction of a neutron and a proton. 

The Schrödinger equation may then be written 


$$\frac{d^2u(r)}{dr^2} + \frac{M}{\hbar^2}\,(E + P_0)\,u(r) = 0 \qquad r < R \tag{25.12a}$$

$$\frac{d^2u(r)}{dr^2} + \frac{M}{\hbar^2}\,E\,u(r) = 0 \qquad r > R \tag{25.12b}$$


Because the neutron and proton have practically equal mass M, the 
reduced mass M/2 is used in these equations. We must now ascertain 
whether there exist any solutions to Eqs. (25.12) which correspond to 
stationary, bound states; evidently we require solutions for which E, 
the total energy, is negative (see Fig. 25.4). Since the deuteron is 
known to be a bound system, with E = −2.225 MeV, the question 
really amounts to choosing values of P₀ and R which will reproduce 
this fact. For r < R, the solution of (25.12a) is of the form 


$$u(r) = A \sin \kappa r \qquad \kappa = \frac{\sqrt{M(E + P_0)}}{\hbar} \qquad r < R \tag{25.13a}$$


The cosine solution is inadmissible because it is finite at r = 0 and 
ψ = u(r)/r would be infinite. The outside solution is 


$$u(r) = B e^{-kr} \qquad k = \frac{\sqrt{-ME}}{\hbar} \qquad r > R \tag{25.13b}$$


where the positive-exponential solution is rejected because it produces 
an infinite ψ as r approaches infinity. The condition that these two 
solutions join smoothly at r = R then determines whether a real
eigenvalue for the energy exists. Equating the values and first derivatives 
of u at r = R, we obtain 


$$A \sin \kappa R = B e^{-kR} \tag{25.14a}$$

$$A \kappa \cos \kappa R = -B k e^{-kR} \tag{25.14b}$$


Dividing (25.14b) by (25.14a) 


$$\kappa \operatorname{ctn} \kappa R = -k \qquad \text{or} \qquad \operatorname{ctn} \kappa R = -\frac{k}{\kappa} \tag{25.15}$$


This transcendental equation can be solved numerically or graphically 
to produce the required condition on P₀ and R for any (negative) value 
of E. For the present purpose, it is sufficient to observe that if we 
choose P₀ ≫ |E|, then k/κ is small and the cotangent of κR is small and 
negative; κR is then slightly greater than π/2. The situation is illustrated 
in Fig. 25.5, where u(r) is indicated as a function of r. The inside (sine) 
function has reached a maximum and just started to decrease where 
it joins the outside exponential. This condition represents the first 
possible eigenvalue, since if the interior function had not yet passed 
its first maximum, it could not be made to fit a decreasing exponential 
outside. The condition κR ≈ π/2 gives 


$$\kappa R \approx \frac{R}{\hbar}\sqrt{M P_0} \approx \frac{\pi}{2}$$

or

$$P_0 R^2 \approx \frac{\pi^2 \hbar^2}{4M} = 1.02 \times 10^{-28}\ \text{MeV-m}^2 \tag{25.16}$$


using the definition of κ in (25.13a) and ignoring the small quantity 
E. If we choose for R the reasonable value of 2 × 10⁻¹⁵ m, we 
obtain P₀ ≈ 25 MeV for the potential energy of the neutron-proton 
bond in the ground state of the deuteron. Taking κR = nπ/2, with 
n = 3, 5, . . . , would yield excited states of the system, corresponding 
to the excited levels of an atom. It can be shown, however, that no 
bound excited states exist in the deuteron. 
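
The numerical solution mentioned above can be sketched in a few lines. The following is only an illustration, not part of the original treatment: it assumes ħc = 197.327 MeV·fm and a mean nucleon rest energy of 938.92 MeV (neither value is quoted in the text), works in femtometers (1 fm = 10⁻¹⁵ m), and the function names are hypothetical. It compares the crude estimate of Eq. (25.16) with the depth that satisfies the matching condition (25.15) on the first branch, π/2 < κR < π.

```python
import math

HBAR_C = 197.327                       # MeV*fm (assumed constant, not from the text)
NUCLEON_MC2 = 938.92                   # MeV, mean of proton and neutron rest energies (assumed)
H2_OVER_M = HBAR_C**2 / NUCLEON_MC2    # hbar^2 / M in MeV*fm^2, about 41.5


def crude_depth(radius_fm):
    """Well depth from kappa*R = pi/2 with E neglected, as in Eq. (25.16)."""
    return (math.pi / 2) ** 2 * H2_OVER_M / radius_fm**2


def matching_residual(depth, radius_fm, energy):
    """kappa*ctn(kappa*R) + k, which vanishes when Eq. (25.15) is satisfied."""
    kappa = math.sqrt((energy + depth) / H2_OVER_M)
    k = math.sqrt(-energy / H2_OVER_M)
    return kappa / math.tan(kappa * radius_fm) + k


def exact_depth(radius_fm, energy=-2.225, tol=1e-9):
    """Bisect on the depth P0 with kappa*R confined to (pi/2, pi)."""
    lo = (0.5 * math.pi / radius_fm) ** 2 * H2_OVER_M - energy    # residual > 0 here
    hi = (0.999 * math.pi / radius_fm) ** 2 * H2_OVER_M - energy  # residual < 0 here
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if matching_residual(mid, radius_fm, energy) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    R = 2.0  # fm, the radius chosen in the text
    print(f"crude P0 = {crude_depth(R):.1f} MeV")
    print(f"P0 from Eq. (25.15) = {exact_depth(R):.1f} MeV")
```

Because |E| is not entirely negligible beside P₀, the depth obtained from the full matching condition comes out noticeably larger than the crude figure, which is why the value of about 25 MeV should be read as an order-of-magnitude estimate.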


b. Scattering of Free Nucleons From the fact that the deuteron is 
a bound system, we know that attractive forces exist between neutrons
and protons. Further information on the forces can be obtained from a
study of the scattering of free neutrons by protons. In such experiments
a parallel beam of neutrons is allowed to impinge upon a target
containing hydrogen atoms (paraffin, for example) and the number of
neutrons deflected through various angles in elastic encounters is
determined as a function of neutron energy (Sec. 25.15c). Since they
have no charge, neutrons are unaffected by the electrostatic field and
their scattering will directly reflect the operation of the nuclear forces. 


Fig. 25.5 Radial variation of the function 
u(r) = rψ(r) describing the deuteron in its 
ground state. 


The wave-mechanical theory of neutron-proton scattering is constructed
along much the same lines as is the theory of the deuteron, with 
the difference that the total energy E is positive for a free neutron, and 
hence all energy values are allowed. The wave function outside the 
radius R is no longer an exponential function but is a superposition of a 
plane wave, representing the stream of incoming particles, and expanding 
spherical waves, representing the scattered particles. The calculated 
scattering turns out to be relatively independent of the assumed shape 
of the potential in the case of low-energy neutrons, and the result is 
essentially fixed by the binding energy of the deuteron. 


that some factor enters the scattering problem which is not present in the 
deuteron. The missing factor was pointed out by E. P. Wigner in 1936, 
who observed that while they are in the ground state of the deuteron, the 
neutron and proton necessarily have parallel spin orientations (to produce 
the observed spin of 1 for the deuteron); no such restriction applies in the 
scattering of free neutrons, where both parallel and antiparallel spins 
occur. It is then possible to adjust the assumed interaction between 


terms of an excited (singlet) state of the deuteron, with J = 0 and a
negative binding energy of about 0.1 MeV. 


c. Charge Independence of Forces It was for some time generally 
believed that nuclear forces operated only between neutrons and protons, 
and a number of theories of nuclear structure were based on this
assumption. In 1935, however, experiments on the scattering of high-velocity 
protons in hydrogen gas showed that nuclear forces also play a role in 
the interaction of protons with protons. Further work on the same line 
has led to the conclusion that the nuclear part of the force between two 
protons (p-p) is the same as that between a proton and a neutron (p-n) 
to a high degree of accuracy. In addition, a comparison of energies of 
certain isobars in which neutrons are replaced by protons and protons by 
neutrons yields evidence for the equality of n-n, p-p, and p-n forces. 
The now generally accepted view that these three forces are essentially 
the same is often referred to as the assumption of charge independence 
of nuclear forces. This close similarity between neutrons and protons 
in their behavior in the nucleus suggests that these two particles are 
fundamentally closely related, that they may be regarded as two states, 
differing as to charge, of the same particle. 


d. Interaction of Nucleons in Nuclei: Exchange Forces Thus far, we 
have discussed mainly the character of the forces acting between pairs 
of nucleons, as observed in simple two-body systems. We have found 
that an adequate account of these interactions can be given under the 
assumption that the forces are derivable from a potential which depends 
somewhat upon the relative spin orientations of the particles but
apparently not upon whether they are protons or neutrons. When we come to 
consider systems comprising more than two particles, we must expect 
complications. Even if the forces between pairs of nucleons were known 
in detail, the mathematical complexity of the many-body problem would 
be formidable. Again, in order to obtain some insight into the binding 
of nuclei, we assume the simplest possible forces and test the consequences 
of these assumptions by experiment. 

The most natural assumption about internucleon forces is that each 
nucleon is attracted by every other one in the nucleus with a force like 
that acting between two isolated nucleons. That this is not the case 
can be shown from the fact that the observed binding energy per particle 
is essentially constant in all but the lightest nuclei. In a nucleus
containing A particles, A(A − 1)/2 pairs can be formed; if attractive forces 
existed between all such pairs, the total binding energy would be roughly 
proportional to A² rather than to A. The observed radii of nuclei 
increase with A as if each nucleon occupied a constant volume,
irrespective of how many others are present in the nucleus. It is evident then 
that the forces between nucleons “saturate” in some way; that once a 
nucleon has formed a few bonds, it ignores any further nucleons which 
may be added to the system. 
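
The pair-counting argument can be made concrete with a short sketch. It is purely illustrative: the function names are hypothetical, and no actual binding energies are used, only the number of pairs that would contribute if every nucleon attracted every other.

```python
def pair_count(a):
    """Number of distinct nucleon pairs in a nucleus of A particles."""
    return a * (a - 1) // 2


def pairs_per_nucleon(a):
    """Pairs per particle; this is what the binding energy per particle
    would track if all pairs were equally bound."""
    return pair_count(a) / a


if __name__ == "__main__":
    for a in (4, 16, 64, 238):
        print(f"A = {a:3d}  pairs = {pair_count(a):5d}  "
              f"pairs per nucleon = {pairs_per_nucleon(a):6.1f}")
```

The pairs-per-nucleon figure grows as (A − 1)/2, whereas the observed binding energy per particle is essentially constant; the discrepancy is the quantitative content of the saturation argument.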

A common hypothesis about internucleon forces in the nucleus is 
that they have at least in part an “exchange” character, i.e., that the law 
of force between two particles depends in a direct way upon the symmetry 
of the wave function with respect to their interchange. The binding 
of homonuclear molecules has something of this character, in that the 
effectiveness of the electrostatic force is determined by the spatial 
distribution of the charges, which in turn is governed by the symmetry 
requirements. In molecular binding, however, and in fact in all atomic 
phenomena, the forces are electrostatic or electromagnetic in origin 
and do not themselves contain any explicit dependence on symmetry 
properties. The energy difference between states of different symmetry 
character is then given by the exchange integral, which takes account 
of the tendency of particles to cluster together in some states and to 
avoid one another in other states. Such effects presumably operate 
within the nucleus, and a force depending only on distance, called an 
“ordinary” force, will have some saturation effects. Quantitative
calculations show that the effects so produced are insufficient, and it is
necessary to introduce a force which depends explicitly on the symmetry 
of the wave function; such a force is called an exchange force. Several 
types of exchange forces have been proposed; perhaps the simplest of 
these is the so-called Majorana interaction, in which it is assumed that 
two particles attract one another if the wave function describing the 
entire system does not change sign, i.e., is symmetric, when the spatial 
coordinates of the two particles are interchanged, and that they repel 
if the wave function changes sign, i.e., is antisymmetric. For this case 
we assume that the wave function describing a nucleus can be written 
as a combination of one-particle wave functions, each characterized by 
a set of quantum numbers n, l, mₗ, analogous to those used for atomic 
wave functions. Because of the Pauli principle, there can be at most 
four nucleons in any state with a given value of n, l, and mₗ: two protons 
with opposite spins and two neutrons with opposite spins. The wave 
function is automatically symmetric with respect to the exchange of 
the spatial coordinates of any pairs among these four particles, and 
hence there are attractive forces between all pairs. Thus an α particle, 
comprising four nucleons in the lowest state, will be a tightly bound
structure. If now a fifth nucleon is added, it must necessarily go into another 
state, and the wave function will be antisymmetric with respect to 
exchange of this particle with an identical particle in the first level. 
Exchanges with any of the other three lead to wave functions which are 
partly symmetric and partly antisymmetric. Since the antisymmetric 
pairs are assumed to lead to repulsive forces, the net effect is to make the 
five-nucleon system less strongly bound than the α particle (Li⁵ and He⁵ 
are in fact not bound, and do not occur in nature). As further nucleons 
are added, the number of both symmetric and antisymmetric pairs 
increases, and by a suitable adjustment of the assumed magnitudes of 
the resulting attractive and repulsive forces, the observed linear
dependence on A of the binding energy and volume per nucleon can be accounted 
for. Detailed studies of the scattering of high-energy (up to 350 MeV) 
neutrons and protons in hydrogen give strong evidence for the existence 
of exchange forces; however, the amount of repulsion observed in the 
antisymmetric interactions is too small to account for the saturation, and 
other factors must be considered. 


e. Nuclear Potential Although the details of the interactions between 
nucleons within an actual nucleus are complicated, it is possible and useful 
Fig. 25.7 Qualitative appearance of the wave function 
of an α particle escaping through the nuclear barrier of 
Fig. 25.6b. 


function of small amplitude. The ratio of the flux outside to that 
inside is a measure of the probability of escape of the α particle and hence 
of the disintegration constant. An approximate calculation, first made 
by Gamow, and independently by Gurney and Condon, in 1929, yields 
for the probability of escape 


$$\mathcal{P} = \exp\left[-\frac{2\sqrt{2M}}{\hbar}\int_R^{R'} \sqrt{P(r) - E}\; dr\right] \tag{25.19}$$


This expression for the probability is a sensitive function of the radius R, 
and, by comparing lifetimes predicted by the formula with those observed 
for known α emitters, one can estimate R. The values so obtained are 
generally consistent with Eq. (25.5), with R₀ = 1.2 to 1.4 × 10⁻¹⁵ m. 
The probability of escape is also a most sensitive function of the energy 
E; for example, the energies of the a particles from Po” and Th?» 
differ by less than a factor of 2, while the half-lives differ by a factor of 
about 107%) It is evident that the electrostatic forces play a decided role 
in many nuclear phenomena and that to separate out the effects of the 
nuclear forces may be a matter of some difficulty. 
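
The sensitivity of the escape probability to the energy E can be illustrated with a rough barrier-penetration estimate. The sketch below is an assumption-laden illustration, not the calculation of the text: it takes the barrier outside R to be the pure Coulomb potential of a point daughter nucleus, uses R = r₀A^{1/3} with r₀ = 1.3 × 10⁻¹⁵ m, adopts ħc = 197.327 MeV·fm, e²/4πε₀ = 1.44 MeV·fm, and an α-particle rest energy of 3727.4 MeV, and evaluates the exponent for a hypothetical daughter (Z = 88, A = 228) at two illustrative energies. All function names are hypothetical.

```python
import math

HBAR_C = 197.327      # MeV*fm (assumed)
E2 = 1.44             # e^2 / (4*pi*eps0) in MeV*fm (assumed)
ALPHA_MC2 = 3727.4    # MeV, alpha-particle rest energy (assumed)


def gamow_exponent(energy_mev, z_daughter, a_daughter, r0_fm=1.3):
    """(2/hbar) * integral of sqrt(2m(P - E)) dr across a pure Coulomb barrier,
    from the nuclear radius R out to the point b where P(b) = E."""
    radius = r0_fm * a_daughter ** (1.0 / 3.0)
    b = 2.0 * z_daughter * E2 / energy_mev        # outer turning point, fm
    x = radius / b
    if x >= 1.0:
        return 0.0                                 # energy above the barrier top
    wave_number = math.sqrt(2.0 * ALPHA_MC2 * energy_mev) / HBAR_C  # fm^-1
    # Closed form of the integral of sqrt(b/r - 1) dr from R to b
    shape = math.acos(math.sqrt(x)) - math.sqrt(x * (1.0 - x))
    return 2.0 * wave_number * b * shape


def log10_escape_probability(energy_mev, z_daughter, a_daughter):
    """Base-10 logarithm of exp(-exponent)."""
    return -gamow_exponent(energy_mev, z_daughter, a_daughter) / math.log(10.0)


if __name__ == "__main__":
    for energy in (4.0, 8.0):   # illustrative energies, not measured values
        print(f"E = {energy} MeV: log10(probability) = "
              f"{log10_escape_probability(energy, 88, 228):.1f}")
```

Under these assumptions, doubling the energy changes the escape probability by some twenty powers of ten, which is the kind of sensitivity the text describes. The numbers depend strongly on the assumed radius and should not be read as predictions for any particular emitter.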


g. Neutron-Proton Ratio in Stable Nuclides Saturation of nuclear 
forces is at least partially accounted for by their exchange character. 
Certain other features of the distribution of stable nuclides also find a 
natural explanation under this hypothesis. Thus, the observed
preference for near equality of N and Z follows immediately from the strong 
dependence of the nuclear force on the spatial symmetry of the wave 
function. Referring to our previous argument, we remember that four 
different nucleons (differing as to charge or spin orientation) can be placed 
in one n, l, mₗ level (Fig. 25.8a), where they attract one another strongly. 
If we wish to construct a nucleus from four protons, on the other hand, 
two would have to be placed in another level (Fig. 25.8b) and the total 
number of attractive bonds would be smaller and the system less stable. 


Fig. 25.8 Occupancy of nuclear quantum 
states by (a) two neutrons and two protons and 
(b) four protons. 


The same argument can be extended to larger numbers of nucleons and 
shows that nuclei with N = Z or N = Z + 2 are the most stable as 
far as the purely nuclear forces are concerned. As the number of
particles increases, the mutual electrostatic repulsion of the protons will 
begin to make itself felt, effectively reducing the potential to which the 
protons are subject (Fig. 25.6a). In consequence of this effect, the 
proton levels will be displaced from the neutron levels, as indicated 
schematically in Fig. 25.9, and it will become advantageous to add a 
few extra neutrons. Thus ₂₈Ni⁶⁰, with 28 protons and 32 neutrons, is 
more stable than ₃₀Zn⁶⁰. Since the electrostatic energy increases roughly 
as Z², this effect becomes more pronounced as Z increases, and for high 
Z there are not only many more neutrons than protons, but the average 
binding energy per particle is effectively decreased. 

Fig. 25.9 Neutron and proton levels in ₂₈Ni⁶⁰. Only the 
uppermost levels are shown. 

A word of warning is perhaps in order at this point concerning such 
diagrams as Fig. 25.9. Since the potential is contributed by the nucleons 
themselves, the shape of the potential function will change as nucleons 
are added or removed, and the location of levels representing other than 
the most loosely bound nucleons has little significance. The diagram 
indicates qualitatively the effect of adding or removing one or two nucleons 
or of moving one nucleon from one state to another, but even so the 
magnitude of the total energy change may be quite different from the 
value the diagram suggests. 
We observe a strong preference among the stable nuclides for even 
N (N = A − Z, the number of neutrons) and even Z, as opposed to 
odd N and odd Z, while nuclides with odd Z and even N and those with 
even Z and odd N occur in roughly equal numbers. These facts we 
can now understand in terms of the displacement of proton levels with 
respect to neutron levels. In ₂₈Ni⁶⁰ (Fig. 25.9), for example, there are 
14 filled proton levels, each with two protons (with spins opposed), and 16 
occupied neutron levels. If another particle is to be added, it must go 
into an unoccupied level, and the chances are about even that the next 
available level will be a neutron level. Thus among the stable isotopes 
with an odd total number of particles A, it is to be expected that about 
half will have odd N and about half odd Z. In Ni⁶⁰, it happens that the 
next available level is a neutron level, corresponding to Ni⁶¹; if now 
another particle is to be added, adding a second neutron will be more 
favorable than adding a proton, because the neutron level is lower and 
there is room for two particles in it. Thus ₂₈Ni⁶² is more stable than 
₂₉Cu⁶²; in fact Cu⁶² is radioactive and decays to Ni⁶². Had the proton 
level been the more stable, i.e., if Cu⁶¹ were more stable than Ni⁶¹, the 
preferred second particle would have been a proton, forming Zn⁶². In 
either case, the result is a preference for the even-N, even-Z combination 
over the odd-N, odd-Z, and the latter, in medium and heavy nuclei, is 
always unstable with respect to the former. Among the light elements 
there are a few cases of stable odd-N, odd-Z nuclei; in these instances, 
the electrostatic effect is relatively unimportant, and the small spin 
dependence of the force gives rise to a preference for equal N and Z. 
We have thus accounted for the main properties, the mass, charge, 
and size of the nuclides which are found in nature, making only rather 
general assumptions about the nature of nuclear forces. We have 
assumed that the forces act only over a short range, that they act approxi- 
mately equally strongly on neutrons and protons, and that they have a 
strong dependence on the symmetry of the wave function. On the other 
hand, we have seen that there exist several alternatives to the last assump- 
tion, and it is by no means clear that this assumption is either adequate 
or essential. 
About the structure of the nucleus, very little has been assumed in 
the foregoing. Whether the “quantum states” which we have used 
really represent the independent motions of individual nucleons, or 
whether the motions are so strongly perturbed that the “states” must 
be taken to refer to some sort of collective motion of the nucleons as a 
whole is not essential to these arguments. We discuss in later sections 
experiments which have a bearing on a specific model for the nucleus. 


Positrons, Artificial Radioactivity, Artificially 
Accelerated Particles 


25.7 Induced Radioactivity 


In 1930 Becker and Bothe bombarded Be with α particles and reported a 
spectrum of electrons extending up to 4.5 MeV, which they attributed 
to a γ ray of about this energy. To study the matter further, the
Joliot-Curies placed a lead foil and a polonium-beryllium source in a cloud 
chamber, and with a magnetic field of about 1000 gauss, proceeded to 
study the electrons ejected from the lead by the beryllium radiation. In 
1932 they observed a number of “electrons” which seemed to be moving 
toward the lead foil, since their tracks were curved in the wrong sense. 
(The sense of deflection depends, of course, both on the sign of the charge 
and the direction of motion.) It is now clear that the tracks were due 
to positrons. After the announcement of the discovery of the positron, 
the Joliot-Curies conducted further experiments on the radiation
produced in Be by α-particle bombardment. They confirmed that positive 
particles are produced when the hard γ radiation is converted in 
lead. In a similar experiment in which aluminum, instead of
beryllium, was bombarded by α particles, they found positrons again, 


Further study showed that 
the positrons were not produced in the converter but came from the 
aluminum source itself. Finally, in the course of an investigation of 
the effect of varying the α-particle energy, they noticed that the positrons 
did not appear immediately when the polonium source was put in place but 
built up gradually over a period of minutes. When the α particles were 
removed, the effect died down gradually, again within a period of some 
minutes. As a result of these experiments, the Joliot-Curies concluded 
in 1934 that the effect must be ascribed to a radioactive material produced 
by the α-particle bombardment. The reaction they suggested was 


$${}_{13}\mathrm{Al}^{27} + {}_{2}\mathrm{He}^{4} \rightarrow {}_{15}\mathrm{P}^{30} + {}_{0}n^{1}$$


The isotope P³⁰ does not exist in nature, and they assumed that it decays 
spontaneously, with emission of positrons, to a known stable isotope of 
silicon by the reaction 


$${}_{15}\mathrm{P}^{30} \rightarrow {}_{14}\mathrm{Si}^{30} + \beta^{+} + \nu$$


The half-life was 3.25 min. That the radioactive substance was an 
isotope of phosphorus they subsequently confirmed by showing that the 
active material had the same behavior under chemical analysis as
elemental phosphorus. 

In the same experiments, the Joliot-Curies found that boron
bombarded by α particles also produced a positron activity with a half-life 
of about 14 min. This activity they attributed to another new isotope, 
N¹³, formed in the reaction 


$${}_{5}\mathrm{B}^{10} + {}_{2}\mathrm{He}^{4} \rightarrow {}_{7}\mathrm{N}^{13} + {}_{0}n^{1}$$

and decaying to C¹³: 

$${}_{7}\mathrm{N}^{13} \rightarrow {}_{6}\mathrm{C}^{13} + \beta^{+} + \nu$$


They suggested that the same radioisotope might be produced by
artificially accelerated deuterons bombarding carbon, through the reaction 


$${}_{6}\mathrm{C}^{12} + {}_{1}\mathrm{H}^{2} \rightarrow {}_{7}\mathrm{N}^{13} + {}_{0}n^{1}$$


This reaction was reported by Crane and C. C. Lauritsen in 1934. 

Since the time of these early experiments, a great many radioactive 
isotopes have been produced artificially, with protons, deuterons, α
particles, neutrons, and many other particles as bombarding agents; more 
than 1200 species are known. In addition to the fact that the study of 
these radioisotopes has yielded an enormous amount of information about 
the nucleus, they have had a considerable practical importance,
particularly as tracers in chemical and biological investigations. In work of this 
kind, certain compounds are “tagged” by incorporating into them a 
radioactive isotope of one of their normal constituents, and the
subsequent progress of this constituent through various chemical changes is 
traced by means of its easily detectable radioactivity. This technique 
permits many operations not amenable to chemical analysis, such as study 
of the exchange of an element from one ionic form to another or 
of the rate at which an element is exhausted and replaced in a living 
organism. Radioisotopes of almost all the elements of biological interest 
are available: some of the most used are C¹⁴ (with a half-life of 5400 years), 
P³² (30 days), Na²⁴ (14 hours), and S³⁵ (87 days). 
25.8 Nuclear Transformations with Artificially 
Accelerated Particles 


From the time of the earliest observations of transmutations induced by 
particles from radioactive sources, there was speculation as to whether 
it might not be possible to use artificially accelerated positive ions for the 
purpose. One obvious advantage would lie in the much greater number 
of particles available, for even a current as small as 1 μA represents 
some 6 × 10¹² (singly charged) particles per second, or about the num- 

The prospect of inducing nuclear reactions with particles of practically 
attainable energies seemed rather hopeless, however, in view of the 
coulomb barrier. The barrier height of a nucleus of charge Z and mass 
number A for a singly charged particle is 


$$P = \frac{Ze^2}{4\pi\epsilon_0 R} \qquad \text{or} \qquad P \approx \frac{Z}{A^{1/3}}\ \text{MeV}$$


so that something of the order of 1 or 2 MV would seem to be required 
even for the lightest elements. 
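
Both numerical statements above are easy to check. The sketch below is only illustrative: it assumes an elementary charge of 1.602 × 10⁻¹⁹ C, e²/4πε₀ = 1.44 MeV·fm, and a nuclear radius R = r₀A^{1/3} with r₀ = 1.4 × 10⁻¹⁵ m (the upper end of the range quoted with Eq. (25.5)); the function names are hypothetical.

```python
E_CHARGE = 1.602e-19   # coulombs (assumed constant)
E2 = 1.44              # e^2 / (4*pi*eps0) in MeV*fm (assumed constant)


def particles_per_second(current_amperes, charge_state=1):
    """Number of ions per second carried by a beam of the given current."""
    return current_amperes / (charge_state * E_CHARGE)


def barrier_height_mev(z, a, r0_fm=1.4):
    """Coulomb barrier Z e^2 / (4 pi eps0 R) for a singly charged particle,
    with R = r0 * A^(1/3)."""
    radius_fm = r0_fm * a ** (1.0 / 3.0)
    return z * E2 / radius_fm


if __name__ == "__main__":
    print(f"1 uA beam: {particles_per_second(1e-6):.2e} particles per second")
    for name, z, a in (("Li7", 3, 7), ("Al27", 13, 27), ("Pb208", 82, 208)):
        print(f"{name}: barrier about {barrier_height_mev(z, a):.1f} MeV")
```

With r₀ = 1.4 × 10⁻¹⁵ m the coefficient e²/4πε₀r₀ is close to 1 MeV, which is why the barrier can be written roughly as Z/A^{1/3} MeV.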


discharge tube into which ions were injected from a Wien-type canal-ray 
tube. With this installation they obtained a beam of 300 keV protons 


125 keV energy bombarding a target of lithium metal, they observed 
bright scintillations, about five per minute with a beam of 1 μA. As the 
proton energy was increased, many more scintillations were observed; 
at 500 keV, they estimated the efficiency to be about 10⁻⁸ disintegrations 
per incident proton. Absorption measurements gave a value of 8.4 cm 
for the maximum range of the particles producing the scintillations, and 
measurements of the ionization showed that they were α particles. It 
was concluded by Cockcroft and Walton that the reaction involved was 


$${}_{3}\mathrm{Li}^{7} + {}_{1}\mathrm{H}^{1} \rightarrow {}_{2}\mathrm{He}^{4} + {}_{2}\mathrm{He}^{4}$$


From the (poorly known) masses, they estimated that 14.3 ± 
2.7 MeV should be available for kinetic energy of the α particles; from the 
observed range they calculated 17.2 MeV, in reasonable agreement. 
That two α particles were in fact produced simultaneously, in opposite 
directions, they checked with the arrangement shown in Fig. 25.10. The 
target consisted of a thin layer of lithium deposited on a thin sheet of 
mica, mounted at 45° to the incident proton beam. Two mica windows 
in opposite sides of the target tube allowed α particles produced in the 
target to escape and to be observed on the two scintillation screens S₁ and 
S₂. Two observers recorded the scintillations independently on a single 
moving tape: the existence of a large proportion of coincidences on the record 
was evidence that the process assumed was taking place. This conclusion 
was later confirmed by Dee and Walton with cloud-chamber photographs. 

A number of other elements were investigated by Cockcroft and 
Walton in 1932 to 1933. Both boron and fluorine were found to give 
comparatively large α-particle yields under proton bombardment; other 
elements, including Be, Na, and K, produced detectable numbers. The 
disintegration of Li, B, and F by 1.2-MeV protons from the cyclotron was 
reported by Lawrence, Livingston, and White in 1932, only a few months 
after Cockcroft and Walton’s first report. Oliphant, Kinsey, and
Rutherford observed the disintegration of Li and B by deuteron bombardment, 
and Crane, Lauritsen, and Soltan announced the artificial production of 
neutrons by helium-ion bombardment of beryllium in 1933. 


Fig. 25.10 Arrangement for 
detecting coincident particles 
produced in the proton
bombardment of Li⁷. 

Experiments on the disintegration of lithium with protons
provided the first direct quantitative check of Einstein’s equation, E = mc², 
relating mass and energy. The fact that all the masses involved were 
known from mass-spectrometer measurements and that the energy release 
was large compared to the errors of observation made it possible to check 
the agreement between the loss in mass and the observed kinetic energy 
within about 20 percent. Modern determinations from other nuclear 
reactions check this relation to high accuracy. 
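
The mass-energy bookkeeping for the lithium reaction can be sketched as follows. The atomic masses used here are modern rounded values supplied for illustration (they are not the values available to Cockcroft and Walton, and they are not quoted in the text), as is the conversion factor of 931.494 MeV per atomic mass unit; the names are hypothetical.

```python
U_TO_MEV = 931.494   # energy equivalent of one atomic mass unit, MeV (assumed)

# Rounded modern atomic masses in atomic mass units (assumed, for illustration)
MASS_U = {
    "H1": 1.007825,
    "He4": 4.002603,
    "Li7": 7.016003,
}


def q_value_mev(reactants, products):
    """Energy released: (mass of reactants - mass of products) * c^2."""
    mass_in = sum(MASS_U[name] for name in reactants)
    mass_out = sum(MASS_U[name] for name in products)
    return (mass_in - mass_out) * U_TO_MEV


if __name__ == "__main__":
    q = q_value_mev(["Li7", "H1"], ["He4", "He4"])
    print(f"Li7 + H1 -> 2 He4 releases about {q:.2f} MeV")
```

Atomic rather than nuclear masses may be used because the same number of electrons appears on both sides of the reaction. The result, a little over 17 MeV, is close to the 17.2 MeV that Cockcroft and Walton deduced from the observed range.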


25.9 Accelerators 


The years that have passed since the experiments of Cockcroft and Walton 
have seen a considerable development in techniques and equipment for 
production of high-energy charged particles. We shall enumerate here 
some of the principal types which have found application in problems of 
nuclear physics. 


a. High-tension Set Voltages up to a few million have been produced 
by straightforward cascade connection of transformer-rectifier sets. 
Because of the difficulty of building rectifiers capable of withstanding 
high voltages, such sets are ordinarily built in units of about 200 kV, 
either with individual transformers, each excited by a special winding on 
the previous one, or with a single transformer and some form of voltage- 
multiplying circuit. The principal load on such a set is often corona loss 
(local ionization of the air near sharp points), which may amount to sev- 
eral milliamperes, and, unless large condensers are used, an undesirable 
ripple in voltage results. For example, if the capacitance is of the order 
of 10⁻² μF, a drain of 1 mA produces a voltage change of 1.6 kV in 1/60 s. 
For many purposes, a ripple of this magnitude is acceptable, but situations 
do occur where a steadier voltage is desirable. For this reason, such 
installations are often designed to work at frequencies of 500 Hz or more. 
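
The ripple estimate follows from ΔV = I Δt / C. The sketch below assumes, as a simplification not stated in the text, that the condenser supplies the whole load current for one full period of the supply between rechargings; the capacitance of 10⁻² μF and the 60-Hz supply are likewise assumed illustrative values, and the function name is hypothetical.

```python
def ripple_volts(load_current_a, capacitance_f, frequency_hz):
    """Voltage drop of a condenser that supplies the load current
    for one full supply period: dV = I / (C * f)."""
    return load_current_a / (capacitance_f * frequency_hz)


if __name__ == "__main__":
    current = 1e-3        # 1 mA drain
    capacitance = 1e-8    # 10^-2 microfarad
    for frequency in (60.0, 500.0):
        dv = ripple_volts(current, capacitance, frequency)
        print(f"{frequency:5.0f} Hz: ripple about {dv / 1e3:.2f} kV")
```

Raising the supply frequency from 60 Hz to 500 Hz shortens the interval between rechargings and reduces the ripple in the same proportion, which is the reason given for operating at the higher frequency.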
A sketch of a typical accelerating tube is shown in Fig. 25.11. The 
tube is broken up into many sections, preferably with a controlled voltage 
for each section, partly to aid in focusing the ion beam and partly to dis- 
tribute the voltage gradient as uniformly as possible along the insulating 
surfaces. The positive ions to be accelerated are generated in an ion 
source, located on the top of the tube. Gas (hydrogen, if protons are 
desired) is admitted into a discharge tube, in which the atoms are ionized 
by electron bombardment. An electrode at the bottom of the ion source, 
provided with a fine hole and held at a negative potential with respect to 
the discharge tube, extracts the ions and injects them into the main 
accelerating tube. By means of a series of electrostatic lenses, the ions 
are collected into a narrow bundle as they are accelerated down the tube. 
Since a certain amount of gas also escapes into the tube, it is necessary to 
evacuate the tube continuously with large diffusion pumps. 

Fig. 25.11 Schematic diagram 
of a high-voltage accelerating 
tube and a 90° magnetic analyzer. 

The ion source produces, besides protons (H⁺), also singly charged 
ions H₂⁺ and H₃⁺. In addition, incidental ions formed in the residual 
gas in the main tube are collected in the beam and accelerated. It is 
necessary, therefore, to provide a magnetic deflection or combined mag- 
netic and electrostatic deflection at the lower end of the tube to reject 
the undesired components. With a properly designed magnetic or 
electrostatic analyzer, the measured field can be used to determine the 
particle energy with much higher accuracy than a direct voltage 
measurement can give. 


b. Electrostatic Accelerator A convenient way to produce a steady 
high voltage without the difficulties of ripple [...] 
devised by Van de Graaff in 1931. [...] 
schematic diagram of Fig. 25.12. Two rollers [...] 
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driven by a motor, the upper located in the high-voltage terminal, well 
insulated from ground. Over the rollers passes an endless belt of 
insulating material, and, across the face of the belt, opposite the lower 




Fig. 25.12 An electrostatic accelerator in which 
charges are sprayed on a moving belt and carried 
up to an insulated high-voltage terminal. 
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sions when high pressure is used. For example, clearances of the order 
of 10 to 15 ft are required around a 1-MV installation at atmospheric 
pressure, while 5- to 8-MV electrostatic accelerators can be housed in 
8- or 10-ft-diameter pressure tanks. 


c. Cyclotron High particle energies can be obtained without the use 
of high voltages by means of the magnetic-resonance accelerator, or 
cyclotron, devised by Lawrence and Livingston in 1932. In the cyclotron, 
charged particles are given repeated accelerations in a radio-frequency 
field. Under the influence of a strong magnet, the particles move 
alternately through field-free regions and regions of high field, in phase 
with the changing electric field. A diagram illustrating the principle 
is shown in Fig. 25.13. Within a flat, cylindrical vacuum chamber 
B, placed between the poles of a magnet (not shown), are two D-shaped 
electrodes $D_1$, $D_2$ (called dees) consisting of hollow, flat half-cylinders. 
The dees are coupled to, or made part of, a resonating electric circuit 
driven by a high-power radio-frequency oscillator, so that an alternating 
voltage appears across the gap separating the dees. An ion source C, 
located at the center of the chamber, supplies positive ions with a rela- 
tively low initial velocity. 

We consider now a positive ion which finds itself in the gap between 
the dees, moving upward during the part of the cycle when the upper dee 
in the diagram is negative. The ion is accelerated in the electric field 
into the field-free region within the dee. Under the influence of the 
magnetic field, which we take to be directed out of the paper, the particle 
traverses a semicircle, returning to the gap (but now moving downward) 
after a time $t = \pi/\omega_B$, where $\omega_B$ is its angular frequency in the magnetic 
field: 


$$\omega_B = \frac{v}{R} = \frac{eB}{M} \qquad (25.20)$$


Fig. 25.13 Schematic diagram of a cyclotron operating in a 
magnetic field out of the plane of the paper. 


If the angular frequency of the radio-frequency field $\omega_0$ is so adjusted 
that the potential difference of the dees has reversed during the time that 
the particle was inside the dee, i.e., if $\omega_0 = \omega_B$, the ion will again be 
accelerated. The crucial point is that the angular frequency $\omega_B$ is 
constant, independent of the velocity of the particle (insofar as the 
relativistic variation of mass can be neglected); as the velocity increases, 
the radius of curvature also increases and the time per half-turn remains 
constant. This process of acceleration every half period continues, with 
the particle moving in a series of connected semicircular arcs, until a 


Parallel beam is produced. 


The maximum energy which can be achieved depends upon the 
available magnetic field, the limiting radius $R_0$, and the nature of the 
particle: 


$$K_{\max} = \frac{p^2}{2M} = \frac{B^2 e^2 R_0^2}{2M} \qquad (25.21)$$


For example, a 60-in.-diameter cyclotron, operating at a magnetic field 
of 16,000 gauss, can produce 26-MeV deuterons with a radio frequency 
of 12 MHz. In principle, protons of twice this energy can be produced 
by doubling the frequency, but changing the frequency usually involves 
a major operation, so that cyclotrons are sometimes operated at reduced 
B for protons and produce only half the possible energy. 
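
The deuteron example can be checked numerically from Eqs. (25.20) and (25.21). The following is a minimal nonrelativistic sketch; the rounded physical constants and the deuteron mass of 2.0136 u are assumptions supplied here, and the function names are illustrative only.

```python
import math

E_CHARGE = 1.602e-19   # C (rounded)
AMU = 1.6605e-27       # kg (rounded)
MEV = 1.602e-13        # J per MeV (rounded)

def cyclotron_frequency_hz(b_tesla, mass_kg, charge_c=E_CHARGE):
    """Eq. (25.20): omega_B = eB/M, returned as f = omega_B / (2*pi)."""
    return charge_c * b_tesla / (2 * math.pi * mass_kg)

def max_kinetic_energy_mev(b_tesla, radius_m, mass_kg, charge_c=E_CHARGE):
    """Eq. (25.21): K_max = (B e R0)^2 / (2M), nonrelativistic."""
    return (b_tesla * charge_c * radius_m) ** 2 / (2 * mass_kg) / MEV

def radius_for_energy_m(k_mev, b_tesla, mass_kg, charge_c=E_CHARGE):
    """Orbit radius at which a particle reaches kinetic energy k_mev."""
    return math.sqrt(2 * mass_kg * k_mev * MEV) / (charge_c * b_tesla)

B = 1.6                  # 16,000 gauss expressed in tesla
M_DEUTERON = 2.0136 * AMU  # assumed deuteron mass

f_rf = cyclotron_frequency_hz(B, M_DEUTERON)
r_26 = radius_for_energy_m(26.0, B, M_DEUTERON)
print(f"radio frequency: {f_rf / 1e6:.1f} MHz")
print(f"orbit radius for 26 MeV: {r_26:.2f} m")
```

Under these assumptions the frequency comes out near 12 MHz, and the 26-MeV orbit radius is somewhat smaller than the 30-in. pole radius, as one would expect if the beam is extracted inside the edge of the field.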
d. Synchrocyclotron The constancy of the frequency $\omega_B$ in Eq. (25.20) 
breaks down when the particles have relativistic velocity (v ~ c), so that 
M deviates appreciably from the rest mass. This change could, in 
principle, be allowed for by shaping the magnetic field in such a way as to 
increase B slightly as the radius increases, but such a variation would 
introduce serious defocusing of the beam. The alternative, suggested 
independently by McMillan and Veksler in 1945, is to allow the 
magnetic field or the radio frequency to vary with time as the particles are 
accelerated in such a way as just to compensate for the changing mass. 
It can be shown that in either case there exist stable orbits in which the 
particles tend to keep in step with the changing fields. A cyclotron 
making use of this principle is called a synchrocyclotron. Ordinarily, a 
synchrocyclotron is used to accelerate heavy particles (protons, 
deuterons, or α particles), and the radio frequency is varied while the 
magnetic field is held fixed. In the Berkeley 184-in. frequency-modulated 
synchrocyclotron, which produces 340-MeV protons, the required 
frequency change is about 35 percent. For electron acceleration, or for 
acceleration of heavy particles to energies exceeding a few hundred MeV, 
the synchrotron is generally preferred. 


e. Synchrotron The synchrotron operates on essentially the same 
principle as the synchrocyclotron except that the particles are injected 
at near-relativistic speeds and the magnetic field is varied in time, with 
the radio frequency held constant. The particles are injected at a 
relatively low value of the magnetic field and pick up energy from the 
radio-frequency accelerating gap as the magnetic field increases. With 
constant radio frequency $\omega_0$ and essentially constant velocity (near c) the 
particles remain at nearly the same radius, $R = c/\omega_0$, throughout the 
acceleration, and only at this radius is a magnetic field needed. This 
feature means a great saving, since the entire center portion of the 
magnet is eliminated. A number of electron synchrotrons yielding energies 
of 1 GeV or more have been built. 

For heavy particles, it is not so easy to inject with v ~ c. For this 
reason, one ordinarily varies both the radio frequency and the magnetic 
field, in such a way as to keep the orbital radius approximately constant. 


f. Betatron The betatron or induction accelerator, invented by Kerst 
in 1940, is a device for accelerating electrons. Its operation depends 
simply on the fact that a changing magnetic flux induces an 
electromotive force in any enclosing circuit. From Faraday's law of induction, we 
have, for any closed path, 


$$\oint \mathbf{E} \cdot d\mathbf{l} = -\frac{d\phi}{dt}$$


An electron describing a circular path of radius R in a changing magnetic 
field is subject to a force f = eE. The rate of change of momentum 
p is then (in magnitude) 


$$\frac{dp}{dt} = eE = \frac{e}{2\pi R}\,\frac{d\phi}{dt}$$


where $\phi = \int \mathbf{B} \cdot d\mathbf{S}$ is the flux enclosed by the path. The momentum 
gained if the electron starts from rest when $\phi = 0$ is 


$$p = \frac{e\phi}{2\pi R}$$


On the other hand, the value of the magnetic field at the orbit required 
to keep the electron at the constant radius R is 


$$B_R = \frac{p}{eR}$$


Thus we have an equilibrium situation at all times if 


$$B_R = \frac{\phi}{2\pi R^2} \qquad (25.22)$$


i.e., if the value of $B_R$ at the orbit is one-half the average of B over 
the area bounded by the orbit. The required variation of B can be 
obtained by choosing a suitable shape for the pole pieces of the magnet. 
The fact that the so-called betatron condition [Eq. (25.22)] can be satisfied 
independently of relativistic considerations means that the device can 
be used to accelerate electrons to very high energy. The maximum 
energy is determined by the available magnetic field and the radius, 
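
As a rough illustration of how field and radius fix the attainable energy, the sketch below uses the relativistic relation p = e B_R R together with the betatron condition of Eq. (25.22). The orbit field (0.4 T) and radius (0.84 m) are hypothetical values chosen for the example, and the constants are rounded.

```python
import math

E_CHARGE = 1.602e-19    # C (rounded)
C_LIGHT = 2.998e8       # m/s (rounded)
MEV = 1.602e-13         # J per MeV (rounded)
ELECTRON_REST_MEV = 0.511

def orbit_field_for_flux(flux_wb, radius_m):
    """Betatron condition, Eq. (25.22): B_R = phi / (2 pi R^2)."""
    return flux_wb / (2 * math.pi * radius_m ** 2)

def electron_kinetic_energy_mev(b_orbit_t, radius_m):
    """Kinetic energy of an electron held at radius R by field B_R."""
    pc_mev = E_CHARGE * b_orbit_t * radius_m * C_LIGHT / MEV
    return math.sqrt(pc_mev ** 2 + ELECTRON_REST_MEV ** 2) - ELECTRON_REST_MEV

# Hypothetical machine parameters, for illustration only.
B_ORBIT = 0.4   # T
RADIUS = 0.84   # m

k_max = electron_kinetic_energy_mev(B_ORBIT, RADIUS)
# A uniform field B0 over the orbit gives flux pi R^2 B0 and would require
# an orbit field of B0/2, i.e. half the average enclosed field.
b_needed = orbit_field_for_flux(math.pi * RADIUS ** 2 * 1.0, RADIUS)
print(f"kinetic energy: {k_max:.0f} MeV, orbit field for B0 = 1 T: {b_needed:.2f} T")
```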


g. Linear Accelerator The linear accelerator also uses the principle 
of repeated accelerations, but the particles are allowed to travel in straight 
lines instead of being deflected in a magnetic field. The accelerator con- 
sists essentially of a long pipe, with coaxial cylindrical accelerating elec- 
trodes mounted inside, separated by short gaps. The whole chamber 


oscillators in such a 
along the axis, The 
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Nuclear Reactions and Nuclear Models 


With the availability of artificially accelerated particles, the number 
of nuclear reactions amenable to study has increased enormously. Thou- 
sands of reactions have been studied, many in considerable detail. 
In the following sections we shall consider some reactions which occur, 
the techniques used in studying them, and what can be said about 
nuclear models as a result of such studies. 


25.10 General Features of Nuclear Reactions 


a. Types The most commonly used bombarding particles are protons, 
deuterons, and neutrons, the latter themselves produced in a nuclear 
reaction. Artificially accelerated α particles are also frequently used, 
as well as H³ ions (tritons), He³ ions, and heavier ions. γ rays and x-rays 
can also be used as disintegrating agents, and some reactions induced by 
electron bombardment have been observed. In general, at moderate 
bombarding energies (less than 10 MeV), two products appear as a 
result of a nuclear reaction: a light particle, usually one of the above 
mentioned, and a heavier one, referred to as the residual nucleus. 
Three-body production sometimes occurs, and at high energies even more 
products may be observed. As a convenient classification of the simpler 
types of nuclear reactions, we enumerate the following categories. 


(I) ELASTIC SCATTERING The incident particle strikes the target 
nucleus and undergoes an elastic collision. When this process is viewed 
in a coordinate system in which the target nucleus is initially at rest (the 
laboratory system), the scattered particle loses energy because the target 
nucleus recoils. In the center-of-mass system, in which the two particles 
approach one another in such a way as to keep the center of mass at rest, 
there is no energy transfer. 


(II) INELASTIC SCATTERING The incident particle reappears after 
the interaction with a lower energy; some of its energy has been taken up 
by the target nucleus, which is excited to a higher quantum state. An 
example is 


Li⁷ + H¹ → Li⁷* + H¹ 


The asterisk is used to indicate that the residual nucleus is in an excited 
state. In the present example the excess energy is later radiated away 
in the form of a γ quantum. 




(III) SIMPLE CAPTURE The incident particle is captured by the 
target nucleus, and a new nucleus is formed. Nearly always the residual 
nucleus has a considerable excess of energy and radiates one or more 
γ quanta. An example is 


C¹² + H¹ → (N¹³) → N¹³ + γ 


Here the parentheses are used to indicate that the nucleus in question 
is in an excited state with an energy which is determined by the initial 
conditions of the reaction. 

Particularly in elements of high atomic number, it occurs not 
infrequently that the energy released in the transition between two nuclear 
quantum states is taken up directly by an atomic electron, without the 
intervention of a γ ray. In this process, called internal conversion, the 
oscillating electromagnetic field associated with the nuclear transition 
interacts with the atomic electrons, resulting in the ionization of the 
atom. Just as in the ordinary photoelectric effect, the transition energy 
is given by the sum of the observed energy of the ejected electron plus its 
binding energy in the atom. The electron lines so produced are very 
sharp, and they have been much used in determining the energy levels of 
the heavy elements. By comparing two internal-conversion electron 
lines from the same transition, for example, lines due to conversion by 
K- and L-shell electrons, one can determine the difference in the atomic 
binding energies and by comparing with the known values from x-ray 
data, one can identify (as to Z) the radiating atom. This procedure 
has been used to ascertain whether the transition in question follows or 
precedes the disintegration. The ratio of the number of transitions of a 
given energy which result in internal conversion to the number which 
yield γ rays is called the internal conversion coefficient α. In the natural 
radioactive elements, α normally lies in the range 10^-[...] to 10⁻¹. In a few 
cases, however, where the γ radiation is forbidden or strongly 
discouraged, as for example when the angular momenta of the initial and final 
states differ by several units, the internal conversion coefficient may be 
much larger than 1. It is possible by means of the quantum theory to 
calculate the internal conversion coefficients to be expected for various 
values of the angular-momentum change in any given transition, so that 
the experimental determination of α can yield important [...] 
the nuclear quantum states involved. [...] occurs in 
Po²¹⁴, where an excited state, at 1.41 [...] transition to the 
ground state entirely through internal [...] 
γ ray is observed at all. There is [...] between spin zero 
states being strictly forbidden. 
(IV) DISINTEGRATION On striking the target nucleus, the incident 
particle is absorbed and a different particle ejected. An example is the 
disintegration of beryllium by α particles, producing neutrons: 


Be⁹ + He⁴ → C¹² + n¹ 


(V) PHOTOEXCITATION AND PHOTODISINTEGRATION A γ ray is 
absorbed by the target nucleus, exciting it to a higher quantum state; if 
the energy is sufficient, a particle may be ejected. The 
photodisintegration of the deuteron, requiring 2.225 MeV, is an example: 


H² + γ → H¹ + n¹ 


A special case of photoexcitation of some interest is the so-called cou- 
lomb excitation, in which the varying electric field of a charged particle 
passing by a nucleus may induce transitions to higher states. 


(VI) PRODUCTION OF “STRANGE” PARTICLES In high-energy 
reactions mesons, hyperons, and antiparticles of the proton and neutron may 
be produced. Use of these particles to bombard targets leads to 
additional reactions. All the mesons and the unfamiliar members of the 
baryon group are brought forth in the laboratory through high-energy 
nuclear reactions. 


(VII) SPONTANEOUS DECAY β- and α-decay processes may be 
regarded as nuclear reactions; they differ from those discussed above in 
that the total energy of the system is not under the experimenter’s 
control. 


b. Notation It is convenient to have a shorthand notation for nuclear 
reactions; a commonly used form is the following: 


$X^A(a,b)Y^{A'}$ 


where $X^A$ and $Y^{A'}$ indicate the chemical symbol and mass number of the 
initial target and residual nuclei, respectively, and a and b the incoming 
(bombarding) and outgoing particles. Thus, the reaction first studied 
by Rutherford (Sec. 25.1) would be written 


N¹⁴(α,p)O¹⁷ 

A radioactive disintegration is sometimes represented as 


$X^A(\beta^+)Y^A$ 

for example, 

P³⁰(β⁺)Si³⁰ 


The concomitant emission of the neutrino is usually not indicated. A 
reaction leading to an excited state of the residual nucleus which 
subsequently decays by γ emission may be written in two stages, 


N¹⁴(α,p)O¹⁷*     O¹⁷*(γ)O¹⁷ 

or abbreviated, 

N¹⁴(α,pγ)O¹⁷ 

c. Conservation Laws In any nuclear reaction, certain quantities must 
be conserved: 


1. The total energy of the products, including both mass energy and kinetic 
energy of the particles, plus the energy of any γ rays or neutrinos which may 
be involved, must equal the mass energy of the initial ingredients plus the 
kinetic energy brought in by the bombarding particle (the target nucleus 
is ordinarily at rest). 

2. The total linear momentum of the products must be equal to the momentum 
brought in by the bombarding particle. 

3. The total electric charge is conserved. 

4. The total number of baryons is constant. 

5. The total angular momentum J, comprising the vector sum of the intrinsic 
angular momenta I (spins) and relative orbital momentum l of the particles, 
is conserved. If two particles collide head on, they have no relative orbital 
angular momentum and the value of J will be given by vector addition of 
the two intrinsic spins. In a glancing collision, orbital angular momentum is 
present and must be added (again vectorially) to the two I's to produce the 
total angular momentum J. In the general case, the beam of incident 
particles is represented as a plane wave which may be resolved into components 
representing various quantized values of the orbital angular momentum. 

6. The parity (Sec. 25.3) of the system determined by the target nucleus and 
bombarding particle must be preserved throughout the reaction. The initial 
system comprises the target nucleus and the incident particle, approaching 
with a certain fixed orbital angular momentum. If the wave function 
describing this situation remains unchanged under inversion of the 
coordinates, the parity is even; if the wave function changes sign, the parity is odd. 
The parity for such a system is the product (+1 for even parity, -1 for odd) 
of the intrinsic parities of the target nucleus and bombarding particle and 
the contribution from the orbital angular momentum l; for even l, the 
contribution is +1, for odd l, it is -1. 
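
Charge and baryon conservation can be checked mechanically for the reactions quoted in this section. The sketch below is illustrative only: the table of (Z, A) values and the helper name `is_balanced` are supplied here, and the check covers charge and baryon number but not energy, momentum, angular momentum, or parity.

```python
# (charge number Z, baryon number A) for the particles used in the text.
PARTICLES = {
    "gamma": (0, 0), "n": (0, 1), "H1": (1, 1), "H2": (1, 2),
    "He4": (2, 4), "Be9": (4, 9), "B10": (5, 10), "B11": (5, 11),
    "C12": (6, 12), "N13": (7, 13), "N14": (7, 14), "O17": (8, 17),
}

def totals(names):
    """Sum of Z and sum of A over a list of particle names."""
    z = sum(PARTICLES[name][0] for name in names)
    a = sum(PARTICLES[name][1] for name in names)
    return z, a

def is_balanced(initial, final):
    """True if total charge and total baryon number are both conserved."""
    return totals(initial) == totals(final)

REACTIONS = [
    (["N14", "He4"], ["O17", "H1"]),    # N14(alpha,p)O17
    (["Be9", "He4"], ["C12", "n"]),     # Be9(alpha,n)C12
    (["C12", "H1"], ["N13", "gamma"]),  # simple capture
    (["H2", "gamma"], ["H1", "n"]),     # photodisintegration of the deuteron
    (["B10", "H2"], ["B11", "H1"]),     # B10(d,p)B11
]

for initial, final in REACTIONS:
    print(" + ".join(initial), "->", " + ".join(final), is_balanced(initial, final))
```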


d. Nuclear-reaction Kinetics The energy balance in a typical nuclear 
reaction may be written 


$$M_0 c^2 + M_1 c^2 + E_1 = M_2 c^2 + M_3 c^2 + E_2 + E_3 \qquad (25.23)$$

where $M_0$ = exact mass of target particle 
$M_1$ = mass of bombarding particle 
$M_2$ = mass of ejected particle 
$M_3$ = mass of residual nucleus 
$E_1$, $E_2$, $E_3$ = kinetic energies of the corresponding particles 

The energy change or Q of the reaction is defined by the mass difference 


$$M_0 + M_1 = M_2 + M_3 + \frac{Q}{c^2} \qquad (25.24)$$


Evidently 


$$Q = E_2 + E_3 - E_1 \qquad (25.25)$$


If Q is positive, the reaction is exoergic, i.e., energy is released in the 
process; if it is negative, the reaction is endoergic, and energy is absorbed. 
The determination of Q is of interest both from the standpoint of measur- 
ing masses—as a check on mass-spectrometer values for stable isotopes, 
and as a method of obtaining masses for unstable ones, e.g., the neutron— 
and also, where excited levels of the residual nucleus occur, for determin- 
ing the energies of these levels. 

In an experiment to measure a Q value, the bombarding energy $E_1$ 
and the energy of the ejected particles $E_2$ at some specified angle θ are 
measured. The Q value can then be obtained by applying the laws of 
conservation of momentum. The momentum diagram for a typical 
reaction is illustrated in Fig. 25.14. We have 


$$p_{1x} = p_{2x} + p_{3x} \qquad 0 = p_{2y} + p_{3y} \qquad E_1 = E_2 + E_3 - Q$$


or, since $p = \sqrt{2ME}$ in the nonrelativistic case, 


$$\sqrt{2M_1E_1} = \sqrt{2M_2E_2}\,\cos\theta + \sqrt{2M_3E_3}\,\cos\phi$$
$$0 = \sqrt{2M_2E_2}\,\sin\theta + \sqrt{2M_3E_3}\,\sin\phi$$


We eliminate φ and $E_3$ to obtain Q in terms of $E_1$, $E_2$, and the masses: 


$$Q = E_2\left(1 + \frac{M_2}{M_3}\right) - E_1\left(1 - \frac{M_1}{M_3}\right) - \frac{2}{M_3}\sqrt{M_1 M_2 E_1 E_2}\,\cos\theta \qquad (25.26)$$


Fig. 25.14 Momentum diagram for a nuclear 
reaction. The incident particle has momentum 
$p_1$, the ejected particle momentum $p_2$, and the 
residual nucleus momentum $p_3$. 


Fig. 25.15 Arrangement for a precise Q-value measurement. The incident 
particles are analyzed by the magnet $A_1$ and strike the target T; the ejected 
particles are analyzed by $A_2$. 


Since the masses M rarely differ from the mass numbers A by more than 
a fraction of 1 percent, it is usually sufficiently accurate to use mass 
numbers in Eq. (25.26). Relativistic corrections are rarely necessary in 
work at moderate energies. 
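
Equation (25.26) is easily evaluated with mass numbers in place of exact masses. The sketch below is a minimal nonrelativistic implementation; the function name is illustrative, and the sample inputs (a 1.510-MeV deuteron on B¹⁰ with 9.6-MeV protons observed at 90°) are the figures quoted for the B¹⁰(d,p)B¹¹ reaction in Sec. 25.12.

```python
import math

def q_value(e1, e2, m1, m2, m3, theta_deg):
    """Q from Eq. (25.26), nonrelativistic.

    e1: bombarding energy, e2: energy of the ejected particle (same units),
    m1, m2, m3: masses (or mass numbers) of the bombarding particle,
    ejected particle, and residual nucleus; theta_deg: laboratory angle.
    """
    theta = math.radians(theta_deg)
    return (e2 * (1 + m2 / m3)
            - e1 * (1 - m1 / m3)
            - (2 / m3) * math.sqrt(m1 * m2 * e1 * e2) * math.cos(theta))

# B10(d,p)B11 observed at 90 degrees, using mass numbers.
q_dp = q_value(e1=1.510, e2=9.6, m1=2, m2=1, m3=11, theta_deg=90.0)
print(f"Q = {q_dp:.2f} MeV")
```

At 90° the cosine term vanishes, so Q reduces to a simple weighted difference of the two measured energies.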

A typical experimental arrangement for measurements of this kind is 
illustrated schematically in Fig. 25.15. A beam of particles, produced in 
some type of accelerator, enters the magnetic analyzer $A_1$, where it suffers 
a 90° deflection before striking the target T. The path of the beam is 
defined by the set of slits $S_1$, and the energy of the particles is then 
determined by a measurement of the magnetic field in $A_1$. Of the particles 
produced in the target T, some of those which emerge at θ = 90° enter 
the magnetic spectrometer $A_2$. A measurement of the magnetic field 
in $A_2$ which is required to pass these particles through the slit system $S_2$ 
to the detector D then permits determination of $E_2(90°)$, the energy of 
the particles emitted at θ = 90°. In principle, once the geometry is 
established, the Q-value determination requires only the measurement 
of the two magnetic fields. 

With modern techniques, it is possible to measure Q values, for 
nuclear reactions producing charged particles, to an accuracy of 0.05 per- 
cent or better, and a great many Q values are now known to this precision. 
As an example of the close agreement obtained in different laboratories, 
the Q values of the reaction Li⁷(p,α)He⁴ are shown in Table 25.1. The 
mean deviation of these four values is less than 3 parts in 10,000. 


Table 25.1 Q values for Li⁷(p,α)He⁴ 


Q, MeV              Source 
17.338 ± 0.011      California Institute of Technology 
17.340 ± 0.014      Massachusetts Institute of Technology 
17.352 ± 0.009      University of Birmingham (England) 
17.344 ± 0.013      Rice Institute 


e. Mass Determinations As one result of the growing collection of 
such precise Q values as the one just discussed, it has become possible 
to establish the masses of a large number of nuclides which do not exist 
in nature. The mass of the neutron, for example, can be determined 
relative to that of the hydrogen atom in the following set of reactions: 


H² + H² → H³ + H¹     $Q_1$ = 4.0337 ± 0.0017 MeV   (25.27a) 
H² + H² → He³ + n     $Q_2$ = 3.267 ± 0.007 MeV     (25.27b) 
H³ → He³ + β⁻         $Q_3$ = 0.0186 ± 0.0002 MeV   (25.27c) 


If the second of these relations is subtracted from the sum of the first 
and third, there results $[M(n) - M(\mathrm{H}^1)]c^2 = Q_1 + Q_3 - Q_2$. We obtain, 
for the neutron-hydrogen atom mass difference, n - H¹ = 0.785 MeV. 


A better value, based on many different measurements, is 


n - H¹ = 0.78245 ± 0.0001 MeV 
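
The combination of Eqs. (25.27a) to (25.27c) can be reproduced in a few lines. The sketch below also attaches an uncertainty by adding the quoted errors in quadrature; that step is an assumption made here (it presumes the three measurements are independent) and is not taken from the text.

```python
import math

# (value, uncertainty) in MeV, from Eqs. (25.27a)-(25.27c).
Q1 = (4.0337, 0.0017)   # H2 + H2 -> H3 + H1
Q2 = (3.267, 0.007)     # H2 + H2 -> He3 + n
Q3 = (0.0186, 0.0002)   # H3 -> He3 + beta-

# [M(n) - M(H1)] c^2 = Q1 + Q3 - Q2
n_minus_h = Q1[0] + Q3[0] - Q2[0]

# Assumed: independent errors, combined in quadrature.
sigma = math.sqrt(Q1[1] ** 2 + Q2[1] ** 2 + Q3[1] ** 2)

print(f"n - H1 = {n_minus_h:.3f} +/- {sigma:.3f} MeV")
```

On this assumption the uncertainty is dominated by $Q_2$, and the result is consistent with the better value quoted above.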


It is also possible, by combining reactions involving various nuclides, 
to derive independently of mass-spectrographic data a set of masses 
based on the standard C¹². For example, the Q value for the reaction 
N¹⁴(d,α)C¹² (the symbol d represents the deuteron) is accurately known. 
By combining this value with the Q values of several other reactions, the 
mass difference N¹⁴ - C¹² may be directly determined. In a similar 
way, other reactions can be found, connecting various of the light nuclei, 
and eventually a chain can be established leading directly to C¹². Such a 
chain necessarily contains many links, with errors which accumulate, 
but because it is mass differences which are determined, and these are 
small compared to the actual masses, the final accuracy is still good. 


25.11 Masses of Mirror Nuclides 


Among the many unstable nuclides whose masses are known from nuclear- 
reaction Q values, a set of particular importance is the so-called mirror 
nuclides. Two nuclei are said to be mirrors if one can be derived from 
the other by replacing all protons with neutrons and vice versa. Thus 
the mirror of ₅B¹¹, with five protons and six neutrons, is ₆C¹¹. Evidently 
it is not possible for both to exist in nature permanently, because one 
can be converted into the other by β decay; in the present instance, C¹¹ 
is unstable to positron emission and has a half-life of 20.4 min. The 
mass difference C¹¹ - B¹¹ can be found either from the observed end 
point of the positron spectrum, 


$$M(\mathrm{C}^{11}) - M(\mathrm{B}^{11}) = E_{\max}(\beta^+) + 1.022\ \text{MeV}$$


(Sec. 25.5b), or from the reaction B¹¹(p,n)C¹¹. The Q value of the (p,n) 
reaction has been determined with great precision, and is 


Q = -2.763 ± 0.001 MeV 

Taking into account the 0.782-MeV n - H¹ difference, we obtain 

$$M(\mathrm{C}^{11}) - M(\mathrm{B}^{11}) = 1.981\ \text{MeV}$$


The observed mass differences of mirror nuclides provide a sensitive 
comparison of the energies associated with p-p and n-n binding forces, 
for the only difference between two mirrors, as far as nuclear forces are 
concerned, is that some n-n interactions in one are replaced by p-p 
interactions in the other. This fact is most easily seen if we consider 
(for counting purposes only) a core, comprising equal numbers of protons 
and neutrons, which is the same for both nuclei, plus an odd neutron 
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in the one case or an odd proton in the other. The extra neutron in 
B¹¹, for example, interacts with the protons and neutrons in the “core” 
and has a certain number a of n-n bonds and another number b of n-p 
bonds. In C¹¹ the a bonds will be of the p-p character, while the b 
will remain of the n-p type. If now the p-p bonds formed in C¹¹ differ 
in strength from the n-n bonds of B¹¹, there will be a difference in the 
masses of B¹¹ and C¹¹ from this effect. It is important to note that 
this result, though here derived with a crude model, is actually quite 
general and is independent of such considerations as whether, for example, 
all the various n-p bonds are equally effective. 

In order to compare the contribution of the purely nuclear forces to 
the binding energies of B¹¹ and C¹¹ we write the masses in the following 
form: 


$$M(\mathrm{B}^{11}) = 5M(\mathrm{H}^1) + 6M(n) - \frac{E_N}{c^2} + \frac{E_C}{c^2} \qquad (25.28a)$$
$$M(\mathrm{C}^{11}) = 6M(\mathrm{H}^1) + 5M(n) - \frac{E_N'}{c^2} + \frac{E_C'}{c^2} \qquad (25.28b)$$


where the terms $E_N$, $E_N'$ represent the binding energy due to nuclear 
forces and the terms $E_C$ and $E_C'$ represent the electrostatic energy of 
the protons. An approximate expression for the latter was given in 
Eq. (25.17) (Sec. 25.6); we find, from this expression, $E_C$(B¹¹) = 5.50 MeV, 
$E_C'$(C¹¹) = 8.25 MeV. Subtracting the second of the relations above 
from the first and solving for the difference of the nuclear binding 
energies, we obtain 


$$\frac{E_N - E_N'}{c^2} = M(\mathrm{C}^{11}) - M(\mathrm{B}^{11}) + M(n) - M(\mathrm{H}^1) + \frac{E_C}{c^2} - \frac{E_C'}{c^2} \qquad (25.29)$$


Inserting the measured values of the mass differences times $c^2$, measured 
in MeV, 


$$E_N - E_N' = 1.981 + 0.782 + 5.50 - 8.25 = 0.01\ \text{MeV}$$


Since the average binding energy per particle is 5 or 10 MeV, it would 
appear that the assumption that p-p and n-n forces are equal is quite 
well supported by the observations. Of course the result depends to 
some extent on the validity of the electrostatic correction, but since 
this quantity occurs as a difference, even rather major changes in the 
assumed model could hardly alter the conclusion. Among the 20 or so 
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mirror pairs for which the masses are known, the apparent differences 
in the nuclear contribution to the binding energies range from a few tens 
of keV up to a few hundred. The largest discrepancies occur among the 
lightest nuclei, where the assumption of a uniform charge distribution 
might be expected to be least accurate. We shall have more to say about 
the comparison of mirror nuclei in a later section. 
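
The arithmetic of this section for the B¹¹, C¹¹ pair can be collected in one place. The sketch below simply re-derives the quoted numbers; the variable names are illustrative, and the electrostatic energies are taken as given in the text rather than recomputed from Eq. (25.17).

```python
# All energies in MeV, as quoted in the text for the B11 / C11 mirror pair.
Q_PN = -2.763        # Q of B11(p,n)C11
N_MINUS_H = 0.782    # neutron minus hydrogen-atom mass difference
EC_B11 = 5.50        # electrostatic energy of B11 (from Eq. 25.17)
EC_C11 = 8.25        # electrostatic energy of C11 (from Eq. 25.17)

# M(B11) + M(H1) = M(C11) + M(n) + Q  =>  M(C11) - M(B11) = -Q - (n - H1)
mass_difference = -Q_PN - N_MINUS_H

# Eq. (25.29): difference of the purely nuclear binding energies.
nuclear_difference = mass_difference + N_MINUS_H + EC_B11 - EC_C11

print(f"M(C11) - M(B11) = {mass_difference:.3f} MeV")
print(f"E_N - E_N'      = {nuclear_difference:.3f} MeV")
```

The nuclear difference comes out at about 0.01 MeV, small compared with the several MeV of electrostatic energy that cancels in the subtraction.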


25.12 Particle Groups 


It is frequently found in the study of the products of nuclear reactions 
that particles of more than one energy (or Q value) are present. This 
multiplicity of groups was early observed in the natural α-radioactive 
elements (Sec. 10.1) and was there interpreted as indicating the existence 
of excited quantum states in the residual nucleus. In Fig. 25.16 is 
shown a part of the spectrum of proton groups observed in the 
bombardment of B¹⁰ by deuterons. The energy of the protons was measured 
in a semicircular magnetic spectrograph (see Fig. 25.15) placed at 90° 
to the incident beam, which in these observations had an energy of 
1.510 MeV. The group labeled 1 on the figure corresponds to the 
transition to the ground state of B¹¹. From the figure, one can estimate 
the magnetic rigidity Br of these particles as 0.448 Wb/m. The energy 
is then 


$$E_2 = \frac{p^2}{2M} = \frac{(Br)^2 e^2}{2M} = 1.5 \times 10^{-12}\ \text{J} = 9.6\ \text{MeV}$$

From Eq. (25.26), we obtain 


$$Q = -\tfrac{9}{11}E_1 + \tfrac{12}{11}E_2 = -1.23 + 10.5 = 9.3\ \text{MeV} \qquad (25.30)$$


The 1967 value from this and other experiments is 9.2314 ± 0.0006 MeV. 
That the group is properly identified can be tested by measuring $E_2$ at 
some other value of the incident deuteron energy; if the Q value is 
the same, the mass numbers appearing in Eq. (25.30) have been correctly 
assigned. By this test, it was ascertained that groups 1, 3, 6, and 7 
pertain to the present reaction, while the remaining groups are 
associated with some contaminants in the target. The latter point was 
checked by separate bombardment of targets composed of the suspected 
contaminant materials. 
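
The conversion from magnetic rigidity to proton energy used above can be sketched as follows. The constants are rounded values supplied here, the function name is illustrative, and the formula is the nonrelativistic one, which should be adequate at about 10 MeV.

```python
E_CHARGE = 1.602e-19    # C (rounded)
M_PROTON = 1.6726e-27   # kg (rounded)
MEV = 1.602e-13         # J per MeV (rounded)

def energy_from_rigidity_joule(br_tesla_m, mass_kg=M_PROTON, charge_c=E_CHARGE):
    """Nonrelativistic kinetic energy E = (Br)^2 e^2 / (2M), in joules."""
    momentum = charge_c * br_tesla_m   # p = e * (Br)
    return momentum ** 2 / (2 * mass_kg)

e2_joule = energy_from_rigidity_joule(0.448)   # Br = 0.448 Wb/m
e2_mev = e2_joule / MEV
print(f"E2 = {e2_joule:.2e} J = {e2_mev:.1f} MeV")
```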


Fig. 25.16 Part of the spectrum of proton groups from the reaction B¹⁰(d,p)B¹¹. [Van Patter, Buechner, and Sperduto, Phys. 
Rev., 82:248 (1951).] 

The Q values for the B¹⁰(d,p)B¹¹ groups of Fig. 25.16 are listed in 
Table 25.2. In the right-hand column are indicated the differences 
between each Q value and that corresponding to the ground state. These 
differences give directly the energies of excited states of B¹¹. 


Table 25.2 Proton groups¹ observed in the 
reaction B¹⁰(d,p)B¹¹ 


Proton group    Q, MeV              Excitation energy in B¹¹, MeV 
1               9.2329 ± 0.0034     0 
3               7.1083              2.1246 ± 0.0011 
6               4.7871              4.4458 ± 0.0021 
7               4.2137              5.0192 ± 0.0024 


¹ Browne and O'Donnell, Phys. Rev., vol. 149, p. 767, 1966. 


In Fig. 25.17 is shown an energy-level diagram which illustrates the energetics 
involved. The difference in mass between the initial ingredients, B10 + 
H?, and the final products, BU + H; is Q/c?, equivalent to 9.231 MeV, 
If the total energy is plotted ona vertical scale, the initial state (ignoring 
for a moment the bombarding energy) lies this amount higher than the 
final state. Since the zero of energy is arbitrary, we may for convenience 


the difference M(B! + He — H) x ce? — M(B») x c? by a 9.231-MeV 
Vertical displacement. All energies on the diagram are then referred to 
the ground state of Bu, Considering now the 1.510-MeV bombarding 
energy, we can see that a part of this energy goes into kinetic energy of 


of the nuclei. A straightforward com utation shows that a fraction 
Mo/ (Mı + Mo), or # in the present instance, of the bombarding energy 
is available in the system in which the center of mass is at rest. The 
total energy available in the center-of-mass system, relative to the 


the transition to the ground state, represented by the longest slant arrow 
on the diagram, the proton and B" recoil share this energy in euch 8 

sys- 
tem are then obtained by adding vectorially the velocity of the center 
of mass.) If the B" is left excited in the 2.1-MeV or higher levels; ‘the 
kinetic energies will be correspondingly less, as indicated by the shorter 
arrows. Many other proton groups are known in addition to those in 
Table 25.2; the corresponding levels are shown in the diagram. The 
Subsequent decay of most of the levels shown proceeds by y radiation, 


either directly to the ground state or by cascades through lower states; 


`~ 
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B10 + H? — H! 


Fig.25.17 Energy-level diagram for B", Slanted 
arrows represent proton groups; vertical arrows 
show 7-ray transitions. 


some of the possible combinations are indicated by vertical arrows on the 
diagram. In many cases, it has been possible to check the level energies 
obtained from particle groups by direct measurements of the γ-ray
energies. 

The energy levels represented in Fig. 25.17 do not necessarily repre- 
sent the energy states of any single nucleon. Just as in the similar 
atomic diagrams, a given level represents the energy of the system as a 
whole and need not refer to any specific model; thus the line at 2.12 MeV,
for example, means only that at this energy above the ground state there
exists a particularly long-lived configuration or mode of excitation, which
may involve one or many particles. For this reason also, it is convenient
to plot energies from the ground state (as is done in x-ray diagrams)
rather than from the ionization potential, which, in the nucleus, has
various values for different particles.


25.13 Nuclear Resonances


The yield of a nuclear reaction, i.e., the number of processes observed 
per incident particle, usually depends strongly on the bombarding energy. 
A curve showing the yield as a function of energy is called an excitation
function. With charged particles of low energy, the penetrability of the 
coulomb barrier increases rapidly with energy, and the yield will reflect 
this variation. In addition, sharp peaks, called resonances, may also be 
observed in the excitation function, and the properties of these resonances 
are of considerable interest to the nuclear physicist. 

A striking example of a resonance effect occurs in the reaction
Li⁷(p,γ)Be⁸. When Li⁷ is bombarded by protons, there is produced, in
addition to the α particles, which were discussed earlier, a γ radiation of
more than 17-MeV energy. The process responsible for this γ ray is the
simple capture of the proton by the Li⁷ nucleus, forming Be⁸, which then
radiates the excess energy in 1 quantum. From the known masses,
the Q of the reaction is


$$M(\mathrm{Li}^7) + M(\mathrm{H}^1) - M(\mathrm{Be}^8) = \frac{Q}{c^2} = 0.018521\ \mathrm{u} \qquad Q = 17.252\ \mathrm{MeV}$$


The energy available for the γ radiation is the Q value plus that part
of the proton's kinetic energy which is available in the center-of-mass
system

$$E_\gamma = 17.252 + \tfrac{7}{8}E_p\ \mathrm{MeV} \tag{25.31}$$

where E_p is the bombarding energy measured in the laboratory coordinate
system, i.e., the system in which the target nucleus is at rest.
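As a numerical check on Eq. (25.31), the following short Python sketch evaluates the γ-ray energy for a given laboratory proton energy. The function names are illustrative only, and the masses are approximated by mass numbers (1 for the proton, 7 for Li⁷), which is what gives the factor 7/8.

```python
# Sketch: energy available to the gamma ray in Li7(p,gamma)Be8, after Eq. (25.31).
# Assumption: nuclear masses are approximated by their mass numbers.

Q_MEV = 17.252  # Q value quoted in the text, in MeV


def cm_fraction(m_projectile, m_target):
    """Fraction of the lab bombarding energy available in the center-of-mass system."""
    return m_target / (m_projectile + m_target)


def gamma_energy_mev(e_proton_mev, m_projectile=1.0, m_target=7.0):
    """E_gamma = Q + [M2 / (M1 + M2)] * E_p, all energies in MeV."""
    return Q_MEV + cm_fraction(m_projectile, m_target) * e_proton_mev


if __name__ == "__main__":
    # The 441-keV resonance discussed in this section.
    print(f"E_gamma = {gamma_energy_mev(0.441):.3f} MeV")
```

For the 441-keV resonance this gives about 17.64 MeV. The same `cm_fraction` helper gives 10/12 = 5/6 for deuterons on B¹⁰.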

If the yield of γ radiation from a very thin layer of lithium is
measured as a function of bombarding energy, the curve shown in Fig. 
25.18 results. The principal feature of this curve is a sharp maximum 
in the yield at a proton energy of 441 keV; the yield is reduced to one-half 
the maximum value by a change in either direction of only 6 keV. The 
observed variation of the yield is well approximated by the Breit-Wigner 
Fig. 25.18 Resonance in the yield of γ rays from the reaction Li⁷(p,γ)Be⁸
(yield of γ rays plotted against proton energy in keV). [T. W.
Bonner and J. E. Evans, Phys. Rev., 73: 666 (1948).]


resonance formula

$$Y(E) = A(E)\,\frac{\Gamma^2/4}{(E - E_r)^2 + \Gamma^2/4} \tag{25.32}$$

where Y(E) is the yield of γ rays at the proton energy E, and E_r is the
value of the proton energy at the peak of the curve. The quantity Γ
is the width of the curve in energy units, measured between the two points
where the yield is one-half the maximum. The factor A(E) is a function
of energy which varies sufficiently slowly to be disregarded for our
purpose.
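The shape of the Breit-Wigner formula can be sketched in a few lines of Python. This is an illustration, not a fit to the measured curve. It treats the slowly varying factor A(E) as a constant and takes the full width as 12 keV, inferred from the statement that the yield falls to half its maximum 6 keV on either side of the 441-keV peak.

```python
# Sketch: Breit-Wigner resonance shape with A(E) treated as constant.
# Assumptions: peak at 441 keV, full width 12 keV (half maximum at +/- 6 keV).


def breit_wigner(e, e_r, width, amplitude=1.0):
    """Yield Y(E) = A * (width^2/4) / ((E - E_r)^2 + width^2/4)."""
    quarter_sq = width ** 2 / 4.0
    return amplitude * quarter_sq / ((e - e_r) ** 2 + quarter_sq)


if __name__ == "__main__":
    e_r, width = 441.0, 12.0  # keV
    for e in (429.0, 435.0, 441.0, 447.0, 453.0):
        print(f"E = {e:5.1f} keV  relative yield = {breit_wigner(e, e_r, width):.3f}")
```

The yield equals the amplitude at the peak and exactly half of it at E_r ± Γ/2, which is how the width is read off the measured curve.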

The form of Eq. (25.32) suggests that at the energy corresponding
to E_r, the system formed by Li⁷ + H¹ has an approximately stationary
quantum level. The remarkable feature of this level is that it lies in the
region which the atomic spectroscopist would refer to as the continuum.
Let us consider for a moment the analogous situation in the spectrum
of the hydrogen atom. There the solutions of the Schrödinger equation
are of two kinds, depending on the sign of the total energy measured
with respect to the energy of the system comprising an ion with the
electron at rest at infinity. If the total energy is negative, corresponding
to a bound electron, the solution yields a set of discrete, nearly stationary¹
quantum states. If, on the other hand, the energy is positive,
i.e., if the electron has kinetic energy at infinity, a continuum of states 
is allowed. When a hydrogen ion captures an electron in one of these 
positive-energy states, the energy available is equal to the ionization 
potential of 13.6 eV plus the kinetic energy of the electron, measured 
in the system in which the center of mass is at rest. If this energy is 
radiated in 1 quantum, the resultant radiation forms part of the continuum 
beyond the limit of the Lyman series. In Fig. 25.19a is shown a level 
diagram for the hydrogen atom illustrating the bound, discrete states 
below the ionization potential and the continuum above; a transition 
corresponding to capture of a positive-energy electron is indicated by 
the vertical arrow. Because there exist no discrete states of positive

¹ Nearly stationary in the sense that if the interaction with the radiation field
is included, the quantum state decays in time, corresponding to a radiative transition
to a lower level.


Fig. 25.19 Energy states measured from the ground state for (a)
the hydrogen atom (in electron volts) and (b) the Be⁸ nucleus (in
MeV).
energy, the probability of capture of a free electron is a monotonic
function of energy. 

The nuclear case is illustrated in Fig. 25.19b. The ionization
potential is now the separation energy of the proton, 17.252 MeV,
measured with respect to the ground state of Be⁸. Indicated on the right
side of the diagram is the yield curve of Fig. 25.18, with the yield now
plotted horizontally, to the left, against energy on a vertical scale. The 
state formed by 441-keV protons lies at 17.64 MeV [Eq. (25.31)], and the
transition to the ground state results in a γ ray of this energy. Several
other excited levels of Be⁸ are also indicated.

The width of any quantum state is determined by its lifetime, for 
according to the uncertainty principle, if a system exists only for a time 
Δt, its energy can be defined only to an accuracy ΔE = ℏ/Δt. Actually
all atomic and nuclear quantum states, except the ground states, have 
finite widths because they can decay to lower states by radiation; they 
represent, for this reason, only approximately stationary solutions of the 
Schrödinger equation. The lifetime is determined by the ease with
which the decay can proceed. Bound states decay chiefly by γ radiation,¹
a process which is relatively slow on a nuclear time scale, and hence
are ordinarily quite narrow. On the other hand, if a state is to be even
approximately stationary in the unbound region, there must be some
mechanism which prevents or delays particle emission.

A part of the mechanism which delays the emission of charged
particles and makes possible the existence of relatively narrow levels
above the binding energy is the nuclear barrier. For even if it is energetically
possible for a particle to leave the nucleus, it may have to
strike the barrier at the nuclear surface many times before it escapes.
The phenomenon of radioactive α decay among the heavy elements
is an example. In light nuclei, the barrier is less important but may still
play a significant role when the available energy is low. In the case of
the Li⁷(p,γ)Be⁸ resonance, once the state is formed, it is mainly the
coulomb barrier which delays the reemission of the proton and the
combination lives sufficiently long for γ emission to be possible. Calculation
of the barrier effect gives the result that, once the proton is
inside the Be⁸ nucleus, the probability of escape per appearance at the
nuclear surface is about 10⁻². The velocity of the proton (corresponding
to its roughly 10-MeV kinetic energy inside the barrier) is about 5 ×
10⁷ m/s. If the proton simply oscillated back and forth across the
nucleus, without interacting with the other nucleons, it would make
100 traversals in a time of the order of 100 × 6 × 10⁻¹⁵/(5 × 10⁷), or
10⁻²⁰ s. The observed mean life of the level is Δt = ℏ/Γ = 6 × 10⁻²⁰ s.


¹ In some cases internal conversion, nuclear pair formation, or β decay occur
instead of, or in addition to, γ radiation, but these are also slow processes.
In view of the uncertainties in the calculations we may regard this fair
agreement as evidence that the long life of this level is mainly ascribable 
to the barrier effect. 
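The two time estimates compared above can be reproduced with a short Python sketch. The value of ℏ in eV·s is a standard constant supplied here, and the 12-keV width is inferred from the 6-keV half-width quoted for the resonance. The path length and speed are the rough figures used in the text.

```python
# Sketch: barrier dwell-time estimate versus mean life from the level width.
# Assumptions: hbar = 6.582e-16 eV s; full width 12 keV (twice the 6-keV half-width).

HBAR_EV_S = 6.582e-16  # reduced Planck constant in eV s


def mean_life_s(width_ev):
    """Mean life of a level from its width, delta_t = hbar / width."""
    return HBAR_EV_S / width_ev


def dwell_time_s(n_traversals, path_m, speed_m_s):
    """Time for n traversals of a path of given length at constant speed."""
    return n_traversals * path_m / speed_m_s


if __name__ == "__main__":
    # 100 traversals of about 6e-15 m at about 5e7 m/s (figures from the text).
    print(f"barrier estimate : {dwell_time_s(100, 6e-15, 5e7):.1e} s")
    print(f"hbar / width     : {mean_life_s(12e3):.1e} s")
```

Both results are of the order of 10⁻²⁰ s, which is the "fair agreement" referred to in the text.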

The time required to radiate a γ quantum is estimated to be about
10-!° s, so that about one interaction in a thousand will result in y radia- 
tion; in the remainder of the cases the proton reemerges seeming to have 
been elastically scattered. This process, called resonance scattering,
results in a characteristic interference with the normal Rutherford 
scattering. 

When several types of decay are possible for a given quantum state,
the total decay rate is the sum of the decay rates for the individual
processes, and the reciprocal of the mean life is the sum of the reciprocals
of the individual mean lives

$$\frac{1}{t} = \frac{1}{t_a} + \frac{1}{t_b} + \frac{1}{t_c} + \cdots$$

where t_a may represent the mean time required for a proton to escape,
t_b the mean life for γ radiation, and t_c, · · · , other conceivable processes.
It follows then that the observed width of a level Γ may be
regarded as made up of partial widths Γ_a = ℏ/t_a, characteristic of the
individual modes of decay.

In the present instance, it is energetically possible for the level to
decay by α-particle emission; in fact the reaction Li⁷(p,α)He⁴ has a Q
value of 17.347 MeV. With this amount of energy available, the α
particles might be expected to leave rapidly, contributing thereby an
extremely large width to the level. That this does not occur is attributed
to the operation of a rigid selection rule. The character, as to total
angular momentum and parity, of the compound system formed by
proton bombardment of Li⁷ depends upon the orbital angular momentum
brought in by the protons (different l values corresponding in the classical
sense to different impact parameters), and at any given bombarding
energy, several different kinds of compound systems may be formed.
Thus protons with l = 0 (head-on collisions) will produce states with
I(Be⁸) = I(Li⁷) + I(H¹) = 3/2 ⊕ 1/2 = 2 or 1, by vector addition. With
l = 1 protons, states of I(Be⁸) = 3, 2, 1, or 0 may be formed. States
with odd total angular momentum cannot possibly decay into two
identical particles with zero spin,¹ and so if the Be⁸ state in question
has I = 1 or 3, it cannot produce α particles and its width will be determined
entirely by the partial widths for proton or γ emission. It has

¹ The relative orbital angular momenta of two identical Bose-Einstein particles
are restricted to even integral values by the requirement that the wave function be
symmetric.
been confirmed that the state actually has I = 1 and is formed by l = 1
(p-wave) protons.
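The vector-addition bookkeeping used in this argument can be sketched in Python. The helper names are illustrative. The spins 3/2 for Li⁷ and 1/2 for the proton are those used in the text. The sketch handles angular momentum only and leaves out parity.

```python
# Sketch: total angular momenta of the compound state formed by Li7 + p
# for a given orbital angular momentum l (angular momentum only, no parity).
from fractions import Fraction


def couple(j1, j2):
    """Totals allowed by vector addition: |j1 - j2|, ..., j1 + j2 in unit steps."""
    j1, j2 = Fraction(j1), Fraction(j2)
    lowest, highest = abs(j1 - j2), j1 + j2
    return [lowest + k for k in range(int(highest - lowest) + 1)]


def compound_spins(target_spin, projectile_spin, l):
    """Couple the two intrinsic spins first, then add the orbital momentum l."""
    totals = set()
    for channel_spin in couple(target_spin, projectile_spin):
        totals.update(couple(channel_spin, l))
    return sorted(totals)


def alpha_forbidden(spins):
    """Odd totals cannot decay into two identical spin-zero particles."""
    return [i for i in spins if i % 2 == 1]


if __name__ == "__main__":
    for l in (0, 1):
        spins = compound_spins(Fraction(3, 2), Fraction(1, 2), l)
        print(f"l = {l}: I = {[int(i) for i in spins]}, "
              f"no alpha decay for I = {[int(i) for i in alpha_forbidden(spins)]}")
```

For l = 0 this yields I = 1 or 2, and for l = 1 it yields I = 0, 1, 2, or 3, as stated in the text.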


25.14 Liquid-drop Model 


There occur many cases of resonances much too narrow, i.e., much too
long-lived, to be accounted for by barrier effects. Conspicuous among
these are the extremely sharp resonances observed in capture of slow
neutrons (Sec. 25.15). It was in an attempt to explain these cases that
N. Bohr proposed the liquid-drop model of the nucleus in 1936. Prior to
this time, most theories of the nucleus were based on analogy with the
atom, in the sense that the nuclear particles were thought of as moving
more or less independently of one another in a potential well which
simply represented the average effect of the other nucleons and which took
no other account of their existence. The resonances were thought to
arise mainly from the motion of a single nucleon, and interactions with
other particles were treated by perturbation methods, just as in atomic
theory.

In the liquid-drop model the interaction of nucleons is treated as a
major effect, and any incoming particle is regarded as almost immediately
losing its identity once it enters the nucleus. The course of a
nuclear reaction is thus divided into two distinct stages. In the first
stage, the incoming particle enters the target nucleus, forming a compound
nucleus, and quickly shares its energy with nucleons already present,
so that no single particle has sufficient energy to escape. The
second stage, the decay or disintegration of the compound system, occurs
some time later, when the energy is again accidentally concentrated on
some one particle or when the energy is lost by radiation. In this view,
the compound nucleus embodies more than a formal description of a
transitory interaction between the incident particle and the target
nucleus: it is a system which may have an independent existence for a
time many orders of magnitude longer than the time required for its
formation, and its intrinsic properties have a decisive influence on the
outcome of the reaction. There may be several ways of forming a given
compound nucleus with a given energy, by various combinations of
bombarding particles and target nuclei. Subject to certain conservation
laws relating to the angular momentum and parity, the character of the
subsequent disintegration of the compound nucleus is considered to be
independent of the details of its formation, the outcome in a particular
instance being determined entirely by competition between the various
possible modes.


As a simple analog for this picture of the course of nuclear reactions,
Bohr suggested the liquid drop. In such a drop, the binding forces
between molecules hold the system together and prevent evaporation
unless heat is added from outside. We suppose now that an additional
molecule is introduced; since it was initially free, it must gain kinetic
energy in going through the surface of the drop. In principle, it gains
sufficient kinetic energy to permit its immediate reevaporation the next
time it appears at the surface; it is likely, however, to collide with
another molecule and lose some of its energy in the meantime. In the
course of further collisions, the energy is divided among all the molecules
present and no one has sufficient energy to evaporate. From the point of
view of the drop as a whole, the temperature has been raised slightly.
The condition of the drop is thus defined completely by the number of
particles and the total energy available, and its future behavior, e.g.,
the eventual evaporation of some molecule through an accidental concentration
of energy, will be quite the same as if the extra energy had
been supplied to it in the form of heat.

Because of the many ways in which the extra energy made available
by the capture of a particle can be shared among the other particles in
the nucleus, a rather long time may elapse before the compound nucleus
decays. Consequently the energy of the compound nucleus may be
defined with relatively sharp limits. It is observed, e.g., in the capture
of low-energy neutrons, that resonances only a fraction of an electron
volt wide may occur. From the relation ΔE Δt ≈ ℏ, we find for the
mean life of a level 1 eV wide, Δt = 0.7 × 10⁻¹⁵ s. Comparing this
number with the characteristic time, of the order of 10⁻²² s, for a single
traversal of the nucleus by an incoming neutron,¹ we find that the
neutron travels some 10⁶ or 10⁷ nuclear diameters during the mean life


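$$\mathrm{C}^{12} + \mathrm{H}^2 \rightarrow (\mathrm{N}^{14}) \rightarrow \begin{cases} \mathrm{C}^{13} + \mathrm{H}^1 \\ \mathrm{N}^{13} + n \\ \mathrm{B}^{10} + \mathrm{He}^4 \\ \mathrm{C}^{12} + \mathrm{H}^2 \\ \mathrm{N}^{14} + \gamma \end{cases}$$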

The compound nucleus here (N¹⁴) has an excitation energy of more than
10 MeV; with the extra energy brought in by deuterons of 2 MeV or more,


¹ Inside the nucleus, the kinetic energy is about 10 to 15 MeV, corresponding
to a velocity of about 5 × 10⁷ m/s. The diameter of a medium-weight nucleus is
about 1 to 1.5 × 10⁻¹⁴ m.




it can eject a proton, a neutron, or an a particle, or it can reemit a deu- 
teron. Resonances have been observed in the first two reactions and 
presumably occur for the others also. The same compound nucleus can 
be formed by bombarding C¹³ with protons. The disintegration of a
compound nucleus of given energy is independent (subject to conservation
of spin and parity) of the mode of formation and emphasizes the
dominant role which the properties of the compound nucleus play in
determining the course of nuclear reactions.


25.15 Neutron Reactions 


At the time of the discovery of the neutron, it was realized that such a 
particle might provide a powerful tool in inducing nuclear reactions, and 
the first cloud-chamber studies of neutron recoils in nitrogen gave evidence
for a number of transmutations produced by neutrons. The
efficacy of the neutron is attributable to the fact that it has no charge 
and hence has little or no interaction with the atomic electrons or with 
the coulomb field of the nucleus. When a beam of protons passes
through matter, by far the greatest number are brought to rest through 
electronic interactions, and only a few produce a nuclear reaction. Neu- 
trons travel through matter indifferent to the electric fields, and most 
of them end up by producing a nuclear reaction. The fact that they 
do not produce ions directly makes them somewhat more difficult to 
detect, but they can be counted with acceptable efficiency through their 
nuclear encounters.

At low bombarding energies (less than a few MeV) positively 
charged particles find it difficult to enter the target nucleus because 
of the coulomb barrier. In the classical theory, they could not enter 
unless they had sufficient energy to surmount the barrier, but wave 
mechanics admits a small penetrability. For a 200-keV proton striking 
an O¹⁶ nucleus, for example, the probability of penetrating is of the order
of 10⁻³ as great as for a neutron of the same energy. At lower energies
and/or higher atomic numbers the disparity is greater.
a. Production of Neutrons The bombardment of beryllium with α particles
provides a convenient source of neutrons for experiments. Where
high intensity is not required, a radioactive material is simply mixed
with beryllium powder and sealed in a small container. With polonium
α particles (5.3 MeV), about 1 α particle in 10⁴ produces a neutron, so
that the number of neutrons from a source of, say, 10 millicuries strength
is 3.7 × 10⁸ × 10⁻⁴ = 3.7 × 10⁴ neutrons/s. Stronger sources can be
made with artificially accelerated particles. For example, in the reaction
Be⁹(d,n)B¹⁰ with 10 μA of 10-MeV deuterons, some 4 × 10¹¹ neutrons
are produced per second. For many purposes, it is desirable that the
neutrons be homogeneous in energy. Monoenergetic neutrons can be
produced in several reactions; a commonly used one is Li⁷(p,n)Be⁷,
which has a negative Q value of 1.644 MeV. Neutrons first appear at
a proton energy of 1.881 MeV and can be produced with any desired
energy¹ by varying the bombarding voltage. A copious source of low-energy
neutrons is the nuclear reactor.
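The two numerical statements in this paragraph, the yield of a polonium-beryllium source and the threshold for Li⁷(p,n)Be⁷, can be checked with a short Python sketch. The function names are illustrative, the threshold formula is the usual nonrelativistic one, and the masses are approximated by mass numbers, so the threshold comes out slightly below the 1.881 MeV quoted.

```python
# Sketch: neutron source strength and reaction threshold.
# Assumptions: 1 curie = 3.7e10 disintegrations/s; masses taken as mass numbers.

DISINTEGRATIONS_PER_CURIE = 3.7e10


def neutron_rate(activity_curie, neutrons_per_alpha):
    """Neutrons per second from an alpha emitter mixed with beryllium."""
    return activity_curie * DISINTEGRATIONS_PER_CURIE * neutrons_per_alpha


def threshold_mev(q_mev, m_projectile, m_target):
    """Nonrelativistic laboratory threshold energy for an endoergic reaction (Q < 0)."""
    return -q_mev * (m_projectile + m_target) / m_target


if __name__ == "__main__":
    print(f"10 mCi Po-Be source: {neutron_rate(10e-3, 1e-4):.2e} neutrons/s")
    print(f"Li7(p,n)Be7 threshold: {threshold_mev(-1.644, 1.0, 7.0):.3f} MeV")
```

The threshold exceeds |Q| by the factor (M₁ + M₂)/M₂ because part of the bombarding energy is tied up in motion of the center of mass.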


b. Neutron Total Cross Section Because neutrons are essentially unaffected
by the atomic electrons, a measurement of the attenuation of
a neutron beam in passing through matter provides direct and useful
information on the properties of nuclei. As a quantitative measure of
the probability of an interaction between a moving particle and a nucleus,
the term cross section is used (Sec. 7.9). To define this quantity in the
present connection, we consider a parallel beam of I particles per second
crossing an area A in which there is located a single target nucleus;
we assume that of the I particles a certain number i suffer some kind of
interaction with the target nucleus, for example an elastic scattering.
The cross section for this process is then defined as

$$\sigma = \frac{\text{number of interactions/target nucleus}}{\text{number of incident particles/unit area}} = \frac{i}{I/A}$$

or, equivalently,

$$\sigma = \frac{\text{number of interactions/(target nucleus/unit area)}}{\text{number of incident particles}} = \frac{iA}{I}$$


Thus, if a target contains n nuclei per unit area perpendicular to an
incident beam of I particles per second, the number of interactions per
second is Iσn. In a layer of material of thickness dx containing N nuclei
per unit volume, the probability of an interaction is

$$dP = \sigma N\,dx \tag{25.33}$$


The cross section has the dimensions of an area and is, in the classical 
sense, simply equal to that portion of the projected “area” of the target 
nucleus which is effective for the process in question. 

¹ For proton energies greater than 2.37 MeV, an excited state of Be⁷ may be
formed, and a second group of neutrons appears.

The total cross section σ_t for interaction of neutrons with nuclei is
made up of several terms, including the cross sections for elastic scattering,
inelastic scattering, disintegration, and simple capture. For reasonably
fast neutrons, the total cross section can be crudely estimated as
equal to the projected area of the nucleus πR², where R is the radius of
action of the nuclear force field. Actually, because even with high-energy
neutrons wave-diffraction effects are important, an effective area
of 2πR² is a better estimate. Experimental values of σ_t can be obtained
by measurement of the attenuation of a narrow beam of neutrons in a
known thickness of material. If such a beam, of intensity I₀ neutrons
per second, is incident on a slab of material of thickness T, the number of
neutrons emerging unchanged in direction is

$$I = I_0\,e^{-\sigma_t N T} \tag{25.34}$$

From a measurement of I/I₀, the cross section and hence the nuclear
radius can be found. The values obtained generally agree with Eq.
(25.5), with values of R₀ lying in the neighborhood of 1.4 × 10⁻¹⁵ m.
For a medium-weight nucleus, πR² is an area of the order of 10⁻²⁸ m², or
1 barn.
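The way a transmission measurement leads to a cross section and a radius can be sketched in Python. The target density, thickness, and cross section in the example are hypothetical values chosen only for illustration. The exponential law is Eq. (25.34), and the radius uses the 2πR² estimate given above.

```python
# Sketch: neutron attenuation in a slab, and the inverse problem.
# Assumptions: hypothetical target with N = 5e28 nuclei/m^3, thickness 2 cm.
import math

BARN_M2 = 1e-28  # 1 barn in square meters


def transmission(sigma_barn, nuclei_per_m3, thickness_m):
    """Fraction I/I0 of neutrons emerging unchanged in direction."""
    return math.exp(-sigma_barn * BARN_M2 * nuclei_per_m3 * thickness_m)


def total_cross_section_barn(ratio, nuclei_per_m3, thickness_m):
    """Total cross section (barns) recovered from a measured ratio I/I0."""
    return -math.log(ratio) / (nuclei_per_m3 * thickness_m) / BARN_M2


def radius_from_cross_section_m(sigma_barn):
    """Nuclear radius R from the fast-neutron estimate sigma = 2*pi*R^2."""
    return math.sqrt(sigma_barn * BARN_M2 / (2.0 * math.pi))


if __name__ == "__main__":
    ratio = transmission(3.0, 5e28, 0.02)
    sigma = total_cross_section_barn(ratio, 5e28, 0.02)
    print(f"I/I0 = {ratio:.3f}, recovered sigma = {sigma:.2f} barns")
    print(f"R = {radius_from_cross_section_m(sigma):.2e} m")
```

A cross section of a few barns corresponds to a radius of several times 10⁻¹⁵ m, consistent with the nuclear radii quoted in the text.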


c. Neutron-Proton Scattering Generally, the most probable result of a
collision between a neutron of moderate energy (of the order of 1 to
5 MeV) and a light nucleus is a simple elastic scattering. This was the
process observed by the Joliot-Curies and by Chadwick in their experi- 
ments on the recoils produced in hydrogen and in nitrogen. One type 
of neutron detector consists essentially of an ionization chamber lined 
with some hydrogenous material; the neutrons eject protons from the 
wall by elastic collision and the protons are counted through their ionizing 
ability. Measurements of the elastic scattering of neutrons in hydrogen 
are of considerable interest in that they lead to direct information on the 
character of nuclear forces (Sec. 25.66). 

An arrangement for a scattering experiment is shown in Fig. 25.20.
A collimated monoenergetic beam falls on the scatterer S. Those
neutrons which are scattered through the (variable) angle θ are counted


Fig. 25.20 Neutrons passing through
collimator C are scattered in the block of
material S and counted at detector D.

Fig. 25.21 Scattering cross section for neutrons in hydrogen (1 barn =
10⁻²⁸ m²).
or polyethylene. The observed scattering is then the sum of the effects
from the hydrogen and carbon nuclei, and the necessary correction is
determined by an auxiliary experiment with a carbon scatterer. The
result of the experiment is expressed in terms of the number of neutrons
scattered into the detector as a function of the angle θ and of the energy
of the incident neutrons. If the solid angle subtended by the detector and
the number of scattering nuclei are known, the differential cross section
dσ/dΩ, that is, the cross section for scattering into unit solid angle, can be
calculated. For neutron energies above 1 eV and less than 10 MeV, the
scattering in hydrogen turns out to be spherically symmetric when the
results are expressed in the center-of-mass system. The total cross section
for elastic scattering, obtained by integrating the differential cross
section over all values of θ, exhibits a smooth variation in energy, decreasing
monotonically from the value 20.4 barns at a few electron volts to
about 2 barns at 4 MeV (Fig. 25.21).


d. Neutron-induced Reactions In addition to undergoing elastic scattering,
neutrons produce nuclear reactions, of which many result in
radioactive nuclides. An extensive study of these effects made by
Fermi and his collaborators in the years 1934 to 1936 led to the identification
of a number of reactions of this character. The experimental
arrangement in these early investigations was simple. A neutron source
(0.5 curie of radon mixed with beryllium powder) was used to irradiate
a sample of some element for a short time, after which the sample was
transferred to the neighborhood of a Geiger counter and examined for
radioactivity. Generally, the samples were in the form of cylinders
which could be slipped over the source and then over the cylindrical
Geiger counter in order to obtain the most favorable geometry. When
an activity was produced, the half-life was determined, and wherever
possible a chemical identification of the radioisotope was made. Of
some 60 elements investigated, about 40 showed radioactivity. Among
the lighter elements (F, Mg, Al, Si, and several others) it was established
that the activity resulted from (n,p) or (n,α) reactions. For example,
bombardment of silicon produced radioactive aluminum with a half-life
of 2.3 min; the process is

$$\mathrm{Si}^{28} + n^1 \rightarrow \mathrm{Al}^{28} + \mathrm{H}^1$$


Reactions of the character X(n,p)Y, where X is a stable nuclide, 
invariably lead to radioactive products, for Y is an isobar of X, differing 
in charge by one unit, and according to the rules developed in Sec. 25.5, 
it is not possible for both X and Y to be stable. For the same reason 
(p,n) reactions also lead to unstable products when the target is stable. 
Most (n,p) reactions are endoergic, because the product nucleus is 
usually unstable by more than the 0.78 MeV which the n − H¹ mass
difference makes available. A few cases exist in which (n,p) reactions
have positive Q values; He³(n,p)H³ and N¹⁴(n,p)C¹⁴ are examples.
The (p,n) reactions are all endoergic by more than 0.78 MeV. 
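The energy balance behind these statements can be sketched in Python. If the product Y of an X(n,p)Y reaction β-decays back to X with a total decay energy E_β, then Q = (n − H¹ mass difference) − E_β. The decay energies below are standard values supplied for illustration and are not taken from the text.

```python
# Sketch: Q value of X(n,p)Y from the beta-decay energy of the product Y.
# Assumptions: n - H1 mass difference 0.782 MeV; decay energies are standard
# values supplied for illustration, not quoted in the text.

N_MINUS_H_MEV = 0.782

BETA_DECAY_ENERGY_MEV = {
    "He3(n,p)H3": 0.0186,    # tritium decay energy
    "N14(n,p)C14": 0.156,    # C14 decay energy
    "Si28(n,p)Al28": 4.642,  # Al28 decay energy
}


def q_np_mev(beta_decay_energy_mev):
    """Q of an (n,p) reaction whose product decays back to the target nuclide."""
    return N_MINUS_H_MEV - beta_decay_energy_mev


if __name__ == "__main__":
    for reaction, e_beta in BETA_DECAY_ENERGY_MEV.items():
        q = q_np_mev(e_beta)
        kind = "exoergic" if q > 0 else "endoergic"
        print(f"{reaction:15s} Q = {q:+.3f} MeV ({kind})")
```

Only products that are unstable by less than about 0.78 MeV give a positive Q, which is why the two light-nucleus cases are exceptional.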

The (n,α) reactions are generally endoergic in light- and medium-weight
nuclei and are relatively weak because the coulomb barrier
discourages the escape of low-energy α particles from the compound
nucleus. Two important exoergic reactions are

Li⁶(n,α)H³   Q = 4.786 MeV      B¹⁰(n,α)Li⁷   Q = 2.792 MeV


Both have high cross sections for low-energy neutrons and, because they 
produce charged particles, are much used for neutron detectors. If a 
proportional counter is filled with boron trifluoride gas (preferably
enriched in B¹⁰), neutrons entering the counter produce α particles which,
because of their dense ionization, give rise to large pulses. The associated
amplifier can then be set to reject small pulses due to γ-ray
background, and the counter responds only to neutrons. 
In nuclei with high Z, the coulomb barrier becomes so high that
neutron reactions leading to charged-particle emission are improbable
except with high (>10 MeV) bombarding energies. The important
processes in heavy nuclei with moderate energies are elastic scattering,
inelastic scattering, and simple capture followed by γ radiation (Sec.
25.10). Among the reactions of this type which were identified by
Fermi's group are Au¹⁹⁷(n,γ)Au¹⁹⁸, in which a strong activity of 2.7-day
half-life is produced, and I¹²⁷(n,γ)I¹²⁸, resulting in a radioisotope of
iodine with a half-life of 25 min. As a rule, the cross section for radiative
capture is small compared with that for elastic and inelastic
scattering when the neutron energy exceeds a few keV.


e. Slow Neutrons In the course of their investigations of radioactivity 
induced in silver by neutrons, Fermi and his collaborators observed that 
the amount of activation produced appeared to depend on the geometrical 


containing substances near the neutron source and silver sample and, 
finally, that surrounding the whole arrangement with a large quantity 
of water or paraffin enormously increased the yield of radioactivity. 
Further tests showed that the enhancement of the yield in the presence 


When neutrons pass through a substance containing hydrogen, they 
are strongly scattered and rapidly lose energy in elastic collisions. A 
neutron striking a proton transfers an amount of energy [see Eq. (25.1)]

$$\Delta E = E_0 \sin^2\theta$$

where θ is the angle of scattering in laboratory coordinates. For neu-
trons in the energy region of interest here, the scattering is isotropic in 
center-of-mass coordinates, and all possible values of the energy loss are 
equally probable. It follows that the energy of the neutrons is reduced 
on the average by one-half in a collision with a proton; the average energy 
of a group of neutrons which had initially the same energy E₀ will after
n collisions per neutron be

$$\bar{E}_n = E_0 \left(\tfrac{1}{2}\right)^n$$

However, in determining the average energy, the relatively few neutrons 
with high energy are given a disproportionate weight; a better index of 
the energy distribution for our purposes is the median energy. The
median energy can be shown to be approximately, again for collisions
in hydrogen,

$$E_m \approx E_0\,e^{-n}$$

From this we can compute that of a group of 1-MeV neutrons, half will 
have energies less than 1 eV after about 14 collisions. The mean free path 
per collision, defined as the average distance traveled between collisions, 
is given by λ = 1/σN, where σ is the scattering cross section and N the
number of protons per unit volume. Inserting the measured values of
σ (Fig. 25.21) we find for 1-MeV neutrons a mean free path of about 3.3 cm
in water and for 1-keV neutrons, about 0.7 cm (we ignore the collisions
with oxygen nuclei since they have relatively little effect on the energy).
Thus a layer of water 10 cm thick around the neutron source reduces
most of the neutrons to an energy where they are in equilibrium with the
thermal motion of the water molecules (about 1/40 eV at 290°K). Neutrons
reaching thermal energies are moving in random directions, and
their motion is not unlike the diffusion of a gas. 
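The slowing-down figures in this passage can be reproduced with a short Python sketch. It assumes the approximate median law E_m ≈ E₀e⁻ⁿ for collisions in hydrogen, which is consistent with the "about 14 collisions" quoted. The hydrogen density of water and the use of 20.4 barns for 1-keV neutrons are illustrative assumptions not stated in this form in the text.

```python
# Sketch: moderation of neutrons in hydrogen and mean free path in water.
# Assumptions: median energy E0 * exp(-n); water contains about 6.69e28
# hydrogen nuclei per m^3; sigma = 20.4 barns is used for 1-keV neutrons.
import math

BARN_M2 = 1e-28
HYDROGEN_PER_M3_WATER = 6.69e28


def mean_energy(e0, n):
    """Average energy after n collisions with protons (halved per collision)."""
    return e0 * 0.5 ** n


def median_energy(e0, n):
    """Approximate median energy after n collisions with protons."""
    return e0 * math.exp(-n)


def collisions_to_reach(e0, e_final):
    """Collisions needed for the median energy to fall from e0 to e_final."""
    return math.log(e0 / e_final)


def mean_free_path_m(sigma_barn, nuclei_per_m3):
    """Mean free path, lambda = 1 / (sigma * N)."""
    return 1.0 / (sigma_barn * BARN_M2 * nuclei_per_m3)


if __name__ == "__main__":
    print(f"collisions, 1 MeV to 1 eV: {collisions_to_reach(1e6, 1.0):.1f}")
    path_cm = 100.0 * mean_free_path_m(20.4, HYDROGEN_PER_M3_WATER)
    print(f"mean free path at 1 keV: {path_cm:.2f} cm")
```

Because a few energetic neutrons dominate the mean, the mean energy after 14 collisions is still far above the median, which is the point made in the text.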

The fact that slowing down the neutrons greatly increases their 
effectiveness in producing nuclear reactions is easily explained when the 
problem is examined from the point of view of wave mechanics. In 
our earlier discussion, we estimated the maximum cross section for inter- 
action of fast neutrons with nuclei by appealing to a simple geometrical 
picture of the projected area presented by a nucleus to an incoming par- 
ticle. A usual measure of the applicability of such geometrical pictures 
is the ratio of the reduced wavelength λ/2π to the nuclear dimensions. For
a fast, say 5-MeV, neutron, the reduced wavelength is

$$\frac{\lambda}{2\pi} = \frac{\hbar}{\sqrt{2EM}} = 2 \times 10^{-15}\ \mathrm{m}$$

which is sufficiently smaller than the nuclear radius for visualization
of the neutron's trajectory to present no difficulty. For neutrons of
1 eV energy, on the other hand, λ/2π = 0.5 × 10⁻¹¹ m, many times
larger than the nucleus. In this situation it becomes appropriate to 
think of the nucleus as a small diffracting center in a very long wave. 
Instead of trying to trace individual trajectories we regard the incident 
neutrons as a beam, with a certain average probability of having a 
neutron in a unit volume. In such a beam, containing I₀ neutrons of
velocity v crossing unit area per second, the number of neutrons per unit
volume is ρ = I₀/v. The probability per unit time that a neutron will
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be captured is proportional to the density of neutrons 
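
$$N_{\text{capt}} \propto \rho = \frac{I_0}{v}$$

But the number of captures per unit time is, by definition, proportional
to σI₀. Thus

$$\sigma_{\text{capt}} \propto \frac{N_{\text{capt}}}{I_0} \propto \frac{1}{v} \qquad (25.35)$$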

Equation (25.35), often referred to as the 1/v law, is accurately
confirmed in many neutron processes. The cross section for the reaction
B¹⁰(n,α)Li⁷, for example, shows no appreciable deviation from
the 1/v dependence over the energy range from 10-2 to nearly 10? eV, 
and the activation of silver follows the law from 10-? to about 1 eV. 
In some nuclides, the observed cross sections for thermal neutrons 
(E = kT = 1/40 eV) are many thousand times greater than those for fast
neutrons. One isotope of xenon has a thermal cross section of 3.5 ×
10⁶ barns!
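
The two numerical claims of this discussion, the reduced wavelengths at 5 MeV and 1 eV and the size of the 1/v enhancement, can be checked with a short sketch. It uses the nonrelativistic relation λ/2π = ħ/√(2EM) from the text; the constants ħc and Mc² are standard values supplied here, and the function names are illustrative only.

```python
import math

HBAR_C_MEV_FM = 197.327      # hbar*c in MeV*fm (standard value, supplied here)
NEUTRON_MC2_MEV = 939.565    # neutron rest energy in MeV (standard value)


def reduced_wavelength_m(energy_ev):
    """Nonrelativistic reduced de Broglie wavelength of a neutron, in metres."""
    energy_mev = energy_ev * 1e-6
    momentum_c = math.sqrt(2.0 * energy_mev * NEUTRON_MC2_MEV)  # p*c in MeV
    return HBAR_C_MEV_FM / momentum_c * 1e-15                   # fm -> m


def one_over_v_ratio(e_slow_ev, e_fast_ev):
    """sigma(slow)/sigma(fast) if sigma is proportional to 1/v, v ~ sqrt(E)."""
    return math.sqrt(e_fast_ev / e_slow_ev)


fast = reduced_wavelength_m(5e6)                  # about 2e-15 m
slow = reduced_wavelength_m(1.0)                  # about 0.5e-11 m
enhancement = one_over_v_ratio(1.0 / 40.0, 1e6)   # thermal versus 1 MeV
```

If a cross section followed the 1/v law all the way from 1 MeV down to thermal energy, the thermal value would be several thousand times the fast value, in line with the "many thousand times" mentioned in the text.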


f. Resonance Absorption If the cross section for activation of, say, 
silver is very large for slow neutrons, the absorption coefficient in silver 
of such neutrons should also be high. That this is indeed the case was 
shown by Amaldi et al. by measuring the activation produced in a 
silver sample with various thicknesses of silver absorbers wrapped around 
it. For silver, the thickness required to reduce the activation to half 
value was about 1 mm, indicating an average cross section [Eq. (25.34)] 
of some hundreds of barns. It would thus appear that a relatively thin 
layer of silver should absorb many of the slow neutrons and greatly 
reduce the effect of the hydrogen thermalizer on all activations. Tests 
made with iodine as the detector showed, however, that this was not so: 
the activation of iodine was only moderately affected by the silver 
absorber, and although an iodine absorber would reduce the iodine activ- 
ity greatly, it had little effect on silver samples. The implication is that 
neither silver nor iodine follows strictly the 1/v law but that both have 
extraordinarily high absorptions for certain different neutron energies. 
The activation of silver must then be mainly due to a narrow band of 
neutron energies—for which its absorption is correspondingly high—and 
that of iodine is due to another band of a different energy. It was shown 
by Amaldi and Fermi that these so-called resonance-absorption bands 
were in some cases only a fraction of an electron volt wide. These were 
the resonances whose discovery led to the formulation of the liquid-drop 
model of the nucleus. 
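
The step from a half-value thickness to an average cross section can be made explicit. The sketch below assumes simple exponential attenuation, exp(−nσx), which is presumably the content of Eq. (25.34), together with handbook values for the density and atomic weight of silver; both are assumptions supplied here, not data from the text.

```python
import math

AVOGADRO = 6.022e23
SILVER_DENSITY = 10.5       # g/cm^3 (assumed handbook value)
SILVER_MOLAR_MASS = 107.9   # g/mol (assumed handbook value)


def cross_section_from_half_thickness(half_thickness_cm, density, molar_mass):
    """Average cross section in barns, assuming attenuation exp(-n*sigma*x)."""
    atoms_per_cm3 = AVOGADRO * density / molar_mass
    sigma_cm2 = math.log(2.0) / (atoms_per_cm3 * half_thickness_cm)
    return sigma_cm2 / 1e-24


sigma_silver = cross_section_from_half_thickness(
    0.1, SILVER_DENSITY, SILVER_MOLAR_MASS
)
```

With a half-value thickness of 1 mm this gives a figure of order 10² barns, the same order of magnitude as the estimate quoted in the text.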
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Our information on these low-energy-neutron resonances has enor- 
mously increased with the development of more refined techniques. One 
of these has been the neutron velocity selector, schematically illustrated 
in Fig. 25.22. The neutrons are produced in the reaction Be⁹(d,n)B¹⁰,
using deuterons accelerated by a cyclotron, and “thermalized” by a few
centimeters of paraffin. Slow neutrons pass through the material whose 
absorption is to be studied and to the detector some distance away. 
Velocity selection is accomplished by operating the cyclotron in such a 
way as to produce short bursts of neutrons and synchronizing the detector 
so that it counts only those neutrons whose time of flight lies in a selected 
narrow range. In a typical arrangement, the distance to the detector is
6.4 m; the time of flight for a 1-eV neutron is then 470 µs. With a time
resolution of 5 µs, energies as high as 10,000 eV can be selected.
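
The time-of-flight figures can be reproduced from the nonrelativistic relation E = ½Mv². The following is a minimal sketch; the neutron rest energy and the speed of light are standard constants supplied here, and the function names are illustrative.

```python
import math

C_M_PER_S = 2.99792458e8     # speed of light
NEUTRON_MC2_EV = 939.565e6   # neutron rest energy in eV


def time_of_flight_us(energy_ev, path_m):
    """Flight time in microseconds for a nonrelativistic neutron."""
    speed = C_M_PER_S * math.sqrt(2.0 * energy_ev / NEUTRON_MC2_EV)
    return path_m / speed * 1e6


def energy_from_time_ev(time_us, path_m):
    """Kinetic energy in eV corresponding to a given flight time."""
    speed = path_m / (time_us * 1e-6)
    return 0.5 * NEUTRON_MC2_EV * (speed / C_M_PER_S) ** 2


t_1ev = time_of_flight_us(1.0, 6.4)        # close to the 470 µs quoted
e_at_5us = energy_from_time_ev(5.0, 6.4)   # energy whose flight time is 5 µs
```

A flight time equal to the 5-µs resolution corresponds to an energy of several thousand electron volts over a 6.4-m path, which is why energies approaching 10,000 eV mark the practical upper limit of such an arrangement.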

Another instrument, with particular applicability to the low-energy 
region, is the crystal spectrometer. The de Broglie wavelengths of 
thermal neutrons are of the order of magnitude of the lattice spacings of 
crystals, and they are scattered by such a lattice according to the Bragg 
law. With the large neutron fluxes available from nuclear reactors, 
it is possible to obtain scattered neutron beams of acceptable intensity
and high homogeneity in energy. The sample is placed in the beam,
and its attenuation is measured as a function of the crystal angle. A 
curve for silver, showing two strong resonances, is reproduced in Fig. 
25.23. The close agreement obtained in the location of resonance ener- 
gies by the crystal and direct time-of-flight measurements provides an 
accurate check of the validity of the de Broglie relation for neutrons. 
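
The working principle of the crystal spectrometer, the de Broglie relation combined with the Bragg law nλ = 2d sin θ, can be sketched as follows. The lattice spacing of 2.0 Å used in the example is a hypothetical value chosen only for illustration; the physical constants are standard values supplied here.

```python
import math

PLANCK_H = 6.62607e-34        # J*s
NEUTRON_MASS = 1.674927e-27   # kg
JOULE_PER_EV = 1.602177e-19


def de_broglie_wavelength_angstrom(energy_ev):
    """lambda = h / sqrt(2*M*E), returned in angstroms."""
    momentum = math.sqrt(2.0 * NEUTRON_MASS * energy_ev * JOULE_PER_EV)
    return PLANCK_H / momentum * 1e10


def bragg_angle_deg(energy_ev, spacing_angstrom, order=1):
    """Glancing angle theta from n*lambda = 2*d*sin(theta)."""
    sine = order * de_broglie_wavelength_angstrom(energy_ev) / (2.0 * spacing_angstrom)
    if sine > 1.0:
        raise ValueError("no Bragg reflection for this energy, spacing and order")
    return math.degrees(math.asin(sine))


thermal_wavelength = de_broglie_wavelength_angstrom(0.025)
theta_thermal = bragg_angle_deg(0.025, 2.0)   # d = 2.0 Å is hypothetical
theta_1ev = bragg_angle_deg(1.0, 2.0)
```

Because the wavelength falls as 1/√E, higher-energy neutrons are reflected at smaller angles, so scanning the crystal angle scans the neutron energy.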
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Fig. 25.22 Neutron velocity selector. Neutrons produced in the Be target 
are slowed down in the paraffin block and selected according to the time 
required to traverse the distance to the detector. 
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Fig. 25.23 Neutron capture resonances observed in silver with a crystal 
spectrometer. [L. B. Borst and V. L. Sailor, Rev. Sci. Instr., 24:141 (1953) 
and R. E. Wood, Phys. Rev., 95:644 (1954).]


25.16 Energy Levels of Nuclei 


One important result of the study of nuclear reactions is the location and 
identification of the excited states of nuclei. In the atomic case, it was 
the study of the excited levels, as derived from spectra, which guided
the theoretical development of our present model of the atom. Of all 
the various classes of nuclear energy levels we discuss here only a few 
which have a particularly straightforward interpretation.


a. Mirror Nuclei In Sec. 25.11, evidence that the
p-p and n-n forces are equal was adduced from comparison
of mirror nuclei. From the arguments presented there, not
only the ground states but also the excited states of mirror nuclei should
agree in energy, once the electrostatic energy and the neutron-proton
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mass difference are taken into account. Furthermore, corresponding 
levels should agree in other properties, such as angular momentum and 
parity, if the hypothesis is correct. Excited levels of the mirror nuclides 
do agree remarkably well. As a single example, the energy levels of 
Li⁷ and Be⁷ are shown in Fig. 25.24. In this figure the two ground
states have been arbitrarily made to coincide in energy in order to exhibit 
the alignment of the higher levels. The agreement of the higher levels
is not quite exact, a small shift downward in the case of the nucleus of
higher Z being often observed.


b. Even Isobars More can be learned about nuclear forces from a 
study of the even isobars. If we consider ₄Be¹⁰ and ₅B¹⁰, for example,
as composed of a common core of four protons and four neutrons (again, 


Fig. 25.24 Energy levels of Li⁷
and Be⁷. The ground states are
arbitrarily adjusted to coincide.
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only for the purpose of counting the number of interactions of various 
types) plus two neutrons or a neutron-proton pair, respectively, we find 
that some of the n-n bonds in Be¹⁰ are replaced in B¹⁰ not by p-p bonds,
but by p-n bonds. If the nuclear binding energies of these two nuclides
are found to be equal, we have evidence for the far-reaching assertion
that, at least in light nuclei, the interaction between unlike nucleons is
the same as that between like nucleons, i.e., that the forces are not only
symmetric in the charge state (see Sec. 25.6c) of nucleons but are
charge-independent.

A complication arises, however, in the comparison of Be¹⁰ and B¹⁰,
because certain p-n interactions in the latter are not permitted in the
former when the proton is replaced by a neutron. A proton and neutron
in the same quantum level may have either parallel or antiparallel spins;
two neutrons, on the other hand, can only have antiparallel spins. If
the nuclear force is greater when the spins are parallel, as would appear
to be the case from the neutron-proton scattering experiments (Sec.
25.6b), the ground state of B¹⁰ may have a character which is forbidden
in Be¹⁰. On the other hand, there should exist an excited state of B¹⁰ in
which the spins of the extra proton-neutron pair are antiparallel, and
this state should have the same energy as the ground state of Be¹⁰.
The same argument applies to the mirror of Be¹⁰, viz., ₆C¹⁰, which has
two extra protons. All the levels of these two nuclides should have
counterparts in B¹⁰, but B¹⁰ may have levels which are not possible in
either C¹⁰ or Be¹⁰.

The known low-lying energy levels of Be¹⁰, B¹⁰, and C¹⁰ are shown
in Fig. 25.25, in which a correction for the electrostatic energies and
n − H¹ mass difference has been applied, so that the nuclear parts of
the binding energies are shown relative to the ground state of B¹⁰. Be¹⁰
has thus been shifted upward by 1.44 MeV and C¹⁰ downward by 2.05
MeV. The levels of B¹⁰, C¹⁰, and Be¹⁰, which are connected by dashed
lines, are found not only to agree well in energy but also to exhibit other
properties which confirm their close genetic relation.

In general, the greater the difference between the numbers of pro- 
tons and neutrons in a nucleus with a given total number of particles, the 
more restrictions placed by the Pauli principle on the number of possible 
interactions and the smaller the number of levels as compared with 
configurations comprising equal numbers of neutrons and protons. In 
a formal way, this property of a nuclear configuration may be expressed¹


¹ E. P. Wigner, Phys. Rev., 51:106 (1937). In the wave-mechanical treatment,
the isotopic spin plays a role quite analogous to the ordinary spin. Just as the two
spin states of a nucleon can be represented by components of a spin vector, the two
charge states of a nucleon, i.e., neutron and proton, may be represented by components
of the isotopic-spin vector.

Fig. 25.25 Energy levels of Be¹⁰, B¹⁰, and C¹⁰. The
ground states of Be¹⁰ and C¹⁰ have been shifted by the
electrostatic energy and the n − H¹ mass difference.
Corresponding levels are connected by dashed lines.


by the so-called isotopic-spin projection T_ζ = ½(N − Z). The greater
the value of T_ζ, the fewer the permitted levels, and any level which is
possible for a nuclide with a given T_ζ will also appear in all nuclides with
lower absolute values of T_ζ. The various common levels are said to
represent components of an isotopic-spin multiplet characterized by a
quantum number T equal to the maximum value of T_ζ. Thus all the
states of B¹⁰ have T_ζ = 0; the ground state and first excited state do
not occur in Be¹⁰ or C¹⁰ (for which T_ζ = 1 and −1) and are therefore
isotopic-spin singlets: T = 0, T_ζ = 0. The ground states of Be¹⁰ and
C¹⁰ and the 1.74-MeV state of B¹⁰ comprise an isotopic-spin triplet:
T = 1, T_ζ = 1, 0, −1. In a sense the various isobars comprise a kind
of fine structure in nuclear spectroscopy much as the different components
of ordinary spin comprise a fine structure in atomic energy levels. It
is thus again emphasized that it is the total number of particles, and not
the somewhat incidental nuclear charge, which principally determines
the behavior of a nucleus.
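
The bookkeeping behind the singlet and triplet assignments can be sketched in a few lines. The projection ½(N − Z) follows the definition in the text; the dictionary of A = 10 isobars and the helper names are illustrative.

```python
from fractions import Fraction

# (N, Z) for the three A = 10 isobars discussed in the text
ISOBARS_A10 = {"Be10": (6, 4), "B10": (5, 5), "C10": (4, 6)}


def isospin_projection(n_neutrons, z_protons):
    """Isotopic-spin projection (N - Z)/2."""
    return Fraction(n_neutrons - z_protons, 2)


def multiplet_members(total_t, isobars):
    """Isobars in which a level of total isotopic spin T can appear:
    those whose projection does not exceed T in absolute value."""
    return sorted(
        name
        for name, (n, z) in isobars.items()
        if abs(isospin_projection(n, z)) <= total_t
    )


singlet = multiplet_members(0, ISOBARS_A10)   # T = 0 levels
triplet = multiplet_members(1, ISOBARS_A10)   # T = 1 levels
```

A T = 0 level can occur only in B¹⁰, while a T = 1 level has a counterpart in each of the three isobars, which is the pattern connected by dashed lines in Fig. 25.25.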
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25.17 The Shell Model 


We have discussed in previous sections the experiments which led to 
the development of the liquid-drop nucleus model for nuclear reactions. 
It is, as we have seen, an essential feature of this model that the nucleons 
are regarded as strongly interacting within the nucleus, and any picture 
envisaging separate particles moving independently of one another in 
an average central field of force would give results at variance with 
many of the facts of nuclear reactions. There has, however, been a 
simultaneous development in the opposite direction, in which it has 
been made increasingly clear that for some properties of nuclei the 
independent-particle model gives surprisingly good results. This development
is closely connected with the discovery of the so-called magic
numbers, configurations of neutrons and protons which exhibit certain 
rather special properties. 


a. Periodicities in Nuclear Species A conspicuous feature of atomic
structure is the marked periodicity in chemical properties. We saw 
earlier how this phenomenon found a straightforward explanation in 
terms of the electronic shells and how the closing of a shell or subshell 
made itself evident in a strong increase in the ionization potential 
(Fig. 15.3). It is natural to search for analogous effects in nuclei in 
terms of specially stable nuclear species. Among the lightest nuclei, 
the peculiar stability of nuclei of the type A = 2Z = 4n, where n is an 
integer, was recognized at an early stage. This stability is exhibited
in the pronounced peaks of the binding-energy curve of Fig. 25.3. This
observation, and the fact that α particles are emitted by many of the
naturally radioactive substances, led for a time to the belief that the
α particle is a fundamental building unit of nuclei. That this is not so
in any literal sense is evident from the fact that heavier nuclei contain 
more neutrons than protons and the extra neutrons are just as tightly 
bound as the protons. Among light nuclei, the preference for equal, 
even numbers of protons and neutrons is accounted for by the high 
symmetry of such configurations (Sec. 25.6). 

The suggestion that there are other shell-structure effects was first
made in 1932 by Bartlett, who observed that the pattern of composition
of the naturally occurring isotopes underwent a change at O¹⁶ and again
at A³⁶. Between He⁴ and O¹⁶, all the stable nuclides belong to the
sequence He⁴ + n + p + n + p + ···. From O¹⁶ to A³⁶, the
sequence becomes O¹⁶ + n + n + p + p + n + n + ···. It was
suggested that these changes might be associated with the filling of 
shells of neutrons and protons of given orbital angular momentum. In 
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the first s shell (orbital angular momentum = 0) there could be just 
two neutrons and two protons without violating the Pauli exclusion 
principle. The next, the “p” shell, with orbital angular momentum 1, 
has room for six neutrons and six protons and would be complete at O¹⁶.
The “d” shell, with 10 neutrons and 10 protons, would be filled at A³⁶
(Z = 18). In general, as in atoms, the number of particles of each
kind in a shell is 2(2l + 1). Evidence for the existence of other shells
was brought forward in 1933 and 1934 by Elsasser and by Guggenheimer,
who found indications of particular stability for several values
of neutron and proton numbers, among them Z = 20, N or Z = 50, 82,
and N = 126. Unfortunately the data available were insufficiently 
accurate, and the argument was weakened to some extent by the inclu- 
sion of several numbers which turned out not to be “magic” at all. 


b. Central-field Approximation Attempts were made to account for 
the existence of shells by calculations based on the Hartree central-field 


residual electrostatic repulsion or the spin-orbit effect, are taken into 
account by perturbation methods. 

As a first approximation to the central field in the nucleus, a simple
square-well potential is assumed. Since the potential is solely a function
of radius, the Schrödinger equation may be separated into radial
and angular parts, and the angular-dependent solutions are identical
with those of the hydrogen atom. There exist a number of stationary
solutions, depending upon the depth and width chosen for the potential,
and these are characterized by the quantum numbers ν, l, m_l, where l
and m_l refer to the orbital angular momentum and its projection on any
chosen axis and ν is the radial quantum number, so defined that the
number of nodes in the radial solution is given by¹ ν (excluding nodes
at r = 0, but including r = ∞). The energy of a given stationary state
depends explicitly upon both ν and l, and the level order is 1s, 1p, 1d,
2s, 1f, 2p, . . . , as shown in the left-hand side of Fig. 25.26. The
actual energies may be different for neutrons and protons, particularly
in heavier nuclei, where the electrostatic effect is important. In any

¹ This ν is not the same as the quantum number n used i
In another commonly used notation, however, the solutio
total quantum number n (our ν + l), and l; in this notation, the level order is 1s, 2p,
3d, 2s, 4f, 3p, . . . .


Fig. 25.26 Spacing and order of 
levels in an infinitely deep square 
potential well (left) and on the Mayer- 
Jensen potential (right). The total 
number of particles at each shell 
closing is indicated at the right. 


case, with 2(2l + 1) particles in a “shell,” closed shells should occur
for 2, 8, 18, 20, 34, 40, 58, . . . neutrons, and similarly for protons.
The numbers 50, 82, and 126 do not appear to occur in any natural
way, although they can be produced by arbitrarily omitting certain
terms, for example 2s and 2p.
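
The shell closures of the square well follow from nothing more than the level order and the capacity 2(2l + 1) of each level. A minimal sketch of that counting is given below; the level order is the one stated in the text, and the helper names are illustrative.

```python
from itertools import accumulate

L_VALUES = {"s": 0, "p": 1, "d": 2, "f": 3, "g": 4}
SQUARE_WELL_ORDER = ["1s", "1p", "1d", "2s", "1f", "2p", "1g"]


def capacity(level):
    """Number of like nucleons in a level: 2(2l + 1)."""
    return 2 * (2 * L_VALUES[level[-1]] + 1)


def closures(order):
    """Running total of nucleons as each level in the list is filled."""
    return list(accumulate(capacity(level) for level in order))


square_well = closures(SQUARE_WELL_ORDER)
without_2s_2p = closures(
    [lvl for lvl in SQUARE_WELL_ORDER if lvl not in ("2s", "2p")]
)
```

The first list reproduces 2, 8, 18, 20, 34, 40, 58. Dropping the 2s and 2p levels makes 50 appear, but only through the arbitrary omission that the text describes.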


c. Spin-orbit Coupling At about this stage of development, the inde- 
pendent-particle model fell into disfavor, partly because the experimental 
evidence for shells among the heavier elements was something less 


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than clear and partly because the conspicuous success of the liquid-drop 
model made it seem that any picture based on a fixed central potential 
was doomed to failure. Evidence for periodicities in nuclear structure 
continued to accumulate, however, and the question was reopened by 
M. G. Mayer in 1948. In a survey of existing information on the nat- 
urally occurring nuclides, she noted that species with neutron or proton 
numbers 50, 82, or 126 are distinctly more abundant in the earth’s crust 
than their immediate neighbors, that there occur particularly large 
numbers of isotones (nuclides with the same neutron number N) for
N = 50 and 82, and particularly large numbers of isotopes for Z = 20 
and 50. With these data, and certain others which we shall not enumer- 
ate here, she established the magic character of the numbers 50, 82, 
and 126 in addition to the well-recognized ones 2, 8, and 20. Supporting

Fig. 25.27 Magnified portion of curve showing binding energy per nucleon. [H. E.
Duckworth, Nature, 170:158 (1952).]

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evidence for the special stability of magic-number nuclides, obtained in 
direct mass-spectrograph measurements, is shown in Fig. 25.27, in which 
the binding energy per nucleon (see Fig. 25.3) is exhibited on a magnified 
scale for mass numbers 40 to 240. Changes in slope occur at Ni⁶² (28 protons),
Sr⁸⁸ (50 neutrons), Sn??? (50 protons), Ce¹⁴⁰ (82 neutrons), and
Pb²⁰⁸ (82 protons, 126 neutrons).

In 1950, Mayer, and independently Haxel, Jensen, and Suess, proposed
a simple independent-particle model which led to the observed
magic numbers in straightforward fashion. Starting with a potential
differing only slightly from a square well, they postulated a strong
spin-orbit coupling, proportional to l·s, which would split the zero-order
states obtained from the central potential. Thus the 1p state
would be split into two: 1p₃/₂ and 1p₁/₂ (the subscript gives the total angular
momentum for a single particle); and 1d would become 1d₅/₂ and 1d₃/₂; the
s state of course remains single. If the spin-orbit splitting is of the same
order of magnitude as the separation between various l terms, the grouping
of states may be considerably modified. The grouping which they
derived is shown in Table 25.3, and the spacing of the levels is shown on
the right-hand side of Fig. 25.26. On the basis of strong spin-orbit splitting
for large l values, the numbers 50, 82, and 126 become magic in a
straightforward manner. Among the light elements where the l values
are low, in the 1s and 1p shells, it is assumed that the spin-orbit splitting
is not so great, and the level order and magic numbers (2, 8) are those of
the square well. In the next shell, the close proximity of the 1d and 2s
states is a consequence of the assumed potential, and the spin-orbit
effect again plays a minor role. It is possible that the spin-orbit effect
may lower the 1f₇/₂ state in the next group sufficiently to make 28 a magic
number; there is some indication for this in the relative stability of ₂₈Ni⁶².
The next shell contains 1f and 2p states closely grouped, and the spin-orbit
effect depresses the 1g₉/₂ state, so that this term appears in the shell
which closes at 50 nucleons. By the same procedure, shells at 82 and
126 nucleons are accounted for.

Table 25.3 Grouping of neutron or proton states
in the shell model with spin-orbit coupling

State designation            1s₁/₂   1p₃/₂   1d₅/₂   1f₇/₂   1g₇/₂    1h₉/₂
                                     1p₁/₂   1d₃/₂   1f₅/₂   2d₅/₂    2f₇/₂
                                             2s₁/₂   2p₃/₂   2d₃/₂    2f₅/₂
                                                     2p₁/₂   3s₁/₂    3p₃/₂
                                                     1g₉/₂   1h₁₁/₂   3p₁/₂
                                                                      1i₁₃/₂

Total neutrons or protons    2       8       20      50      82       126
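
With spin-orbit splitting each state of total angular momentum j holds 2j + 1 like nucleons, and the magic numbers follow by adding up the states in each group. The sketch below uses the standard Mayer-Jensen grouping of states; the grouping is supplied here as an assumption consistent with the totals of Table 25.3, and the order of states within a group does not affect the sums.

```python
from fractions import Fraction
from itertools import accumulate

# Assumed grouping of states into shells (standard shell-model assignment)
SHELLS = [
    ["1s1/2"],
    ["1p3/2", "1p1/2"],
    ["1d5/2", "1d3/2", "2s1/2"],
    ["1f7/2", "1f5/2", "2p3/2", "2p1/2", "1g9/2"],
    ["1g7/2", "2d5/2", "2d3/2", "3s1/2", "1h11/2"],
    ["1h9/2", "2f7/2", "2f5/2", "3p3/2", "3p1/2", "1i13/2"],
]


def occupancy(state):
    """2j + 1 like nucleons in a state labelled e.g. '1g9/2'."""
    j = Fraction(state[2:])        # text after the 'nl' prefix, e.g. '9/2'
    return int(2 * j + 1)


def magic_numbers(shells):
    """Cumulative nucleon count at each shell closing."""
    return list(accumulate(sum(occupancy(s) for s in shell) for shell in shells))


closings = magic_numbers(SHELLS)
after_1f72 = closings[2] + occupancy("1f7/2")   # filling 1f7/2 alone after 20
```

The cumulative totals reproduce 2, 8, 20, 50, 82, 126, and filling the 1f₇/₂ state alone after the shell at 20 gives 28, the number for which the text notes some additional evidence.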


d. Single-particle Model To make more specific comparisons between 
the predictions of the independent-particle model and experiment, it is 
necessary to make some further assumptions. The most obvious of 
these, which is common to all models and which follows directly from the 
Pauli principle, is that the nucleons of a closed shell combine their spins 
and magnetic moments to give a resultant of zero. In the light nuclei 
He⁴ and O¹⁶, where N = Z, and in the doubly magic nucleus ₈₂Pb²⁰⁸,
where both N and Z are magic, the ground-state angular momentum is
zero, but, since this is apparently a general characteristic of nuclei
with even N and even Z, the point is not strong. As regards the next
assumption, two extremes are possible: in the so-called single-particle
model, it is assumed that all like particles, including those outside closed
shells, couple together in pairs to give angular momentum zero. The
angular momentum and magnetic moment of all even-even nuclei would
then be zero, as observed, and that of even-odd (Z even, N odd) or
odd-even nuclei would always be that of the odd nucleon. The single-particle
model works surprisingly well in accounting for the observed
angular momenta. According to this model, in the 1p shell (He⁴ to
O¹⁶), the total angular momenta expected are 3/2, 3/2, 3/2, 3/2, 1/2, 1/2 for A = 5, 7,
9, 11, 13, and 15, just as observed. Just above O¹⁶ there is some uncertainty
as to whether the s₁/₂ or d₅/₂ “orbits” should fill first: for O¹⁷ and F¹⁹
the observed values of I are 5/2 and 1/2. For Ne²¹ and Na²³, on the other
hand, I = 3/2, which is a little difficult to account for. The remainder
of the shell fills regularly to ₁₉K³⁹ (odd protons) or ₁₆S³⁵ (odd neutrons).
In the next shell, the observed and predicted spins agree well in the main, 
but a few troublesome exceptions occur. 
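
The single-particle prediction for the 1p shell can be written out as a small filling routine. The sketch below assumes the level order 1s₁/₂, 1p₃/₂, 1p₁/₂ and considers only odd-A nuclides whose neutron and proton numbers differ by one, so that the odd group contains either (A − 1)/2 or (A + 1)/2 nucleons; the function names are illustrative.

```python
from fractions import Fraction

# (state, capacity 2j + 1) in the assumed filling order up to O-16
FILLING_ORDER = [("1s1/2", 2), ("1p3/2", 4), ("1p1/2", 2)]


def odd_nucleon_spin(odd_count):
    """Total angular momentum j of the last (unpaired) nucleon."""
    remaining = odd_count
    for state, cap in FILLING_ORDER:
        if remaining <= cap:
            return Fraction(state[2:])
        remaining -= cap
    raise ValueError("nucleon number lies beyond the 1p shell")


def predicted_spin(mass_number):
    """Ground-state spin of an odd-A nuclide with |N - Z| = 1 (assumed)."""
    low, high = (mass_number - 1) // 2, (mass_number + 1) // 2
    odd_count = low if low % 2 else high
    return odd_nucleon_spin(odd_count)


spins = [predicted_spin(a) for a in (5, 7, 9, 11, 13, 15)]
```

The routine returns 3/2 for A = 5, 7, 9, 11 and 1/2 for A = 13 and 15, the sequence quoted in the text.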

The single-particle model has also had some measure of success in 
accounting for the magnetic moments of odd-Z, even-N and even-Z,
odd-N nuclei. It was in fact suggested as early as 1937, by Schmidt,
that the magnetic moments of nuclei might be attributable to the motion
of a single nucleon, with the contribution from its intrinsic spin moment
and that from its orbital motion coupled in the same way as the corresponding
electronic moments in an atom. Thus the total angular
momentum I is considered to be the vector sum of the intrinsic spin of
the nucleon s and an orbital angular momentum l. The magnetic
moment so produced depends upon the value of l for the odd nucleon,
and this value is so chosen as to give the closest possible agreement with 
the observed moment. Although the numerical agreement between pre- 
dicted and observed moments is not very close, it is usually sufficiently 
good to permit determination of a unique l value and this value turns out 
in most cases to be just that predicted by the shell model. 
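
The procedure described above, computing the single-nucleon moment for each possible l and keeping the one closest to experiment, can be sketched with the usual Schmidt formulas. These formulas and the free-nucleon g factors are not given in the text; they are standard results supplied here as assumptions, with moments in nuclear magnetons. The "observed" value in the example is illustrative.

```python
from fractions import Fraction

# Free-nucleon g factors (standard values, supplied here)
G_SPIN = {"proton": 5.586, "neutron": -3.826}
G_ORBITAL = {"proton": 1.0, "neutron": 0.0}


def schmidt_moment(nucleon, l, j):
    """Single-particle magnetic moment for j = l + 1/2 or j = l - 1/2."""
    jf = float(Fraction(j))
    g_s, g_l = G_SPIN[nucleon], G_ORBITAL[nucleon]
    if abs(jf - (l + 0.5)) < 1e-9:
        return g_l * (jf - 0.5) + 0.5 * g_s
    if abs(jf - (l - 0.5)) < 1e-9:
        return jf / (jf + 1.0) * (g_l * (jf + 1.5) - 0.5 * g_s)
    raise ValueError("j must equal l + 1/2 or l - 1/2")


def best_l(nucleon, j, observed_moment):
    """Pick the l (j - 1/2 or j + 1/2) whose moment lies nearest experiment."""
    jf = float(Fraction(j))
    candidates = [int(jf - 0.5), int(jf + 0.5)]
    return min(
        candidates,
        key=lambda l: abs(schmidt_moment(nucleon, l, j) - observed_moment),
    )


mu_d52_neutron = schmidt_moment("neutron", 2, "5/2")
mu_p12_proton = schmidt_moment("proton", 1, "1/2")
chosen_l = best_l("neutron", "5/2", -1.9)   # -1.9 is an illustrative input
```

For an odd neutron with I = 5/2 and a moment near −1.9 nuclear magnetons the routine selects l = 2, that is, a d₅/₂ assignment.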
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25.18 The Collective Model 


The curious dichotomy of the behavior of the nucleus as a liquid drop 
in some experiments and as an open structure with well-defined shells
in others has occasioned no little perplexity. In the collective model 
it is assumed that the “core” of the nucleus, comprising the particles 
in closed shells, is not completely inert but can be distorted by the extra 
nucleons. For nuclei with closed shells there is no angular momentum,
and the equilibrium shape is expected to be spherical. However, if 
many nucleons exist outside closed shells, the interaction may be rela- 
tively strong, and the motion of the core so produced influences even 
the ground-state features. The nucleus then exhibits a permanent 
deformation arising both from the “loose” nucleons and from the nuclear 
core distorted by the interaction with them. As a result of the deformation,
the nucleus may have a substantial quadrupole moment, a feature
prominent in the rare-earth region. There is every evidence that 
nuclear matter is easily deformed, vastly easier, for example, than the 
charge cloud of an atom. This is not surprising when one remembers 
that the dominant force on an atomic electron is the coulomb field of 
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Fig. 25.28 (a) Theoretical levels for 
rotational band in a deformed even-A, 
even-Z nucleus. (b) Experimental 
levels for U²³⁸ obtained by coulomb
excitation. [Stephens, Diamond, and
Perlman, Phys. Rev. Letters, 3:435
(1959).]
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the nucleus, which is spherically symmetric. For nucleons, on the other 
hand, the strongest interactions are with neighboring nucleons. 

In molecular spectra one finds rotational, vibrational, and elec- 
tronic excitation levels. In distorted nuclei one finds analogous levels 
assigned to rotation, vibration, and nucleon excitation. The nuclear 
rotational motion is not that of a rigid body but resembles a hydro- 
dynamical wave moving around the nucleus; it is a rotation of the shape 
of the deformed surface which bounds the A nucleons. The resulting
energy-level pattern is very similar to that for the rotational states of 
a molecule (Fig. 25.28). Not only does the deformed shape rotate, but 
there is also a vibration about the equilibrium shape. By choosing a 
suitable interaction between the motion of a single particle and collec- 
tive motions of the nucleus as a whole it is possible to obtain a descrip- 
tion of nuclear reactions which combines the essential features of both 
the independent-particle and the liquid-drop model. This approach has
enjoyed a striking success in accounting for observed regularities in 
neutron-scattering cross sections at intermediate energies in medium 
and heavy nuclei. 
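
The resemblance to molecular rotation can be illustrated with the rigid-rotor level formula E(I) = (ħ²/2ℑ) I(I + 1). The formula, and the restriction to even I in the ground-state band of an even-even nucleus, are standard results supplied here as assumptions rather than statements taken from the text; the moment of inertia ℑ is left as an arbitrary scale factor.

```python
def rotational_energy(spin, hbar2_over_2i=1.0):
    """E(I) = (hbar^2 / 2*inertia) * I * (I + 1), in units of the prefactor."""
    return hbar2_over_2i * spin * (spin + 1)


def band_ratios(max_spin=8):
    """Energies of the I = 2, 4, 6, ... levels relative to the I = 2 level."""
    first_excited = rotational_energy(2)
    return {
        spin: rotational_energy(spin) / first_excited
        for spin in range(2, max_spin + 1, 2)
    }


ratios = band_ratios()
```

Because the scale factor cancels, the ratios 1 : 10/3 : 7 : 12 for I = 2, 4, 6, 8 are independent of the moment of inertia, and such ratios are the signature looked for when experimental levels like those of Fig. 25.28 are compared with the theoretical band.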


Nuclear Fission and Nuclear Energy 


25.19 Discovery of Fission 


a. Transuranic Elements The experiments which led to the discovery
of fission began in 1934, in connection with investigations by
Fermi and his collaborators of neutron-induced radioactivity. The
striking ability of neutrons to produce β⁻-unstable isotopes from many
naturally occurring elements suggested to Fermi that it might be pos- 
sible by this means to extend the periodic table to higher atomic numbers 
by neutron-induced processes in uranium. It was very soon shown by 
experiments in Rome that uranium samples can in fact be made radio- 
active by neutron bombardment, and, furthermore, several different 
materials are produced. Attempts at chemical identification soon 
revealed an embarrassing wealth of activities, which could be accounted 
for by known types of nuclear reactions only with considerable difficulty. 
Thus, it was concluded by Hahn, Meitner, and Strassmann in 1937 that 
at least three separate chains of transitions must be assumed, all starting 
with neutron capture in the most abundant isotope of uranium, ₉₂U²³⁸,
and involving several long-lived excited states of elements 93, 94, and
perhaps 95. More problems arose with the discovery by Irène Joliot-Curie
and Savitch in 1938 of active materials which coprecipitated with
lanthanum (Z = 57) and appeared to be “almost” inseparable from this
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element by chemical means. Hahn and Strassmann showed that there 
are several activities which coprecipitate with barium and whose daughter 
products behave like lanthanum. Since radium and barium have almost 
identical chemical properties and can be separated only with extreme 
difficulty, it was concluded that isotopes of radium (Z = 88) must be 
involved, formed by an (n,2α) process from uranium. However, it was
difficult to understand how an (n,2α) process could occur in such a
heavy nuclide and especially in one so relatively stable as U²³⁸. To
establish the identification beyond doubt, Hahn and Strassmann set
out to make the separation of the supposed radium isotopes from barium,
using ThX and MsTh₁ (isotopes of radium) as carriers. To their
surprise, they found that the activity followed the barium rather than 
the radium carriers and hence must be identified as barium itself! They 
soon found other activities which could be identified as isotopic with 
barium and lanthanum, and they were led to suggest that several of the 
supposed transuranic elements might actually be isotopes of other elements
considerably lighter than uranium, possibly produced by an
“explosion” of the uranium nucleus. 


b. Physical Demonstration of Fission A physical picture of the mech- 
anism by which a uranium nucleus could break up into two halves was 
first proposed by Meitner and Frisch, within a few weeks after Hahn and 
Strassmann’s publication of the chemical evidence. Basing their argu- 
ment on the liquid-drop model of the nucleus, they pointed out that the 
binding forces acting on particles at the nuclear surface and producing 
an effective surface tension might be overcome by the electrostatic repul- 
sion when the charge is sufficiently high. From a detailed model worked 
out by Bohr and Kalckar, they estimated that at Z ~ 100, only a small 
amount of extra energy—such as that brought in by a single neutron— 
might suffice to overcome the surface tension and cause the nucleus to 
break up into two roughly equal fragments. An estimate of the change 
in energy indicated that 100 to 200 MeV might be released in the process, 
an amount which roughly corresponds to the difference in (total) binding 
energy of U and of two elements near the middle of the atomic table. 
The elements thus formed would be quite unstable, because of their great 
neutron excess, and would decay through several stages before reaching
stability.

Verification of this hypothesis came almost immediately in the
demonstration by Frisch, in Copenhagen, that ionizing fragments having 
energies of over 70 MeV were produced in a uranium-lined ionization 
chamber under neutron bombardment, and by Joliot-Curie, in Paris, 
that the active materials had sufficient kinetic energy to travel nearly 
3 cm in air; the energy he estimated to be of the order of 250 MeV.
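
The statement that the energy release corresponds to a difference in total binding energy can be made concrete with rough numbers. The binding energies per nucleon used below, about 7.6 MeV for uranium and about 8.4 MeV for nuclides near the middle of the table, are approximate values assumed for illustration (of the kind read from a curve such as Fig. 25.3), not figures taken from the text.

```python
def fission_energy_release_mev(mass_number, b_heavy_mev, b_fragments_mev):
    """Energy released when A nucleons go from b_heavy to b_fragments
    MeV of binding per nucleon (emitted neutrons are ignored)."""
    return mass_number * (b_fragments_mev - b_heavy_mev)


# Assumed, approximate binding energies per nucleon
B_URANIUM = 7.6      # MeV per nucleon (assumed)
B_MID_TABLE = 8.4    # MeV per nucleon (assumed)

release = fission_energy_release_mev(238, B_URANIUM, B_MID_TABLE)
```

A gain of somewhat under 1 MeV per nucleon over some 240 nucleons gives a release of the order of 200 MeV, the scale estimated by Meitner and Frisch.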
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The announcement of the results of Meitner and Frisch by Bohr at a 
conference in 1939 led to an intense activity among experimentalists 
in this country. Within a matter of days, the existence of heavily 
ionizing particles with enormous kinetic energies was confirmed in 
several laboratories, and many new and important discoveries were 
made in the course of the next few months. By mid-1939 the following 
facts were well established: 


1. Fission occurs in both uranium and thorium under fast-neutron bombardment;
in uranium, thermal neutrons are also effective, the cross section following
the 1/v law.


2. U²³⁹, with a half-life of 23 min, is formed in slow-neutron bombardment of
uranium, the cross section exhibiting one or more strong resonances in the
range 10 to 25 eV.


3. The fragments resulting from uranium fission occur in two broad groups, 
with energies around 60 and 100 MeV. The one group comprises fragments 
with mass numbers near 140, the other, near 95; symmetrical disintegrations, 
producing nuclides with mass numbers 115 to 120, seldom occur. 


4. In about 1 percent of the fissions, delayed neutrons, presumably associated 
with the decay of one of the fragments, are emitted. 


5. A small number of neutrons—between one and three—is emitted simul- 
taneously with the fission. 


25.20 Theory of Fission 


a. Mechanism of Fission The theory of the fission process was worked 
out in detail by Bohr and Wheeler. In accordance with the common 
treatment of nuclear reactions (see Sec. 25.14), they assumed the process 
to occur in two distinct stages: the formation of an excited compound 
nucleus by capture of a neutron, and the subsequent decay of the sys- 
tem in one of several possible ways. In the compound nucleus, the 


ay well occur that disintegrations 
tions can compete favorably with

packed, so that the system is essentially incompressible. As a first
approximation, then, the compound nucleus formed in the capture of a
neutron by a heavy element is regarded as a drop of incompressible fluid.
The simplest distortion possible for a fluid drop is that in which the drop 
elongates on some axis and assumes an ellipsoidal shape. The elonga- 
tion results in an increase of surface area, and so the surface-tension 
forces tend to restore it to a spherical shape. Given a sufficient energy, 
it is possible to produce a distortion so extreme that the drop divides 
into two nearly equal spherical parts. The energy required for this 
can be shown to be 0.26 of the total surface energy. If the effect of a
uniformly distributed electric charge on the drop is considered, it is
evident that the elongation decreases the electrostatic energy (since the
charges are farther apart), so that the electrostatic forces oppose the
surface-tension forces and less energy is required to split the drop.

In the nuclear case, too, it is appropriate to think of a surface-tension
effect, since particles near the surface have fewer neighbors than those
in the interior and hence contribute less to the total binding. In fact,
in the extreme liquid-drop model, it is just this effect which accounts
for the general trend toward low binding energy per particle in the light
nuclei (Fig. 25.3), where the surface-to-volume ratio is high. The surface
area of a nucleus, and hence its surface energy, is proportional to A^(2/3),
where A is the mass number. The electrostatic energy, on the other
hand, is proportional to Z²/A^(1/3) (Sec. 25.6), and the ratio Z²/A is thus a
measure of the relative importance of the electrostatic and surface 
effects. Above a certain critical value for this ratio, a nucleus is unstable 
with respect to arbitrarily small distortions and will elongate more and 
more until it breaks into two roughly equal fragments. The magnitude
of this critical ratio is estimated to be

(Z²/A)_limit = 45, about 25 percent higher than the value for ₉₂U²³⁸.
Thus it may be expected that elements of much higher atomic number
than uranium are unstable to spontaneous fission.
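
As a quick numerical check of the liquid-drop argument, the short Python sketch below evaluates Z²/A for U²³⁸ and compares it with the limiting value of 45 quoted above. The function name `fissility` is ours, introduced only for illustration.

```python
# Compare Z^2/A for uranium-238 with the limiting value quoted in the text.
Z2_OVER_A_LIMIT = 45.0  # limiting value of Z^2/A, taken from the text


def fissility(z: int, a: int) -> float:
    """Return Z^2/A, the ratio that measures electrostatic against surface effects."""
    return z * z / a


u238 = fissility(92, 238)
# Fractional amount by which the limiting value exceeds the U-238 value.
excess = Z2_OVER_A_LIMIT / u238 - 1.0

print(f"Z^2/A for U-238 = {u238:.1f}")
print(f"limit exceeds this by {100 * excess:.0f} percent")
```

The ratio for U²³⁸ comes out near 35.6, so the limit lies roughly a quarter higher, in line with the figure quoted.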


b. Fission Threshold For nuclei with less than the critical value of
Z²/A, a finite distortion is required to overcome the surface tension, and
the amount of excess energy needed increases as the ratio Z²/A decreases.

The question whether fission can result from the capture of a neutron
by a given nucleus then depends upon whether the energy brought in by
the neutron (binding plus kinetic energy) is sufficient to produce the
necessary distortion of the compound system. This energy turns out to be
about 6 MeV for the nucleus U²³⁹, formed by capture of neutrons in the
most abundant isotope of uranium, U²³⁸, and about 5.3 MeV for U²³⁶,
formed from U²³⁵. The binding energy of a neutron in U²³⁹ is estimated
to be 5.2 MeV, so that neutrons of about 1 MeV kinetic energy are
required to induce fission of U²³⁸. Because U²³⁶ has the favored even-Z,
even-N character, the binding energy is nearly 1 MeV higher in this
nucleus and exceeds the energy required for fission. Thus fission by
thermal neutrons is possible in U²³⁵. The thermal-neutron fission
observed in natural uranium is due to this isotope, which has a
relative abundance of about 1 part in 140.
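
The threshold argument reduces to a subtraction, which the following Python sketch makes explicit. The energies are those quoted above; the value 5.2 + 1.0 MeV for the neutron binding energy in U²³⁶ is our round-number reading of "nearly 1 MeV higher", and the dictionary and function names are illustrative only.

```python
# Energies in MeV, as quoted in the text.
DISTORTION_ENERGY = {"U-239": 6.0, "U-236": 5.3}  # energy needed to reach fission
NEUTRON_BINDING = {
    "U-239": 5.2,        # estimated binding energy of the captured neutron
    "U-236": 5.2 + 1.0,  # "nearly 1 MeV higher"; 1.0 is an assumed round number
}


def kinetic_energy_needed(compound: str) -> float:
    """Kinetic energy the neutron must supply; zero if the binding energy alone suffices."""
    return max(0.0, DISTORTION_ENERGY[compound] - NEUTRON_BINDING[compound])


for compound in ("U-239", "U-236"):
    print(compound, round(kinetic_energy_needed(compound), 2), "MeV")
```

With these inputs the compound nucleus U²³⁹ needs somewhat under 1 MeV of neutron kinetic energy, while U²³⁶ needs none, which is the distinction between fast-neutron and thermal-neutron fission drawn above.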




c. Energy Release The total energy released in fission of a uranium
nucleus into two equal fragments can be estimated from the binding-energy
curve of Fig. 25.3; near A = 120, the binding energy per particle
is 8.5 MeV, while at A = 240, it is 7.6 MeV. Thus there is available
0.9 MeV/nucleon, or 216 MeV. Fission at low excitation energies is
rarely symmetric. For the most common type of division, with mass
ratio about 1:1.4, the energy release is about 200 MeV. Most of this
energy appears immediately as kinetic energy, but about 20 MeV is
released later in β decay of the fragments. A considerable adjustment
(the transformation of some six or eight neutrons into protons) is needed
to bring the fission products to the curve of stable nuclides, and radio- 
active chains of considerable complexity are involved. In a very few 
cases (less than 1 percent) neutrons are emitted in the course of the decay 
process; these are the so-called delayed neutrons, which follow the fission 
process with time delays varying from seconds to minutes. Altogether, 
more than 160 different radioactive nuclides have been found among 


the fission products: small wonder that early workers had difficulty in 
unscrambling their chemical properties! 
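
The estimate of the energy released in symmetric fission can be reproduced in a few lines of Python. This is only a sketch of the arithmetic in the paragraph above; the binding energies are the values read from the curve as quoted, and the function name is ours.

```python
# Binding energies per nucleon (MeV) as quoted in the text from the binding-energy curve.
B_PER_NUCLEON_FRAGMENT = 8.5  # near A = 120
B_PER_NUCLEON_PARENT = 7.6    # near A = 240
A_PARENT = 240


def symmetric_fission_energy(a_parent, b_parent, b_fragment):
    """Energy (MeV) released when a nucleus of mass number a_parent splits into two equal halves."""
    # Every one of the a_parent nucleons gains (b_fragment - b_parent) in binding energy.
    return a_parent * (b_fragment - b_parent)


q_mev = symmetric_fission_energy(A_PARENT, B_PER_NUCLEON_PARENT, B_PER_NUCLEON_FRAGMENT)
print(f"{q_mev:.0f} MeV")
```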


25.21 Prompt Neutrons—Chain Reactions 


Since the fission of uranium leaves the fragments with an excess of neu- 
trons as compared with the final, stable products, it might be expected 
that some neutrons would be set free even in the first separation. If each
fission results in the production of one or more neutrons, it may be
possible, with a suitable geometrical arrangement, to use these neutrons to
produce fissions in other uranium atoms and, in this way, to bring about 
a self-sustaining chain reaction. This possibility was recognized in the 
first discussions of fission and led to considerable speculation as to the 
practicability of producing nuclear energy on a large scale by this means. 
The magnitude of the energy available, about 200 MeV per event, is 
staggering compared with the few electron volts available from a chemical 
reaction, and it was easy to see that the development had potentially a 


considerable importance as a source of power and, as events progressed, as
a military weapon. The crucial point in evaluating the feasibility of a
self-sustaining reaction is the neutron economy. How many neutrons 
are emitted per fission, and how many of these can be expected to induce 
other fissions before being lost in some other reaction? 


a. Chain Reaction in Uranium The number of prompt neutrons, 
i.e., neutrons emitted simultaneously with the fission, produced in the 
thermal-neutron-induced fission of U²³⁵ can be determined by bombarding
a thin sample of separated U²³⁵ with a beam of thermal neutrons and
observing the number of fast neutrons produced. The total number of 
fast neutrons is found to be 2.65 per fission. Their kinetic energies 
range up to 15 MeV (a few go still higher), with a most probable value 
near 1 MeV. If one of these 2.6 neutrons can be made to produce 
another fission, a chain reaction is possible; if more than 1.6 are lost or 
absorbed in other processes, the reaction will not be self-sustaining. 

Let us consider the problem of producing a chain reaction in a block 
of ordinary uranium. For simplicity we take the block to be infinite in 
extent so that no neutrons are lost through the surface. We assume that 
2.6 neutrons have been produced in a fission process at some point in the
block, and we endeavor to estimate whether one of these will produce
another fission. Some of the pertinent cross sections are listed in Table
25.4. For neutron energies above about 1 MeV, fission of U²³⁸ is possible;


Table 25.4 Neutron cross sections for some 
fissionable materials, in barns 


Average fission cross section, fast neutrons 
Fission cross section, thermal neutrons 
Radiative-capture cross section, thermal neutrons 
Inelastic-scattering cross section, fast neutrons 
Elastic-scattering cross section, fast neutrons 


† Natural uranium; U²³⁸/U²³⁵ = 140.


the average effective cross section is 0.6 barn. The cross section for
U²³⁵ fission in this energy range is larger than that for U²³⁸, but because
of the small fraction (1/140) present, its effect here is negligible. Of the
neutrons initially above the U²³⁸ threshold, few have an opportunity to
cause fission; most of them will lose energy quickly through inelastic
scattering processes, in which a sizable fraction of the neutron energy
is used up in exciting the U²³⁸ nucleus. The cross section for inelastic
scattering is 3 barns, 5 times that for fission. In addition, neutrons
may lose energy, albeit rather slowly, in elastic collisions (σ = 3 barns).
Thus of the roughly 30 percent of the initial neutrons which had energies
above 1.5 MeV, about 10 percent (3 percent of the total) may produce
fission in U²³⁸, and the rest are slowed down to low energies.

As the neutrons approach thermal velocities, fission of U²³⁵ assumes
importance: the fission cross section of U²³⁵ at thermal energy is 577
barns. In its natural dilution this represents an effective cross section
of 4.1 barns. Not all the neutrons which reach thermal velocity produce
fission, however, because the (n,γ) capture process occurs also, both in
U²³⁵ and U²³⁸, with an effective cross section of 3.4 barns in ordinary
uranium. Thus only about half the neutrons which reach thermal
energy produce fission in U²³⁵. A still more serious difficulty appears
in the course of the slowing-down process; this is the resonant absorption
of neutrons by U²³⁸, in which U²³⁹ is formed. The (n,γ) cross section
exhibits many peaks in the region from 5 eV up to a few keV neutron
energy, of which the most important lies in the neighborhood of 6.7 eV, 
with an effective width of 0.03 eV and a cross section of 20,000 barns. 
The scattering cross section at resonance is only about one twentieth 
of the capture cross section, and since elastic collisions reduce the energy 
in steps of only about one percent [see Eq. (25.1)], most of the neutrons 
will be captured in this resonance before reaching thermal energies. The 
actual situation is complicated, but the conclusion that a sustained 
reaction is impossible in ordinary uranium is probably correct. 
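
The thermal-neutron part of this bookkeeping is easy to check numerically. The Python sketch below uses only the cross sections and the isotopic ratio quoted above; the variable names are ours, and the resonance capture during slowing down is deliberately left out.

```python
SIGMA_FISSION_U235 = 577.0   # barns, thermal fission cross section of U-235 (from the text)
RATIO_U238_TO_U235 = 140.0   # isotopic ratio in natural uranium (from the text)
SIGMA_CAPTURE_NATURAL = 3.4  # barns, effective (n,gamma) cross section (from the text)

# Fraction of the atoms that are U-235, and the fission cross section diluted accordingly.
fraction_u235 = 1.0 / (1.0 + RATIO_U238_TO_U235)
sigma_fission_natural = SIGMA_FISSION_U235 * fraction_u235

# Probability that a thermal neutron absorbed in natural uranium causes fission.
p_fission = sigma_fission_natural / (sigma_fission_natural + SIGMA_CAPTURE_NATURAL)

print(f"effective fission cross section = {sigma_fission_natural:.1f} barns")
print(f"fraction of thermal absorptions leading to fission = {p_fission:.2f}")
```

The diluted fission cross section comes out close to the 4.1 barns quoted, and a little over half of the thermal absorptions lead to fission.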


b. Moderated Reactors The problem of avoiding the resonant capture 
in U²³⁸ can be solved by the use of a moderator, i.e., a light material
mixed with the uranium to slow the neutrons rapidly through the reso- 
nance region. In hydrogen, for example, the median energy is changed 
by a factor 1/e per collision, and the scattering cross section for slow 
neutrons is 20 barns (Fig. 25.21). With an admixture of, say, 100 
atoms of hydrogen (in the form of water or paraffin) to one of uranium, 
most of the neutrons would pass through the resonance region without loss 
by capture. Unfortunately hydrogen has a thermal-neutron absorption 
cross section of 0.33 barns, so that it swallows up the neutrons as soon as 
they reach thermal energy and prevents a self-sustaining reaction. Of 
the other light substances, lithium and boron present the same difficulty. 
Helium, deuterium, beryllium, and carbon, having very small thermal- 
absorption cross sections, remain as possible moderators. 
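
To see why a light moderator is so effective, one can estimate how many elastic collisions are needed to bring a fission neutron down to thermal energy. The Python sketch below takes the text's factor 1/e per collision for hydrogen and, for heavier nuclei, the standard expression for the mean logarithmic energy loss per collision; that expression, the thermal energy of 0.025 eV, and the function names are assumptions introduced here, not taken from the text.

```python
import math

E_FISSION_EV = 1.0e6  # most probable prompt-neutron energy quoted in the text (about 1 MeV)
E_THERMAL_EV = 0.025  # assumed thermal energy near room temperature; not from the text


def log_energy_decrement(a: int) -> float:
    """Mean decrease of ln(E) per elastic collision with a nucleus of mass number a."""
    if a == 1:
        return 1.0  # hydrogen: the energy falls by a factor 1/e per collision, as in the text
    return 1.0 + (a - 1) ** 2 / (2 * a) * math.log((a - 1) / (a + 1))


def collisions_to_thermalize(a: int) -> float:
    """Approximate number of collisions needed to slow from E_FISSION_EV to E_THERMAL_EV."""
    return math.log(E_FISSION_EV / E_THERMAL_EV) / log_energy_decrement(a)


for name, a in (("hydrogen", 1), ("carbon", 12)):
    print(f"{name}: about {collisions_to_thermalize(a):.0f} collisions")
```

On these assumptions hydrogen needs fewer than 20 collisions, while carbon needs on the order of 100.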


c. Critical Size In a reactor of finite size, escape of neutrons may 
be a problem. Since the escape is a surface effect and all other inter- 
actions are simply proportional to the volume, the smaller the reactor, 
the more serious the escape. The minimum size of reactor which can
just maintain itself is called the critical size; its magnitude can be roughly
estimated from the following argument. In the course of being thermalized
in a graphite moderator, a neutron makes about 100 collisions
with carbon nuclei. The cross section for collision is about 4 barns; 
the mean free path between collisions is then 


\[
\lambda = \frac{1}{\sigma N} = \frac{12}{4 \times 10^{-24} \times 6 \times 10^{23} \times 1.6} \approx 3\ \text{cm}
\]


where σ is the elastic-scattering cross section and N the number of nuclei
per cubic centimeter. Since the neutron changes direction in a random
way at each collision, the radial distance it has traveled from its point of
origin after n₁ collisions is, on the average, √n₁ × 3, or 30 cm. After it
reaches thermal energy, the neutron makes further collisions before it is
finally captured, producing either fission or an (n,γ) process. The number
of collisions before capture will be n₂ = Λ/λ, where Λ is the mean free
path for absorption and λ the mean free path for scattering. The displacement
is then √n₂ λ, or (Λλ)^(1/2). In a pile containing 100 atoms of
carbon to 1 of uranium, Λ will have roughly 100 times its value in uranium

\[
\Lambda = \frac{100}{\sigma N} = \frac{100 \times 238}{7.5 \times 10^{-24} \times 6 \times 10^{23} \times 19} \approx 300\ \text{cm}
\]


where σ is the combined cross section for fission and radiative capture.
The scattering mean free path λ, determined by the graphite, is again
about 3 cm. The combined distance for thermalizing and capture,
(30² + 300 × 3)^(1/2) ≈ 40 cm, is then a rough measure of the average
displacement of a neutron between production and capture. The actual 
paths of individual neutrons are subject to statistical fluctuations, and 
an appreciable fraction—roughly 1/e—will suffer larger displacements 
than 40 cm. Also, since the displaced neutrons constitute the sources 
for further neutrons through the fission processes they induce, it is clear 
that the dimensions of the system must be several times the average 
path if the loss is to be kept below the few percent tolerance allowed. 
An actual calculation based on diffusion theory yields a value of about 
300 cm for the critical radius of a spherical graphite-uranium reactor. 
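
The order-of-magnitude argument above can be retraced numerically. The following Python sketch uses the cross sections, densities, and the 100:1 carbon-to-uranium ratio of the text; the helper name `mean_free_path_cm` and the variable names are ours, and the result is only the rough displacement estimate, not the diffusion-theory critical radius.

```python
import math

AVOGADRO = 6e23   # per mole, rounded as in the text
BARN_CM2 = 1e-24  # one barn in cm^2


def mean_free_path_cm(sigma_barns, density_g_cm3, molar_mass_g):
    """Mean free path 1/(sigma*N) for the given cross section and material."""
    number_density = density_g_cm3 * AVOGADRO / molar_mass_g  # nuclei per cm^3
    return 1.0 / (sigma_barns * BARN_CM2 * number_density)


lam = mean_free_path_cm(4.0, 1.6, 12.0)                  # scattering in graphite
big_lambda = 100 * mean_free_path_cm(7.5, 19.0, 238.0)   # absorption, diluted 100:1 by carbon

r_slowing = math.sqrt(100) * lam             # random walk of about 100 collisions while slowing
r_thermal = math.sqrt(big_lambda * lam)      # random walk of Lambda/lambda collisions before capture
r_total = math.hypot(r_slowing, r_thermal)   # the two displacements add in quadrature

print(f"lambda = {lam:.1f} cm, Lambda = {big_lambda:.0f} cm, displacement = {r_total:.0f} cm")
```

Without the intermediate rounding used in the text, the displacement comes out slightly above 40 cm, which does not change the conclusion.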


d. Reactor Control In practice, a reactor is built of such dimensions 
as to make it slightly above critical size, and the system is controlled by 
inserting rods or sheets of some material with a high neutron-absorption 
coefficient—cadmium is often used—into the reactor to reduce the mul- 
tiplication factor to 1.00. The control of a reactor could be a rather 
ticklish problem; the lifetime of a neutron from production to ultimate 
absorption is only a few milliseconds, and even a small excess of the
multiplication factor over 1.00 (supercriticality) might lead to catastrophic
results in a short time. Fortunately there are the delayed neutrons,
with lifetimes ranging from fractions of a second to some minutes.
These neutrons, comprising about 1 percent of the total, provide an 
adequate time delay so that small changes in criticality are reflected in 
changes in neutron flux only after a time of the order of minutes or more. 


e. Power Production As a practical matter, the most interesting 
characteristic of a pile is the number of fissions per unit time or, what 
amounts to the same thing, the power level. Each fission releases about 
200 MeV of energy, most of which appears eventually in the form of heat. 
A reaction rate of 3.1 × 10¹⁰ fissions per second corresponds to a power of
1 W. The number of reactions which can be allowed to take place per
second is limited by the rate at which this heat can be removed. The first
successful pile had no special cooling and could be operated at only about
2 kW. Some reactors operate at power levels of thousands of megawatts.
At a power level of 1 MW, the consumption of U²³⁵ is about 1.0 g/day.
The attractiveness of a nuclear reactor as a power source is such that a 
major fraction of the large new electric power stations use nuclear energy. 
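
The two figures quoted above, the fission rate per watt and the daily consumption at 1 MW, follow directly from the 200 MeV released per fission. A minimal Python sketch of that conversion is given here; the constants are standard rounded values and the function names are ours. It counts only atoms that actually fission, not those lost to radiative capture.

```python
MEV_TO_JOULE = 1.602e-13        # joules per MeV
ENERGY_PER_FISSION_MEV = 200.0  # from the text
AVOGADRO = 6.022e23             # per mole
MOLAR_MASS_U235 = 235.0         # g/mol, rounded
SECONDS_PER_DAY = 86400.0


def fissions_per_second(power_watts):
    """Fission rate needed to sustain the given thermal power."""
    return power_watts / (ENERGY_PER_FISSION_MEV * MEV_TO_JOULE)


def u235_fissioned_g_per_day(power_watts):
    """Mass of U-235 (grams) fissioned per day at the given power."""
    atoms_per_day = fissions_per_second(power_watts) * SECONDS_PER_DAY
    return atoms_per_day * MOLAR_MASS_U235 / AVOGADRO


print(f"{fissions_per_second(1.0):.2e} fissions per second per watt")
print(f"{u235_fissioned_g_per_day(1.0e6):.2f} g/day at 1 MW")
```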


25.22 Fusion 


Although the phenomenon of fission is a most spectacular example of the 
conversion of mass energy into other, less passive forms, considerably 
more energy can be released by fusion of the lightest elements into 
elements lying near the middle of the atomic table. From the observed 
variation of the average binding energy per nucleon (Fig. 25.3) it can 
be seen that the combination of free protons and neutrons to make 
2sNi®, for example, would release 8.8 MeV/nucleon—more than 10 
times the energy released per nucleon in fission. About 7 MeV/ 
nucleon of this energy is already released in the combination of two
neutrons and two protons into an α particle: because the α particle is so
tightly bound, further combinations of α particles release comparatively
little energy.


Stellar Energy Sources The conversion of hydrogen into helium is 
generally believed to be the principal source of energy in the sun and 
the stars. Hydrogen appears to constitute more than 90 percent by 
mass of the total material in the universe, so that there is a comparatively 
large amount of energy available. Two possible mechanisms of conversion
have been given serious consideration: the so-called carbon-nitrogen
cycle and the direct proton-proton chain. Both processes depend for
their operation upon extremely high temperatures. The reactions are
brought about by the bombardment of one particle by another in the 
course of thermal agitation. Because the particles are charged, the 
probability of approach within the range of nuclear forces is very small 
indeed, even at the high temperatures and densities which obtain in stellar 
interiors, and, in fact, the reactions proceed mainly with particles whose 
energies lie high upon the tail of the Maxwell energy distribution. The
carbon-nitrogen cycle requires the previous existence (formed in another
process) of a small amount of carbon: the carbon is not depleted in the
cycle but acts only as a kind of catalyst for conversion of four protons into
an α particle. The reactions involved were proposed by Bethe and are


C¹²(p,γ)N¹³     N¹³(β⁺)C¹³
C¹³(p,γ)N¹⁴
N¹⁴(p,γ)O¹⁵     O¹⁵(β⁺)N¹⁵
N¹⁵(p,α)C¹²


The net result is
4H¹ → He⁴ + 2β⁺ + 2ν + 25 MeV


Most of this energy will be absorbed locally and converted into thermal 
energy which maintains the reaction; the temperature will remain fixed 
at such a value that the rate of production just equals the rate of loss by 
thermal radiation.
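
The energy in the net reaction can be compared with the mass difference between four hydrogen atoms and one helium atom. The Python sketch below uses standard tabulated atomic masses, which are not given in this section, so they should be read as assumed inputs; because atomic masses are used, the annihilation energy of the two positrons is included.

```python
U_TO_MEV = 931.494   # energy equivalent of one atomic mass unit, in MeV
MASS_H1 = 1.007825   # atomic mass of H-1 in u (tabulated value, assumed here)
MASS_HE4 = 4.002603  # atomic mass of He-4 in u (tabulated value, assumed here)

# Mass converted to energy when four hydrogen atoms become one helium atom.
mass_defect_u = 4 * MASS_H1 - MASS_HE4
q_total_mev = mass_defect_u * U_TO_MEV

print(f"mass defect = {mass_defect_u:.6f} u")
print(f"total energy release = {q_total_mev:.1f} MeV")
```

The total is somewhat larger than the 25 MeV in the net reaction; the difference is of the size one would expect for the energy carried away by the two neutrinos, though that attribution is our reading rather than a statement of the text.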
The probability of any of these (p,γ) processes at low energies is governed
by the Gamow penetration factor [Eq. (25.19)], which is a steep
exponential function of the kinetic energy. At the estimated temperature
of the sun’s center the most probable energy kT of the protons is
about 1 keV, but the rise of the cross section is so steep that the main
contribution comes from the few protons with thermal-agitation energies
some 10 times this value. The rate of reaction increases about as the
twentieth power of the temperature.

The direct combination of protons may take place through the


reaction 
H¹(p,β⁺)H²
followed by such reactions as


H²(p,γ)He³     He³(He³,2p)He⁴     He³(d,p)He⁴

The coulomb barrier is relatively less important here than in the reactions
involving carbon and nitrogen, and the rate is less temperature-dependent.
In the sun the (p,p) reactions appear to contribute most of the
energy; in hotter stars, the C-N reactions dominate. 


Problems 


1. Compute the Q’s for the following reactions (see Table 10.2 for
masses): (a) ₇N¹⁴(α,d)₈O¹⁶; (b) ₄Be⁹(d,n)₅B¹⁰; (c) ₄Be⁹(α,n)₆C¹²; (d)
₃Li⁶(d,α)₂He⁴; (e) ₉F¹⁹(p,α)₈O¹⁶.

2. An important reaction in fusion research is H²(d,p)H³. Find the
Q of this reaction and the energies of the proton and triton if the energies
of the deuterons can be neglected.


3. Calculate the de Broglie wavelength of a 30-GeV proton and of a 


20-GeV electron. How do these values compare with the distance 
between nucleons in nuclei? 


4. A neutron of mass m and initial speed v collides head on with a 
nucleus of mass M at rest. Show that if the collision is elastic, the 
kinetic energy K transferred to the nucleus is 2Mm²v²/(M + m)². Find
the fraction of the energy of the neutron transferred to (a) a proton of
mass m, (b) an α particle of mass 4m, and (c) a uranium atom of mass
238m.


5. (a) Calculate the maximum energy of protons obtainable from a 
cyclotron of 1.2 m dee diameter and 1.5 Wb/m? magnetic induction. 
At what frequency must the cyclotron be operated? If the average 
energy gain per dee passage is 50 keV, how many revolutions do the 
protons make? 

(b) Calculate the maximum energy which the cyclotron can give
α particles. At what frequency must the cyclotron be operated to achieve
this energy? What is the α-particle energy which can be achieved using
the frequency optimum for protons?

Ans: (a) 39 MeV; 22.9 MHz; 390


6. Show that if the 4.5-MeV protons observed by the Joliot-Curies 
(Sec. 25.2) were indeed recoils from γ rays, the latter would have an
energy of about 50 MeV. 


7. Calculate the energy of the γ ray emitted by an excited B¹¹ nucleus
formed by the capture of a slow neutron by a B¹⁰ nucleus.

8. It is estimated that the energy released in the atomic-bomb explosion
at Hiroshima was about 7.6 × 10¹³ J, equivalent to 20,000 tons of TNT.
If an average of 200 MeV was released per fission, and if all fissions were
by neutron capture in U²³⁵, find the number of U²³⁵ atoms fissioned and
the mass of U²³⁵ consumed. If 20 percent of the U²³⁵ atoms fissioned,
what mass of U²³⁵ was needed for the bomb?

9. If a beam of thermal neutrons is passed through a cadmium foil 
0.5 mm thick, what fraction of the beam is absorbed? All Cd isotopes 
have negligible absorption cross sections except for Cd¹¹³, which has a
capture cross section of 20,000 barns and a natural abundance of 12.26
percent. The density of Cd is 8650 kg/m³.

10. When U²³⁵ captures a slow neutron, it fissions. If the fission
products are Rb⁹² and Cs¹⁴⁰, how many neutrons are emitted? If the
masses of U²³⁵, Cs¹⁴⁰, and Rb⁹² are respectively 235.043915, 139.917110,
and 91.919140 u, find the energy released in this particular fission.

11. A crude estimate of the reaction (or absorption) cross section of a
nucleus for neutrons or protons of moderate energies can be obtained by
assuming that any nucleon for which the center of mass reaches the edge
of a nucleus is captured. Show that for neutrons this cross section is
about πR², where R is the nuclear radius, while for protons it is
πR²(1 − Ze²/4πε₀RK) for proton kinetic energy K greater than Ze²/4πε₀R
and zero for smaller values of K. Hint: Use the conservation of angular
momentum.


12. Estimate the radius of the ₁₄Si²⁷ nucleus from the fact that the
maximum kinetic energy of the positron emitted in the reaction ₁₄Si²⁷ →
₁₃Al²⁷ + e⁺ + ν is 3.48 MeV. To what value of R₀ in Eq. (25.5) does
this lead? (Assume that the charge is distributed uniformly throughout
the nuclear volume.)

13. (a) Calculate the Q of the reaction Li⁷(p,α)He⁴.

(b) If the radius of the Li nucleus is 1.4 × 10⁻¹⁵ A^(1/3) m, compute
the height of the coulomb barrier encountered by a proton reaching a
Li⁷ nucleus. How do you explain the fact that Cockroft and Walton
excited this reaction with 125-keV protons?

(c) If the reaction is produced by 1.2-MeV protons striking a thin
Li target, what is the maximum kinetic energy of an α particle ejected
forward in the laboratory reference frame?

14. Find the Q value of the reaction He⁴(α,p)Li⁷. If a beam of α
particles is incident on a stationary helium target, calculate the minimum
kinetic energy required of the incident α particles to excite this reaction.

15. Show that when a nuclear reaction has a negative Q value, the
threshold bombarding energy for observation of particle 2 (Fig. 25.14)

16. Find the Q of the endoergic reaction Li⁷(p,n)Be⁷ if the mass of Be⁷
is 7.016929 u. What is the minimum bombarding energy required for
protons striking stationary lithium nuclei?


Index


Aberration, stellar, 24
Absorption of x-rays, 396-398, 
635-644 
pair production, 640-642 
photoelectric, 637 
scattering, 638-640 
Absorption edges, 396-397 
Accelerators, nuclear, 693-699 
Acceptors, 616, 619 
Age of earth, 221 
Alkali atoms, term energies and
spectra of, 230-232, 434-441
Alkali halides, 564-567 
Alkali metals, 569-570, 577-578 
Alpha decay: 
stability against, 675 
theory of, 685-686 
Alpha particles, 205-208
range of, 650
scattering of, 209-213, 221-294
spectra of, 252 
transmutations by, 663, 689-690 
Ammonia inversion spectrum, 522-525 
Angular momentum: 
of atom, 383-386, 431-434
with one electron, 366-368 
in Bohr’s theory, 234, 235 


of electron in central field, 383, 
384, 431 


of electron spin, 243 


indeterminacy of, 368 
in magnetic field, 242, 243 
of molecules, 501, 511 
of nucleus, 481-485, 703 
operator for, 335, 368 
orbital, 366-369 
Annihilation radiation, 265, 636 
Antiparticles, 266-269 
Antisymmetric wave functions, 385, 
403, 497, 551-553 
Atmospheres, law of, 107 
Atom: 
Bohr theory of, 233-236, 241-248 
electronic structure, 389-394,
753-754
many-electron, 381-386 
mass of, 257-260 
models of, 209, 211 
one-electron, 357-377 
preternatural, 272 
size of, 110 
Atomic beams, 478 
Atomic mass unit, 261 
Atomic masses, 98, 218, 261, 263 
Atomic number Z, 184, 213, 753-754
Attenuation coefficient, 396, 636-637,
643
Auger effect, 420
Avogadro's number N_A, 99


Balmer series, 227-230
Bands, spectral, 499, 506-509, 514-518 
alternating intensities in, 552- 
554 
Barkla, C. G., 182, 183, 190-192 
Barn, definition of, 190, 722 
Barrier penetration, 313, 524, 685 
Baryons, 268-270, inside back cover 
Becquerel, H., 203, 204 
Beta decay, stability against, 675 
Beta rays, 205 
nature of, 205-206 
spectra of, 255-257 
(See also Electrons) 
Betatron, 698 
Bethe, H. A., 648, 651, 749 
Binding energy: 
molecular, 491 
nuclear, 672-675 
solids, 562-563 
Blackbody radiation, 117-146, 550 
Bloch function, 591 
Bohr, N., 233, 244, 646, 742 
Bohr atom, 233-239, 244-246 
Bohr correspondence principle, 244-246
Bohr magneton, 241 
Bohr radius, 235 
Boltzmann constant k, 100 
Boltzmann distribution, 107, 109, 
535, 543 
Boltzmann radiation law, 119, 138- 
140 
Bose-Einstein statistics, 537-539, 
542, 549 
Boundary conditions, 30, 330 
Bragg law, 178 
for electrons, 592 
Bragg planes and spacings, 176
Breadth of spectral lines, 458-461 
Breit-Wigner resonance formula, 
713-714 


Bremsstrahlung, 636, 652, 654 

Brillouin zones, 596-603 

Bubble chamber, 659 

Burger-Dorgello-Ornstein sum rule, 
438 


Cavendish, H., 25 
Cerenkov radiation, 653-655 
Chain reactions, 744 
Characteristic function (see Eigen- 
functions and eigenvalues) 
Charge cloud, 363, 382, 492 
Charged particles: 
detection of, 654-658 
energy loss of, 646-654 
range of, 650 
Cloud chamber, 657 
Cockroft-Walton experiment, 691 
Collective model of nucleus, 739 
Collisions of the second kind, 451 
Commutator operator, 336 
Compound nucleus, 719 
Compton effect, 192-198 
cross section for, 640 
inverse, 202 (Probs. 24, 25) 
Conduction band, 572, 609, 612 
Conductivity, 573-575, 613-614 
Configuration, electronic, 386, 444, 
753-754 
Conservation laws, 270, 703 
Contact potentials, 583 
Contraction, Lorentz-FitzGerald, 53, 
59 
Cooling, thermoelectric, 630 
Correspondence principle, 244-246 
Cosmic rays, 264, 636 
Coulomb barrier (see Nuclear barrier) 
Coulomb excitation of nucleus, 702 
Coulomb field, 88, 89, 271, 684-686, 
691, 749
Coupling, 444
jj, jK, and LK, 455-458
AS, in molecules, 511 
LS (Russell-Saunders), 445-455 
with nucleus, 483 
Q, in molecules, 513 
Covalent binding, 392, 490 
molecules, 496 
solids, 563, 567 
Cross section, 190, 721 
differential, 723 
fissionable materials, 745 
neutron-proton scattering, 722 
x-ray scattering, 190, 640 
Crystal diffraction grating, 175-178, 
595-598 
Crystals, 559 
covalent binding in, 565, 567-568 
energy bands of, 572 
ionic binding in, 562, 564-567 
metallic binding in, 562, 568-570 
structure and types, 560-562 
wave mechanics of, 590-602 
Curie, M., 204 
Curie (unit), 220 
Cyclotron, 696 


Davisson-Germer experiment,
286-288
de Broglie waves, 280-286, 728
Debye theory of heat capacity,
547-549
Degeneracy, 339, 373
exchange, 402
nuclear, 482
Degrees of freedom, 101
of atoms and molecules, 101-102
of elastic waves, 548
for waves in an enclosure, 121-125
Density of states, 576, 601-603,
606, 612
Depletion layer, 621, 628, 629
Deuteron, 673, 678-680, 702 

Dirac electron theory, 265 
Disintegration constant, 215 
Displacement law, Wien’s, 120, 146 
Dissociation of molecule, 491, 505, 509 
Donor, 616, 619 

Doping in semiconductors, 615-617 
Doppler effect, 30, 79, 459 

Dosages, radiation, 644-645 
Dulong-Petit law, 102, 546 


Earth, age of, 221 
Edison effect, 150 
Effective mass, 593-595 
Ehrenfest theorem, 320, 325 (Prob. 
10) 
Eigenfunctions and eigenvalues, 331 
for a box, 336-339 
harmonic oscillator, 339, 340, 350 
one-electron atom, 360-363 
Properties of, 342 
(See also Wave functions) 
Einstein frequency condition, 234 
Einstein heat capacity of solids, 
546-547 
Einstein photoelectric equation, 163 
Einstein theory of relativity, 54 
Einstein transition probabilities, 349
Electromagnetic field: 
Maxwell's equations for, 84 
of point charges, 88, 89 
and relativity, 47, 82 
Electron density, 577, 612, 614, 618,
619
Electrons: 
capture in nuclei, 671, 676 
charge of, 158, 159 
classical radius of, 66 
in complex atoms, 382-386 
creation of, 264-266
discovery of, 155, 157 
effective mass of, 593-595 
e/m of, 155, 157 
energy loss by, 651-654
“free,” 576-580
magnetic moment of, 244
in periodic lattice, 590-593
radiation by, 652
recoil, 196-197
rest energy of, 66 
spin of, 243, 369, 405, 467 
stopping of, 651-653 
thermionic emission of, 580-582 
wavelength of, 285 
Elements, 387 
isotopic composition of, 262 
transuranic, 740 
Emittance, blackbody, 119 
Endoergic and exoergic reaction, 704 
Energy: 
conservation of, 28, 37
equipartition of, 101
of gas molecules, 100
loss by charged particles, 646-654
and mass, 65, 80
relativistic, 64
released: in fission, 744
in nuclear reactions (see Q value)
rest, 65
separation, 675
(See also Binding energy) 
Energy gap, 572, 609-614 
Energy levels, 238 
in central field, 383 
in magnetic field, 463-478 
molecular, 499-524 
nuclear, 251-255, 729 
in solids, 569
x-ray, 398, 412 
Equipartition of energy, 101 
Equivalent electrons, 452-455 


Esaki diode, 625-626 

Exchange effects, 402, 405, 408 

Exchange forces, 682 

Exclusion principle, 381, 384, 406, 
452, 536, 683 

Expectation values, 317, 319, 333 

Extrinsic semiconductor, 615-619 


Faraday (unit) of electricity, 150 
Fermi-Dirac statistics, 539-543, 576 


Fermi distribution function, 540, 542, 


576, 612 


Fermi energy (level), 540, 542, 577, 
599, 613, 617 


temperature variation in metal, 579 
in extrinsic semiconductor, 617-
Fermi surface, 599-603
Field tensor, 86 
Fine structure: 

in alkali-type spectra, 436-441 

in molecular spectra, 513

in the NH₃ inversion line, 524
Fine-structure constant a, 236 
Fission (see Nuclear fission) 


FitzGerald-Lorentz contraction, 53, 59


Fluorescence, 183 
Fluorescence yield, 420 
Four-vectors, 74-85 

Frame of reference, 46, 90, 371
Franck-Condon principle, 518 
Fusion of light nuclei, 748 


Gamma rays, 205, 208 
absorption of, 635-645 
detection of, 655
discovery of, 205 
internal conversion, 701 
from nuclear reactions, 713-717 
origin of, 255 


pair production by, 264, 640 
scattering of, 638-640 
spectra of, 255 
Gap energy, 572, 609-614 
Geiger counter, 655 
Germanium, 613-614, 616, 617 
Grating space, 180, 181 
Gravitation, 21, 90-92, 270 
Ground state, 239, 338, 339
nuclear, 252 
Group velocity, 282-284, 594 
Gyromagnetic ratio, 241- 


Hadrons, 271 
Half-life, 215, 219-221 
Hall effect, 587-590, 604-605 
Hamiltonian operator Ĥ, 303
Harmonic oscillator, 126-131, 339, 
345, 350 
Hartley’s law, 228 
HCl, molecular spectra of, 503, 508-510
Heat, discoveries in, 23, 27 
Heat capacity: 
Dulong-Petit law, 102 
of gases, 102, 543-546 
of hydrogen, 545, 555 
of solids, 102, 546 
Debye theory, 547-549 
Einstein theory, 546-547 
Heisenberg uncertainty principle, 296, 
320, 460 
Helium, 387, 389 
ionized, 239 
in radioactive minerals, 221 
spectrum of neutral, 441-443
Holes, 594, 603-605, 612 
density of, 613, 614, 618, 619 
Homonuclear molecules, 550-556 
Hund’s rule, 454 




Hydrogen: 
Bohr's theory of, 233-238 
energy levels of, 237, 238 
heat capacity of, 545, 555 
heavy molecule, 496-499 
ortho- and para-, 554-556 
spectrum of, 228, 229 
wave mechanics of, 358-365 
Hydrogen molecule ion, 493-495 
Hyperfine structure, 480-485 
Hyperons, 269 


Indeterminacy principle, 296, 320, 460 
Induction accelerator (betatron), 698 
Interactions, strong and weak, 269- 
272 

Intercombination lines, 232 
Internal conversion, 701 
Interval between events, 73, 74, 78 
Intrinsic semiconductor, 611-614 
Inversion spectrum of NH₃, 522-525
Ionic bond, 490-492, 562, 564-566
Ionization of atoms, 393 

by charged particles, 646, 651, 

654-658 

inner, 396, 399 

multiple, 421-423
Ionization chamber, 174, 180, 655
Ionization energy or potential, 238, 

393, 753-754 

Isobars, 218, 676, 730-732
Isoelectronic sequence, 240, 390 
Isotone, 736 
Isotope effects, 480-485 
Isotopes, 260-263

radioactive, 217-219 
Isotopic spin, 732 


Junction: 
metal-metal, 582-583 
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Junction: 
metal-semiconductor, 627-629 
p-n, 620-626, 630-632 
Junction rectification, 621-623 


Kelvin, Lord, 28, 43 

Kepler’s laws, 14 

Kinetic energy, relativistic, 64, 80 
Klein-Nishina scattering, 638-640 


Landé g factor, 467-469 

Landé interval rule, 448, 449 
Larmor precession, 243, 472 

Laue spots, 175, 176 

Leptons, 268-270 

Linear accelerator, 699 

Liquid-drop model of nucleus, 718 
Lorentz, H. A., 48, 50, 53, 58-61, 66 
Lorentz transformation, 57, 75 
Lorentz unit, 470 


Magic number nuclei, 733-738 
Magnetic field: 
effect on atoms, 153-155, 463 
on angular momentum, 242, 243 
on energy levels, 465, 469 
motion of electrons in, 243, 472- 
474, 477 
relativity transformation, 83-89 
Magnetic moment, 241 
of atoms, 478-480 
of electron spin, 244, 467 
of electron system, 458-470, 474, 
476 
of nuclei, 482-485, 669 
due to orbital motion, 241, 465 
of protons and neutrons, 263 
Magnetic rigidity, 205 
Magnetism, 606 


Magneton: 
Bohr, 242, 244 
nuclear, 263 
Majorana force, nuclear, 683 
Mass: 
atomic unit of, 261 
effective, 593-595 
and energy, 65-68, 80 
reduced, 238, 359 
variation with speed, 62 
Mass number A, 218 
Matrix: 
diagonalized, 402 
for Lorentz transformation, 75 
Matrix element, 347 
Matthiessen’s rule, 574 
Maxwell-Boltzmann distribution, 108, 
109, 535, 543 
Maxwellian velocity distribution, 
103-106 
Maxwell’s equations, 84 
Mean life, 216, 348 
Mechanics: 
discoveries in, 23 
and geometrical optics, 279-282 
relativistic, 62-67, 80 
Mercury, arc spectrum of, 449-452 
Mesons, 267, inside back cover 
Metals, 572 
binding of, 563, 568-572 
conduction in, 573-575 
in contact: with metal, 582-583 
with semiconductor, 627-629 
Sommerfeld model, 576-579 
thermionic emission from, 580-582 
Metastable levels, 451 
Michelson-Morley experiment, 50 
Miller indices, 562 
Millikan, R. A., 158, 159, 165 
Minkowski force, 81 
Minkowski rotation, 75 
Mirror nuclides, 707-709, 730-732 
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electromagnetic, 65, 135 

four-vector, 78 

operator, 302, 334 

relativistic, 63, 78 

in wave mechanics, 318 
Moseley’s law, 183-185, 240, 416 
Mössbauer effect, 255
Multiplet rule, 454 
Multiplets, 228, 447-449, 454 
Multiplicity (2S + 1), 454, 461
(Prob. 2)

Mu neutrino, 267, inside back cover 
Muon, 60, 266, 273, inside back cover 




Neptunium, 219, 220 
Neutrino, 257, inside back cover 
mu, 267, inside back cover 
Neutrons, 263, inside back cover 
capture of, 726-728 
detection of, 724 
diffraction of, 289, 728 
discovery of, 664 
energy loss in hydrogen, 725 
production of, 720 
properties of, 263, 264
proton scattering by, 722
reactions induced by, 720
released in fission, 742, 745, 748 
resonance absorption of, 727 
scattering of, 680-681, 728
slow and thermal, 725 
total cross section, 721 
Nitrogen nucleus, 553-554 
Normalization, 296, 331, 340, 361, 365 
Nuclear barrier, 684, 691, 749 
Nuclear fission, 740-748 
cross sections, 745-746 
energy release in, 744 


Mobility, 574, 590, 605, 614
Moderator, neutron, 746
Momentum:
Nuclear forces, 271, 677-688
Nuclear models:
collective, 739


independent particle, 733-738 
liquid-drop, 718 
shell, 733-738 


Nuclear reactions, 700-728 


alpha-particle induced, 663, 689 
artificially induced, 662 
conservation laws for, 703 
determination of mass by, 703-707 
energy balance Q, 704 

excitation function, 713 

fission (see Nuclear fission) 

fusion (see Fusion of light nuclei) 
kinetics of, 703 


of neutrons, 723-728


particle groups in, 709 
resonances, 713 
types, 700 


Nuclear reactor, 746-748 
Nucleon, 263, 671 
Nucleus: 


angular momentum, 481-485, 703 

binding energy of, 672-675 

compound, 719 

constituents of, 263, 671 

electric quadrupole moment, 481,
525, 670 

electrostatic energy of, 684, 743 

energy levels of, 252-255, 709-7) 
729-731 

isotopic spin of, 732

models of (see Nuclear models) 

properties of, 668-670 

quantum states of, 713-717 

radius of, 669 

spin of, 481-485, 669 

stability of, 674-677, 686-688 

statistics of, 670 


Nuclide, 218 


mirror, 707-709, 730-732 
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One-electron atom, 233-248, 357-366 
Operator, 302, 334 
commutator, 336 
Orbits in Bohr model, 234, 235 
elliptical, 246-248 
Ortho states, 551 
Orthogonality of functions, 342, 352 
Oscillators: 
Planck's theory of, 126-131 
wave mechanics of, 339, 350 


Pair production, 264, 640-642 
Para states, 551 
Parity, 341, 703 
Particle in a box, 336-339 
two particles, 401-405 
electrons, 405-410 
Paschen-Back effect, 472-476 
Pauli exclusion principle, 381, 384, 
406, 452, 536, 683 
Peltier emf, 629-630 
Periodic table, 387, inside back cover 
Perturbation theory, 345, 370 
Phase velocity, 281-284 
Phase waves, 278, 282 
Phonons, 547, 574 
Photodisintegration, 637, 702 
Photoelectric absorption, 396-401, 
637-638 
Photoelectric effect, 151, 159-161, 165 
and Compton effect, 198-199 
Einstein law for, 163, 165 
nature of, 161-165 
due to x-rays, 396-401, 637-638 
Photons, 65, 79, 291-294 
Photovoltaic effect, 623-625 
π, σ lines, 470-471
Pions, 266 
Planck's constant h, 130, 134 
Planck's radiation law, 126-135, 550 
Positron, 264-266, 676 


Positronium, 265 
Potassium chloride, 490-492, 564-567 
Preston's rule, 471 
Preternatural atoms, 272 
Probability density, 296, 363 
Probability stream density, 321-324
Proper time, 77 
Proportional counter, 655 
Protons, 263, inside back cover 
energy loss by, 646-651 


Q value, 704-707 
Quadrupole, electric, in nucleus, 481, 
525, 670 
Quantum numbers: 
F, MF, 483
I, MI (nucleus), 482-485, 703
J, MJ, 372
L, S, J, MJ, 432, 433
L, S, ML, MS, 446, 474
for molecules: J, 501
K, S, Λ, Ω, 511-514
v, 505
n, l, j, mj, 384
n, l, ml, ms, 360, 384
principal, 384
effective, 395, 416, 417, 439
s, ms, S, MS, 243, 445, 446
Quantum states, 127-130, 233 
molecular, 510-516 
nuclear (see Nucleus) 
x-ray, 412 
(See also Energy levels; specific 
topics) 
Quantum theory, 130, 233-239, 246
(See also Wave mechanics; specific
topics) 


Radiation:
absorption and emission of, 182,
347-349
by charged particles, 652
by dipole or oscillator, 130
annihilation, 265 
blackbody, 117-146 
Cerenkov, 653-655 
electromagnetic, 65, 292-294 
Doppler effect, 79 
intensity of, 189, 293 
nature of, 198 
pressure due to, 135-140 
Radioactive series, 219 
Radioactive transformations, 214-219
Radioactivity, 204, 214, 251-257 
induced, 662, 663, 689 
Radioisotopes, 217-219, 690, 703 
Radium: 
discovery of, 204 
half-life of, 220 
Radon, 214-216 
Raman effect, 518-522, 553 
Range of charged particles, 650 
Rayleigh-Jeans formula, 125, 133 
Reactor, 746-748 
Rectifier, p-n, 621-623 
Relativity, 54 
in electromagnetism, 47, 82-89 
galilean-newtonian, 45 
theory of general, 90 
in wave mechanics, 369, 370 
Relativistic effects in spectra, 369 
Resistivity in metals, 574-575 
Resonances, nuclear, 713 
Richardson-Dushman equation, 581-582 
Ritz combination principle, 232 
Roentgen (unit), 644 
Röntgen rays (see X-rays) 
Rotation spectra, 500-505 
Rutherford, E., 207, 211, 221, 663, 666 
Rydberg constant, 231, 239 
Rydberg formula, 232 
Rydberg-Schuster law, 232 
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Satellite lines in x-rays, 422 
Scattering: 
of charged particles, 209 
neutron-proton, 680-681
nuclear reactions, 700 
Rutherford, 221-224 
x-ray: classical, 188-192 
Compton, 192-198 
Klein-Nishina, 638-640 
Schrödinger equations, 300-303, 308,
315, 329 
in spherical polar coordinates, 359 
(See also specific topics) 
Scintillation counter, 209, 656 
Secular equilibrium, 217 
Selection rules: 
atomic spectra, 433, 446 
harmonic oscillator, 345 
nuclear, 703, 717 
x-rays, 413, 414 
Zeeman effect, 466, 470, 474, 477 
Semiconductors, 573, 609-632 
extrinsic, 615-619 
intrinsic, 611-614 
p-n junction, 620-626 
transistor, 630-632 
Separation energy, nuclear, 675
Series, spectral, 230, 439-441 
hydrogen, 228, 239 
relation to terms, 232 
x-ray, 180-186, 413 
Shell model of nucleus, 733-738 
Shells and subshells, 385, 398, 399 
Silicon, 610, 615-619 
Simultaneity, 55 
Sodium, 569, 577 
charge cloud for, 394 
energy levels, 437 
spectrum, 439-441 
Zeeman effect for, 470, 472 
Sodium chloride crystal, 176, 180, 564 
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Solar cell, 625 
Sommerfeld, A., 246-248, 576 
Space and time, 47, 55-60, 73-76 
Space quantization, 368, 478-480 
Spark chamber, 658 
Specific heat (see Heat capacity) 
Specific ionization, 651 
Spectra, 230-233 
absorption, 502-503, 508 
alkali type of, 434-441
arc and spark, 239
hydrogen, 228-230 
from ionized atoms, 239 
molecular, 499 
electronic, 501, 514-518 
homonuclear, 550-556 
Raman, 518-522, 553 
rotation, 500-503 
vibration-rotation, 500, 503-510 
from one-electron atoms, 238-240 
two-electron type of, 441-443, 
449-452 
(See also Series; X-rays) 
Spin: 
electron, 243, 369, 405, 467
neutron and proton, 263 
nuclear, 481-485, 703 
Spin-orbit coupling: 
in central field, 371-373, 417 
on LS terms, 449 
in molecules, 511, 513 
in nucleus, 735 
in one-electron atoms, 371-373
Square well, 315, 328, 497-499, 591 
Stark effect, 461, 485
Stationary states, 234, 327
Statistical weight, 509, 510, 543, 551 
Statistics: 
Bose-Einstein, 537-539, 542, 549 
Fermi-Dirac, 539-543, 576
Maxwell-Boltzmann, 104-109, 529-
536, 541
of nuclei, 670
Stefan-Boltzmann law, 119, 138 
Stellar energy, 748-750 
Stenström’s formula, 429 (Prob. 11) 
Step barrier, 309, 313, 497-499, 591-
592

Stern-Gerlach experiment, 478-480 
Stopping coefficient, 649, 651, 652 
Streamer chamber, 658 
Synchrotron, synchrocyclotron, 698 


Terms, spectral, 233, 436-442, 449-451,
753-754 
(See also Energy levels; specific 
topics) 
Thermionic emission, 166, 167, 580-582
Thermoelectricity, 629-630 
Thermonuclear processes in stars, 749 
Thomas precession, 371, 379 (Prob. 20)
Thomson, J. J., 156, 188, 190, 209 
Time, 46
dilation of, 60 
proper, 77 
Transistor, 630-632 
Transition elements, 393, 605 
Transitions, 343, 347 
Einstein coefficients, 348, 349 
Transuranic elements, 740 
Triton, 700 
Tunnel diode, 625-626 
Tunnel effect, 313, 524, 625, 685 


Uncertainty principle, 296, 320, 460
Uranium, 204, 221 

fission of, 740-748

radioactive series, 219 


Valence band, 572, 609, 613 
Van de Graaff electrostatic accelerator, 694


Van der Waals attraction, 493, 563 

Velocities, transformation of, 61 

Vibration-rotation spectra, 500, 503- 
510 


Wave equation, Schrödinger, 300-
303, 308, 315, 329, 359 
(See also specific topics) 
Wave function, 290-300 
of electron with spin, 370, 401-408
orthogonality of, 342, 352 
symmetry of, 341, 403, 551
(See also Eigenfunctions; Wave 
mechanics; specific topics) 
Wave mechanics, 293-296, 307-369 
perturbation theory, 345-347, 370 
relativistic, 370 
significance of ψ in, 294
(See also Eigenfunctions; Quantum 
states; specific topics) 
Wave number ν̃, 230
Wave number (k = 2π/λ), 310, 591
Wave packet, 317, 320, 321 
Waves: 
electromagnetic, 65, 292 
(See also Radiation; X-rays)
in enclosure, 121-195 
matter, de Broglie, 278-290, 728 
refraction of, 280 
wavelength of, 285 
phonon, 547-549 
Widths: 
of atomic states, 460 
of nuclear states, 716-717 
of spectral lines, 458-461 
Wien's displacement law, 120, 146
Wilson cloud chamber, 657 


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X-rays, 171-199 


absorption of, 396-398, 635-644 
Auger effect in, 420 
Bragg's law for, 178 
characteristic, 181-185, 413-415 
discovery and properties of, 172-173 
energy levels for, 398, 412 
excitation of, 173, 415 
first-order or diagram lines, 414, 422 
fluorescent, 182, 183 
measurement of, 174 
photoelectric effect of, 399, 637 
reflection: from crystals, 176-178 
from ruled gratings, 427 
refraction of, 426 
satellite lines, 422-423
scattering of, 638-640 
classical, 188-192 
Compton, 192-198 
screening constants, 416-419 
spectra: characteristic, 181-185, 
413-415 
continuous, 185-187 
K, L, M, N lines, 182, 413, 414 
satellite (second-order) lines, 422 
of solids, 424-426 
spectrometer for, 179-181 
wavelengths of, 174-178 
X unit, 180 


Z (atomic number), 184, 213, 753-754 
Zeeman effect, 153, 463-478 

classical theory of, 153-155 

in huge field, 476-478 

normal triplet, 153, 465 

in strong field, 472-476 

in weak field, 469-471 
Zero-point energy, 131, 331, 340 


Physical Constants


Speed of light c = 2.997925 × 10⁸ m/s
Fundamental charge e = 1.60210 × 10⁻¹⁹ C
Avogadro’s number N_A = 6.02252 × 10²⁶ kmole⁻¹
Boltzmann’s constant k = 1.3805 × 10⁻²³ J/°K
= 8.6171 × 10⁻⁵ eV/°K
Planck’s constant h = 6.6256 × 10⁻³⁴ J·s
= 4.1356 × 10⁻¹⁵ eV·s
ħ = 1.05449 × 10⁻³⁴ J·s
= 6.5819 × 10⁻¹⁶ eV·s
Permittivity of free space ε₀ = 8.8543 × 10⁻¹² F/m
Coulomb’s law constant 1/4πε₀ = 8.9878 × 10⁹ m/F
Permeability of free space μ₀ = 4π × 10⁻⁷ H/m
Universal gas constant R = 8314.1 J/kmole-°K
Gravitational constant G = 6.670 × 10⁻¹¹ N·m²/kg²


Rest Masses 


Electron mₑ = 9.1091 × 10⁻³¹ kg = 5.486 × 10⁻⁴ u = 0.511006 MeV/c²
Proton Mₚ = 1.6725 × 10⁻²⁷ kg = 1.00727663 u = 938.256 MeV/c²
Neutron Mₙ = 1.6748 × 10⁻²⁷ kg = 1.0086634 u = 939.550 MeV/c²
u (amu) Mᵤ = 1.6604 × 10⁻²⁷ kg = 1 u = 931.478 MeV/c²




Conversion Factors 


Length
1 meter (m) = 39.37 in. = 3.281 ft = 6.214 × 10⁻⁴ mile = 10¹⁵ fermis
1 in. = 0.02540000 m; 1 ft = 0.3048 m; 1 mile = 1609 m
1 nautical mile = 1852 m = 1.1508 mile = 6076.10 ft
1 angstrom unit (Å) = 10⁻¹⁰ m; 1 micron (μ) = 10⁻⁶ m; 1 millimicron (mμ) = 10⁻⁹ m

Area
1 m² = 10.76 ft² = 1550 in.²; 1 hectare = 10⁴ m² = 2.471 acres
1 ft² = 929 cm²; 1 in.² = 6.452 cm² = 1.273 × 10⁶ circular mils

Volume
1 m³ = 35.31 ft³ = 6.102 × 10⁴ in.³
1 ft³ = 0.02832 m³; 1 U.S. gallon = 231 in.³; 1 liter = 1.000028 × 10⁻³ m³ = 61.02 in.³

Time and Frequency
1 year = 365.2422 days = 8.766 × 10³ hours = 5.259 × 10⁵ min = 3.156 × 10⁷ s
1 sidereal day (period of earth's revolution) = 86,164 s; 1 hertz (Hz) = 1 cycle/s


Speed 
1 m/s = 3.281 ft/s = 3.6 km/hour = 2.237 mi/hour = 1.944 knot


Mass
1 kg = 2.205 lb mass = 0.06852 slug
1 lb mass = 0.4536 kg = 0.03108 slug; 1 slug = 32.17 lb mass = 14.59 kg


Density
1 g/cm³ = 1000 kg/m³ = 62.43 lb mass/ft³ = 1.940 slug/ft³
1 lb mass/ft³ = 0.03108 slug/ft³ = 16.02 kg/m³ = 0.01602 g/cm³
Force
1 newton (N) = 10⁵ dynes = 0.1020 kg-wt = 0.2248 lb
1 lb (force) = 4.448 N = 0.4536 kg-wt = 32.17 poundals


Pressure
1 N/m² = 7.501 × 10⁻⁴ cm Hg = 4.015 × 10⁻³ in. H₂O = 10⁻⁵ bar
1 lb/in.² = 144 lb/ft² = 6895 N/m² = 5.171 cm Hg = 27.68 in. H₂O
1 atm = 406.8 in. H₂O = 76 cm Hg = 1.013 × 10⁵ N/m² = 10,330 kg-wt/m²
= 2116 lb/ft² = 14.70 lb/in.² = 760 torr


Energy
1 J = 0.2389 cal = 9.481 × 10⁻⁴ Btu = 0.7376 ft-lb = 10⁷ ergs = 6.242 × 10¹⁸ eV
1 kcal = 4186 J = 3.968 Btu = 3087 ft-lb; 1 eV = 1.602 × 10⁻¹⁹ J
1 kWh = 3.6 × 10⁶ J = 3413 Btu = 860.1 kcal = 1.341 hp-hr


Power
1 hp = 2545 Btu/hr
= 550 ft-lb/sec = 745.7 W = 0.1782 kcal/s
1 W = 2.389 × 10⁻⁴ kcal/s = 1.341 × 10⁻³ hp = 0.7376 ft-lb/s
Electric Charge
1 coulomb (C) = 0.1 abcoulomb (emu) = 2.998 × 10⁹ statcoulombs (esu)
1 faraday = 96,487 C; 1 electronic charge = 1.602 × 10⁻¹⁹ C
Electric Current
1 ampere (A) = 0.1 abampere (emu) = 2.998 × 10⁹ statamperes (esu)
Electric Potential Difference
1 abvolt (emu) = 10⁻⁸ V; 1 statvolt (esu) = 299.8 V


Capacitance
1 statfarad (esu) = 1 cm = 1.113 × 10⁻¹² F


Magnetic Field
1 N/A-m = 1 Wb/m² = 10,000 gauss = 10⁹ gamma
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